Course: Cloud Hadoop: Scaling Apache Spark
Scale Spark on the cloud by example - Apache Spark Tutorial
Scale Spark on the cloud by example
- [Instructor] In this section, I'm going to take you through some work that my team did in collaboration with CSIRO Bioinformatics in Sydney, Australia, on moving to the cloud and scaling a real-world Spark workload. The use case is genomic analysis, or bioinformatics research, and there were several constraints for our customer here. They were research focused, and at the time we started they didn't have a dedicated DevOps or cloud person. And they really wanted to make their solution flexible enough to work across any cloud. So, as a starting point, they had written a library called VariantSpark, which runs on top of Spark and implements custom machine learning. We'll look at it in a little bit more detail in a minute. They wrote it in Scala and they had open sourced it on GitHub. When we first started working together, they were using it internally. They were using it on a shared Hadoop Spark cluster and their frustration…
Contents
- Scale Spark on the cloud by example (5 min 11 s)
- Build a quick start with Databricks AWS (6 min 50 s)
- Scale Spark cloud compute with VMs (6 min 16 s)
- Optimize cloud Spark virtual machines (6 min 5 s)
- Use AWS EKS containers and data lake (7 min 8 s)
- Optimize Spark cloud data tiers on Kubernetes (4 min 17 s)
- Build reproducible cloud infrastructure (8 min 37 s)
- Scale on GCP Dataproc or on Terra.bio (8 min 34 s)
- Serverless Spark with Dataproc Notebook (5 min 25 s)