Introduction to Spark SQL and DataFrames 预览

Introduction to Spark SQL and DataFrames

讲师: Dan Sullivan 2,384 位用户赞了
时长: 1 小时 54 分钟 技能水平: 中级 发布日期: 2019/5/30

课程详情

Explore DataFrames, a widely used data structure in Apache Spark. DataFrames allow Spark developers to perform common data operations, such as filtering and aggregation, as well as advanced data analysis on large collections of distributed data. With the addition of Spark SQL, developers have access to an even more popular and powerful query language than the built-in DataFrames API. In this course, instructor Dan Sullivan shows how to perform basic operations—loading, filtering, and aggregating data in DataFrames—with the API and SQL, as well as more advanced techniques that are easily performed in SQL. In this section of the course, Dan explains how to join data, eliminate duplicates, and deal with null or NA values. The lessons conclude with three in-depth examples of using DataFrames for data science: exploratory data analysis, time series analysis, and machine learning.

您将获得的技能

获取证书,展示成果

分享学到的内容,成为理想行业的达人,获取证书,展示您在课程中所学的知识。

证书范例

结业证书

  • 在领英档案中的“资格认证”版块下展示

  • 下载或打印为 PDF,与他人分享

  • 以图片形式在线分享,展现您的技能

了解讲师

学员评价

4.7 分,最高 5 分

529 条评分
  • 5 星
    当前数据: 401 76%
  • 4 星
    当前数据: 102 19%
  • 3 星
    当前数据: 15 3%
  • 2 星
    当前数据: 4 1%
  • 1 星
    当前数据: 7 1%

内容

课程内容

  • 边学边练 1 个练习文件
  • 知识测验 4 个测验
  • 随时随地学习 可在平板电脑和手机上访问

相似课程

下载课程

使用 iOS 或安卓版领英学习 APP,即可在移动设备上离线观看课程。