课程详情
Data integrity is the new focal point of the data science revolution. Now that everybody is onboard with the role of data in people's lives and business, it's not an unfair question to ask, "Can you prove that your data is accurate?" In this course, you can learn how to identify and address many of the data integrity issues facing modern data scientists, using R and the tidyverse. Discover how to handle missing values and duplicated data. Find out how to convert data between different units and tackle poorly formatted text. Plus, learn how to detect outliers, address structural issues, and identify red flags that indicate potential data quality issues.
Where possible, instructor Mike Chapple shows how to correct the issues using R, but the same principles can be applied to any statistical programing language.
您将获得的技能
获取证书,展示成果
分享学到的内容,成为理想行业的达人,获取证书,展示您在课程中所学的知识。
领英学习
结业证书
-
在领英档案中的“资格认证”版块下展示
-
下载或打印为 PDF,与他人分享
-
以图片形式在线分享,展现您的技能
了解讲师
学员评价
-
-
Kaja Nagarjuna
Kaja Nagarjuna
|| Data Science || Machine Learning & Statistics || SQL || Python || R Programming|Power BI.
-
内容
课程内容
- 边学边练 1 个练习文件
- 知识测验 6 个测验
- 随时随地学习 可在平板电脑和手机上访问