课程: R Code Challenges: Data Science

今天就学习课程吧!

今天就开通帐号,24,700 门业界名师课程任您挑!

Solution: Join three datasets

Solution: Join three datasets

- [Instructor] Data scientists spend 80% of their time 2 cleaning and joining datasets. 3 The other 20% of the time 4 they spend complaining about cleaning data sets. 5 So, cleaning and joining data sets is very important 6 and that's what this challenge is all about. 7 I've given you three data sets that are related 8 and you are asked to come up with some conclusions 9 based on houseColor versus maximumIncome. 10 Comparing house color to income 11 is a somewhat silly analysis, 12 but it's a great way to demonstrate 13 how R can be used to identify these correlations. 14 So, let's take a look at how I solve this problem. 15 You were given a set of databases, 16 AcmeData and StarResearch, asked to join them, 17 and then set up categories that would identify house color 18 and then result in the maximum income for each house color. 19 In line 5, I jump right into merging 20 AcmeData_Demographic and AcmeData_Survey by surveyID.…

内容