课程: Complete Guide to AI and Data Science for SQL: From Beginner to Advanced

今天就学习课程吧!

今天就开通帐号,24,700 门业界名师课程任您挑!

Checking correlation after removing outliers

Checking correlation after removing outliers

- [Instructor] Okay, in your previous step, you examine the relationship between highway access and property tax using a scatterplot. You discovered that while there was a strong correlation, no clear trend was visible, possibly due to outliers. So, what do we do next? Let's find out in our 10th step here. Outliers, as we learned earlier, can disrupt our analysis. They are data points that don't quite follow the usual pattern and can lead to misleading results. In this step, you're taking a proactive approach by removing these outliers associated with high property tax rates. That is, over 600. This is the Python code that does that for you. With the outliers temporarily set aside, we're now going to calculate the correlation between property tax and highway access using a statistical method called Pearson's Correlation Coefficient. Now, Pearson's Correlation Coefficient is like a data detective. It tells us how two…

内容