Data cleaning with Numpy and Pandas

Data cleaning with Numpy and Pandas

Proper data cleaning is the “secret” sauce behind machine learning. Better data beats fancier algorithms…

Garbage in = Garbage out. Plain and Simple! Better data beats fancier algorithms…

Thanks to realpython.org that i could get my hands dirty cleaning the data. Below are a highlights of the exercise.

Here is the detailed code on Git:

Dropping unnecessary columns in a DataFrame

Changing the Index of a DataFrame

Tidying up Fields in the Data

Combining str Methods with NumPy to Clean Columns

Cleaning the Entire Dataset Using the applymap Function

Renaming columns and skipping rows




要查看或添加评论,请登录

MANIDIPA CHAKRAVARTI的更多文章

社区洞察

其他会员也浏览了