Exploratory Data Analysis Using SweetViz!
Rishabh Singh
GCP Certified Professional ML Engineer | Data, Generative AI, Data Scientist@LTIMindtree
SweetViz is an open-source Python library that produces beautiful, highly detailed visualizations to start the EDA. It also covers all the result of a normal Pandas df.describe() method and much more. The output is a simple html file that you can download and open/use at your own convenience
Features of SweetViz:
Target analysis(optional) :
Compare?:
Automatically detects numerical and categorical features
Association :
Statistical summary :
Getting Started:
Firstly we will install the SweetViz library:
!pip install sweetviz
Setting up Dependencies:
import?pandas?as?pd
import?seaborn?as?sns
import?sweetviz?as?sv
Loading the dataset, we will use planets dataset from the seaborn library:
#Loading?the?dataset
planets?=?sns.load_dataset('planets')
planets.head()
Lets analyze our dataset:
#?Analyzing?the?dataset
report?=?sv.analyze(planets)
#?Display?the?report
report.show_html('planets.html')
We can also explore the relation by clicking the Associations tab.
And its done. The EDA report is ready and it contains a lot of information for all the features. It is easy to understand and the report requires only a few lines of code.
Databricks Certified| GCP ACE Certified | Snowpro Core Certified| Data Engineer -Advanced Data Analytics at ATGEIR Solutions, Research Scholar at VTU
2 年Intresting
Software Developer at Trisys IT Services Pvt Ltd. 5-star Hackerrank
2 年Informative