Data Visualization (Matplotlib + Pandas) using Python (Jupyter Notebook)



Jupyter:

Jupyter is an open-source, web-based interactive computing platform that allows you to create and share documents containing live code, equations, visualizations, and narrative text. It's widely used for data analysis, scientific research, and machine learning tasks. (Source: jupyter.org)

Source : https://github.com/jayamoorthi/JupiterPython

Key Features of Jupyter:

  • Interactive Notebooks: Combine code execution, text, and visualizations in a single document.
  • Support for Multiple Languages: While primarily used with Python, Jupyter also supports languages like R, Julia, and Scala through the use of different kernels.
  • Extensibility: Enhance functionality with a wide range of plugins and extensions.


Getting Started with Jupyter:

To begin using Jupyter, you can choose from the following installation methods:

  1. Using Anaconda:

What is Anaconda? Anaconda is a free, open-source distribution of Python and R for scientific computing and data science. It simplifies package management and deployment.

Installation Steps:

  • Download the Anaconda installer suitable for your operating system from the Anaconda Distribution page.
  • Run the installer and follow the on-screen instructions.
  • After installation, open the Anaconda Prompt (Windows) or terminal (macOS/Linux).
  • Launch Jupyter Notebook by typing:

Terminal 

jupyter notebook        

This command will start the Jupyter Notebook server and open the notebook interface in your default web browser.
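
If port 8888 is already in use, the server can be started on a different port; --port and --no-browser are standard options of the jupyter notebook command:

Terminal

jupyter notebook --port 8889 --no-browser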

2. Using pip:

Installation Steps:

  • Ensure that Python and pip are installed on your system. You can download Python from the official Python website.
  • Open your terminal or command prompt.
  • Install Jupyter Notebook using pip:

pip install notebook        

  • After installation, start the Jupyter Notebook server by typing:

jupyter notebook        

The Jupyter server starts and redirects your browser to the home page, typically http://localhost:8888/tree.


Let's start by creating a new project folder for Data Visualization.

Setting up Jupyter Notebook within Visual Studio Code (VS Code) using a virtual environment is an excellent way to manage project dependencies and maintain an organized development environment. Here's a step-by-step guide to help you through the process:

Step 1: Create a New Project Folder in VS Code

  1. Open VS Code: Launch Visual Studio Code on your computer.
  2. Create a New Folder: In the VS Code menu, click on File > Open Folder.... Choose a location on your system and create a new folder for your project. Open this folder in VS Code.

Step 2: Set Up a Virtual Environment

  1. Open the Terminal: In VS Code, open the integrated terminal by selecting Terminal > New Terminal from the menu.
  2. Create the Virtual Environment: In the terminal, run the following command to create a virtual environment named myenv (on Windows, use python instead of python3):

python3 -m venv myenv        

3. Activate the Virtual Environment:

  • Activate the virtual environment using the appropriate command for your operating system:

Windows:

.\myenv\Scripts\activate

macOS/Linux:

source myenv/bin/activate


After activation, your terminal prompt should change to indicate that you're now working within the myenv environment.
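
To double-check the activation, you can ask the shell which Python interpreter it will run; the printed path should point inside the myenv folder:

Windows:

where python

macOS/Linux:

which python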


Step 3: Install Jupyter Notebook Using pip (Method 2)

  1. Install Jupyter: With the virtual environment activated, install Jupyter Notebook by running:

pip install notebook
        

After installation, start the Jupyter Notebook server by typing:

jupyter notebook

This command will start the server and automatically open the Jupyter Notebook interface in your default web browser, typically accessible at http://localhost:8888/tree.

Step 4: Create and Manage Notebooks in VS Code

  1. Create a New Jupyter Notebook: In VS Code, open the Command Palette (View > Command Palette...), type Jupyter: Create New Blank Notebook, and select it. When prompted, choose the Python interpreter associated with your myenv virtual environment (see the quick check after this list).
  2. Rename the Notebook: Click on the notebook's name at the top (e.g., "Untitled.ipynb") to rename it as desired.
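
As a quick, optional check that the notebook really runs on the myenv interpreter, you can execute a cell like this (the printed path will differ on your machine but should contain myenv):

import sys

# The interpreter path should point inside the myenv virtual environment
print(sys.executable)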

Step 5: Install Additional Libraries (e.g., pandas)

Install pandas and NumPy: Within the Jupyter notebook, in a new cell, run the %pip magic so the packages are installed into the notebook's kernel environment:

%pip install pandas numpy


Click the Run (play) button to execute the cell; the installation output appears below the cell.
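
Once the install cell has finished, a small follow-up cell can confirm the versions that ended up in the environment (nothing here is specific to this project):

import pandas as pd
import numpy as np

# Confirm both libraries import correctly and show their versions
print('pandas:', pd.__version__)
print('numpy:', np.__version__)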

Step 6: Create a Dataset, Save It as a CSV File Using NumPy for Data Analytics


import pandas as pd
import numpy as np

# Set number of rows
num_rows = 1000
state_list = ['Tamil Nadu', 'Karnataka', 'Kerala', 'Andhra Pradesh', 'Delhi', 'Uttar Pradesh', 'Maharashtra']

# Generate data (seeded for reproducibility)
np.random.seed(42)
ids = np.arange(1, num_rows + 1)
ages = np.random.randint(18, 80, size=num_rows)
incomes = np.random.normal(loc=50000, scale=15000, size=num_rows).astype(int)
credit_scores = np.random.randint(300, 850, size=num_rows)
loan_amounts = (incomes * 0.2).astype(int)
states = np.random.choice(state_list, size=num_rows)  # one state per row

# Create DataFrame
df = pd.DataFrame({
    'ID': ids,
    'Ages': ages,
    'Income': incomes,
    'Credit_Score': credit_scores,
    'Loan_Amount': loan_amounts,
    'State': states
})

# Save to CSV
df.to_csv('synthetic_data.csv', index=False)
print('dataset created and saved as synthetic_data.csv successfully')

Running the above code prints "dataset created and saved as synthetic_data.csv successfully".
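
Before loading the file back, a quick look at the in-memory DataFrame helps confirm the columns and values look sensible; these are standard pandas inspection methods, not specific to this dataset:

# Peek at the generated data
print(df.head())       # first five rows
df.info()              # column names, dtypes, non-null counts
print(df.describe())   # summary statistics for the numeric columns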

Load the CSV data from the file and print it:

# Step 7: Load CSV data from file and print it
df = pd.read_csv('synthetic_data.csv')

print(df)


# Step 8: Data Analysis (group and aggregate)

grouped = df.groupby('Ages')['Loan_Amount'].sum()
print(grouped)        
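
The same groupby pattern works for any column; for example, grouping by State (one of the generated columns) and computing several aggregates in one call:

# Row count, average and total loan amount per state
by_state = df.groupby('State')['Loan_Amount'].agg(['count', 'mean', 'sum'])
print(by_state)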

# Step 9: Data visualization using Matplotlib

%pip install matplotlib


Create a data visualization using a line chart:

import matplotlib.pyplot as plt
import numpy as np

# Group by 'Ages' and sum 'Loan_Amount'
grouped = df.groupby('Ages')['Loan_Amount'].sum()


# Plot the grouped data as a line chart
grouped.plot(kind='line', marker='o', linestyle='-', color='b', title='Loan Amount by Age Group')

# Set labels for the axes
plt.xlabel('Ages')
plt.ylabel('Total Loan Amount')

# Display the plot
plt.show()        
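
As a variation on the same idea, a bar chart is often easier to read when the grouping key is categorical; the sketch below plots the total loan amount per State from the same DataFrame:

import matplotlib.pyplot as plt

# Total loan amount per state, drawn as a bar chart
by_state = df.groupby('State')['Loan_Amount'].sum()
by_state.plot(kind='bar', color='teal', title='Total Loan Amount by State')

plt.xlabel('State')
plt.ylabel('Total Loan Amount')
plt.tight_layout()
plt.show()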



Create a data visualization using a pie chart:


import matplotlib.pyplot as plt
import numpy as np

# Group by 'Ages' and sum 'Loan_Amount'
grouped = df.groupby('Ages')['Loan_Amount'].sum()

# Plot the grouped data as a pie chart
grouped.plot(kind='pie', autopct='%1.1f%%', startangle=90, figsize=(8, 8))

# Add a title to the pie chart
plt.title('Total Loan Amount by Age Group')

# Display the plot
plt.show()        
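
With many distinct ages, the pie slices get crowded; an optional refinement (not part of the original walkthrough) is to bin ages into ranges with pd.cut before plotting:

import pandas as pd
import matplotlib.pyplot as plt

# Bin ages into ranges, then total the loan amounts per bin
bins = [18, 30, 40, 50, 60, 80]
labels = ['18-29', '30-39', '40-49', '50-59', '60-79']
df['Age_Group'] = pd.cut(df['Ages'], bins=bins, labels=labels, right=False)
grouped_bins = df.groupby('Age_Group', observed=True)['Loan_Amount'].sum()

grouped_bins.plot(kind='pie', autopct='%1.1f%%', startangle=90, figsize=(6, 6))
plt.title('Total Loan Amount by Age Group (binned)')
plt.ylabel('')  # hide the default axis label on pie charts
plt.show()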






Conclusion:

Data Visualization Libraries:

In the realm of data visualization, several libraries and tools cater to diverse needs:

  • Matplotlib: A versatile Python library for creating static, animated, and interactive 2D plots. It's known for its flexibility and extensive customization options.
  • Seaborn: Built on top of Matplotlib, Seaborn offers a high-level interface for drawing attractive, informative statistical graphics (see the short sketch after this list).
  • Tableau: A powerful data visualization tool that enables users to create interactive and shareable dashboards. It's known for its user-friendly interface and ability to handle large datasets efficiently.
  • Power BI: Developed by Microsoft, Power BI is a business analytics tool that provides interactive visualizations and business intelligence capabilities with an interface simple enough for end users to create their own reports and dashboards.
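
As a tiny illustration of the Seaborn bullet above (assuming seaborn has been installed, e.g. with %pip install seaborn), the same synthetic DataFrame can be plotted with a single high-level call:

import seaborn as sns
import matplotlib.pyplot as plt

# Income distribution from the synthetic dataset
sns.histplot(data=df, x='Income', bins=30)
plt.title('Income Distribution (Seaborn)')
plt.show()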

Learning Outcomes:

  • Library Installation: Utilizing Jupyter Notebook's capabilities to install essential libraries like pandas for data manipulation and analysis.
  • Data Creation and Loading: Generating synthetic data and loading it into pandas DataFrames for structured analysis.
  • Data Analysis: Performing grouping and aggregation operations to derive meaningful insights from the data.
  • Data Visualization: Employing Matplotlib to create various plots, enhancing the interpretability of data insights.

This integrated approach demonstrates the effectiveness of using Jupyter Notebook alongside powerful libraries and tools to conduct comprehensive data analysis and visualization tasks.

Thanks for visiting my post; see you next week.

