Exploratory Data Analysis – Hydrogen Project Since year 2000
Sameer Shirur
Leading Technology Solutions | IISc | Data Science AI I EPC Digitalisation Expert | Industry4.0 Implementation to E&C | Author, Thought leader in EPC Process & Associated Industry |
Background for this article :
Sustainability and carbon capture , is the focus in today’s world, many govts and companies have committed to?net zero carbon emission and announcing large number of projects ; "where are we now towards net zero emission ? and?what is this ; tipping point that has caused massive attention green projects are taking? Pursuit of answers has led to this article.
This is an Exploratory Data Analysis on the secondary source of data obtained from IEA (International Energy Agency). This article will make an attempt to review project announced since 2000 by various countries.
Credit :
IEA for their wonderful work on collecting the huge data set, tracking the projects and most importantly keeping the data open. this data can be explored by data science enthusiast like me and have fun and share the learning within the community.?
Introduction to Hydrogen Data set:?Source : IEA
This data set covers all projects worldwide that have been commissioned since 2000 for the production of hydrogen for energy or climate change mitigation purposes, i.e., their objective is either to reduce the emissions associated with the production of hydrogen for existing applications, or the introduction of hydrogen as an energy carrier or industrial feedstock in new applications where it is not currently widely used, and has the potential to be a low-carbon?technology option. Projects in planning or construction are also included.
Projects are categorized by the production technology (electrolysis; fossil fuels with carbon capture, utilization and storage; other technologies), the hydrogen-based fuel produced (hydrogen, methanol, ammonia, methane, synthetic hydrocarbons) and the use of the fuel produced.?
Summary of data set
Data-set ~ 500MB,
1000 rows, more than 21 columns
Projects announced: 989
Countries?= ~ 73
Decommissioned project ~ 100
End use is projected in following sectors: Mobility, Power, Refining, Ammonia, Methanol, Other industries?
Tools Used for Data Analysis
Programming Language???????-?Python 3.0?
IDE????????????????????????????????????????- Anaconda Navigator and Jupiter Notebooks
Visualizations???????????????????????-?Matplotlib and Seaborn libraries?
Source Code Location:
EDA Steps followed?:
Data Visualization after EDA
Summary of?EDA process shows following insights :
Insights drawn from IEA Data analysis
( Source : https://www.iea.org/reports/global-hydrogen-review-2021)
IEA research has detail report on road map of net zero following are the key summary (please visit the source page for the details analysis)
Key Take-away of Data Analysis:
??if you are consultant : significantly involved with proven track record in water electrolysis and related business, boom is just start
if you are professional : this is where major economy is focusing , equip yourself skill that matter here
if you are owner/operator with lot money : this is where you should invest?
There are many more question that we can ask to the data and very interesting insights can be drawn out it. However , I have decided to pursue that some time later. if this interests you please contribute to the exercise and lets continue to learn together.
I hope you enjoyed as mush I did collecting, researching and compiling this article.
Happy Reading!!
Certified Independent Director with IICA with certification for ESG and digitalisation. Published Writer.
3 年Excellent work sameer
Product Owner | Digitalisation | Industry consulting |Blend of Industrial, Academic and R&D in multiple engineering domains | Expertise in Engineering Design Tools
3 年Nice piece and explained clearly. Keep going.
Quality Management System| Six Sigma Green Belt | Certified ISO 9K and ISO 14K Lead Auditor | Training
3 年A very insightful results i could see with the huge data sets..