Introduction of Linear Regression - Machine Learning Model for Industry Use Cases ( both EPC and Process Industries)
Sameer Shirur
Leading Technology Solutions | IISc | Data Science AI I EPC Digitalisation Expert | Industry4.0 Implementation to E&C | Author, Thought leader in EPC Process & Associated Industry |
Hello everyone,
Background of the Article:
While I was exploring Machine Learning and mathematical models for industry use cases. I observed that there is a lot traction of in-process industries, especially in the area of Predictive Maintenance. However, which are the possible use cases of the Data-intensive and knowledge-intensive EPC industry?
To find the answers;?I have explored Linear Regression as a base for the mathematical model. Thru this article,
I am trying to dwell into:
Please refer to my earlier article “ Data Science and Process Associated Industry (EPC & Engg Consulting industries)" ”, where we have talked about data science use cases for the industry. This article takes to study a little deeper.
1. INTRODUCTION
What are Linear Regression and its practical use case?
In?statistics,?linear regression?is a?linear?approach for modelling the relationship between a?scalar?response and one or more explanatory variables (also known as?dependent and independent variables). The case of one explanatory variable is called?simple linear regression; for more than one, the process is called?multiple linear regression
Linear regression has many practical uses. Most applications fall into one of the following two broad categories:
Additional reference: (https://www.statisticshowto.com/probability-and-statistics/regression-analysis/find-a-linear-regression-equation/)
So how do we build simple linear regression machine learning model ;
In a simple terms ;?We study the existing variables and they underlying mean, median and standard deviation values; then transform the unseen data with study and predict the target variables. Finally, evaluate our model efficiency.?
Specific Use cases for Process Industries:?
Specific Use cases for EPCs
Beware: before we apply Linear Regression following should be checked:?
With these basics cleared let’s build a Linear Regression model for a similar data set and check the feasibility and get to know the hands-on implementation
Business case:?
Impact spending on adds TV, radio, and newspaper on sales; this is a small dataset (200 rows and 4 columns). which will be used to find the correlation between spending on sales and a linear developing simple regression model that can be used to predict the future data.
X = Independent Variables , Y = Dependent Variables ; X1 = Spending on TV, X2 = Spending on radio X3= Spending on newspapers, Y = Sales?
Programming Language : PYTHON 3.0
Libraries : PANDAS, NUMPY AND SKLEARN
Visualisations : MATPLOTLIB, SEABORN
IDE : JUPYTER: NOTEBOOK
Credit: The data set and LR model was introduced by INSAID as part of their curriculum, as the data set has relevance to EPC, I have chosen to use the same for my study.??
Building the Machine Learning Model :
领英推荐
A more simple definition of study and transformation of seen and unseen data: https://www.analyticsvidhya.com/blog/2021/04/sklearn-objects-fit-vs-transform-vs-fit_transform-vs-predict-in-scikit-learn/
Data Visualization as part of Model Development
Observations:
Points to note:
Idea is not to develop a robust model, but to give an idea to the readers how the overall process and the model ca look
Results :?
The developed model could successfully predict the Target or Dependent value on the unseen test data with a model efficiency of > 80%.?
Sample Feature Engineering is done by dropping the newspaper variable which gave a slight improvement in the overall performance of the model
KEY TAKE AWAY FROM THE EXERCISE
Machine learning model that can be developed for business benefits
There are already enough use cases and business cases that have been researched special in the area of Procurement, Construction, and Huge implementation that are ongoing for plant operators.?
EPC use cases need isolated and evaluated with a focus on Engineering Procurement and construction. Once the ML models are developed feature engineering can be applied to remove ineffective parameters and get great insights.
Considering the Digital Transformation focus by owner-operators, Contractors, and consultants being in proximity of customers, we are in a position to leverage this opportunity.?
Process Industry and Owner-operators and making progress; the big question is; are we (EPC companies) ready?
References:
Construction- Systems Implementation & Integration
3 年Any model needs quality data...rest is understood
Certified Independent Director with IICA with certification for ESG and digitalisation. Published Writer.
3 年One issue with EPC projects is that they are time bound. Use of ML and mathematical models should be applied in bid stage to manage risks associated with metal costs. Other data would be relevant for projects which are using same technology.
Certified Independent Director with IICA with certification for ESG and digitalisation. Published Writer.
3 年Any suggestions for maths books to brush up ?
Manager | Energy Transition & Decarbonization Expert | Ph.D. in Chemical Engineering | Driving Sustainable Solutions | Ex-thyssenkrupp
3 年Very well summed up Sameer Shirur! I'm resharing this.