登录查看更多内容

A Simple Machine Learning Example.

Shivek Maharaj

Data Analyst | Automation Architect | Business success doesn’t follow a blueprint, It follows me | AI Engineer

发布日期: 2024年2月8日

+ 关注

Hi, everyone. I hope you are all doing well.

This post will introduce us to a simple machine learning example.

To increase your level of relatability, a very interesting point I should mention before we look at our example is: Thinking back at your days in school. Do you remember your first mathematics class in which you were introduced to drawing a line of best fit across the data points scattered on a scatter plot? This line of best fit required that the number of data points located above the line was equal to the number of data points below the line.

Believe it or not- What your mathematics teacher did not tell you was that the actions you were performing during that very moment was a simple form or type of Machine Learning technique. It is called Linear Regression.

Now, let’s move on to our example, keeping in mind the teachings learned in school.

1. THE DATA PREPARATION PHASE.

Lets say we have a simple dataset that comprises two features- Crime Rate and House Price- Our data is in good condition and needs no further actions to be taken for preparation.

We would like to predict the value of House Price therefore making it our Target Vector and dependent variable. As we know, the dependent variable will be found on the Y-Axis.

We will use the remaining features, namely, Crime Rate to make the predictions, therefore making it our Features Matrix, and independent variable, which will place it along the X-Axis.

To gain further insight into our Data, let us plot a visualization of the data, as seen below:

As can be seen from the visualization, we have a scatter plot showing us the position of the data points relative to each other.

Looking at the graph and data points, one can anticipate that there is a direct relationship between Crime Rate and House Price. Because as X increases, so does Y.

2. THE ALGORITHM SELECTION PHASE.

In our particular problem, we wish to predict House Prices. The House Price feature takes the form of Numerical Data, i.e., Quantitative values. Go back one slide, and look at the data, focusing on the House Price (R) column. Quantitative or Numerical Data will tell us the Amount, Quantity, or Measurement of something.

Notice, these values are continuous- There is no fixed, or predefined range within with the values can belong. That is why they are called Continuous. Now to predict values of this nature, we require a method or algorithm that will be able to learn the trend in the dataset once-off, and will allow us to use the model for making future predictions pertaining to House Prices.

The method/algorithm we would select will be The Straight Line Graph- or Linear Regression. It has the following formula:

Y=MX+C

Let us analyse this formula.

领英推荐

Introduction to Simple Linear Regression in Machine…

Learnbay 2 年前

Data Science Notes - Part 2

ARNAB MUKHERJEE ???? 1 年前

Automating Machine Learning (AutoML) Selection…

Kai R. Larsen 6 年前

3. THE MODEL COMPILATION PHASE.

Now that we have effectively selected the algorithm to utilize, we may proceed to expose the Algorithm to the data, in order to obtain a successful powerful model.

When an algorithm is applied to data, the process of forming the model involves mathematics and statistics. The model needs to carefully choose and select the best coefficients to make use of in the formula.

One must understand that the model’s digital compilation and selection of coefficients that occurs during the Machine Learning process, tends to happen in backend processes that cannot be seen by the human being. However, when brought forward into the light, each of these intrinsic processes can be expressed mathematically in the form of formulae and equations, and one is able to see exactly how the model has reached those conclusions.

Y = MX + C

Where:

Y is the value for the Y variable, in our case House Price.
X is the value for the X variable, in our case Crime Rate.
M is the gradient of the graph.
C is the y-value at the point the gradient intercepts the y-axis. It is the y-intercept.

Upon doing the mathematics manually, one will find:

M = (y_2 – y_1)/(x_2- x_1 ) (two data points need to be substituted into the respective placeholders)
To calculate the value of C, one needs to make it the subject of the formula by substituting a third data point into the equation.

# Let us proceed to calculate the equation.

4. THE EVALUATION/PREDICTION STAGE.

Now that we have successfully compiled a model, we may proceed to evaluate its predictive power, and use it to make predictions for future scenarios.

Suppose we have a community in which the Crime Rate is 70(%). We wish to know the approximate Price of a House based on this figure. With Machine Learning, making a future prediction becomes simple- We simply make use of the model.

Predicting the House Price for a 70% Crime Rate:

= 1000x

House Price = 1000(70)

= 70000 (R)

Given the fact that I have used a simple example scenario to work with, one is able to see that the models prediction is 100% accurate, if the data follows a continuous trend as we have seen in the table and scatter plot. We may confidently say that a Crime Rate of 70% will cause a House Price to be R70000. Hence, we can assume that our model has achieved 100% accuracy in it’s predictions.

The reason as to why I call this stage Evaluation/Prediction, is because I believe that it is good practice to evaluate model performance before attempting to make a prediction. This gives us insight into our models margin of error– thereby notifying us of: up to what level or degree our model could be incorrect in it’s prediction.

I do hope that you have obtained a good understanding of the general Machine Learning Framework.

Thank you for your time.

Shivek Maharaj

Data & Analytics

1 年

Fantastic explanation of linear regression in machine learning! This post truly brings back memories. ??

1 次回应

Marc Castricum

1 年

I love how you broke down the complex concept of machine learning into a simple example. Thank you! ??

1 次回应

查看更多评论

要查看或添加评论，请登录

Shivek Maharaj的更多文章

Measuring The Clustering Performance

2024年3月12日

Measuring The Clustering Performance

Real-world data are not inherently grouped into several separate groupings. This makes it difficult to visualize and…
Unsupervised Machine Learning With Python: Clustering. Mean Shift Algorithm

2024年3月11日

Unsupervised Machine Learning With Python: Clustering. Mean Shift Algorithm

It is yet another well-liked and effective clustering method applied in unsupervised learning. It is a non-parametric…

1 条评论
Unsupervised Machine Learning With Python: Clustering. K-Means Clustering

2024年3月10日

Unsupervised Machine Learning With Python: Clustering. K-Means Clustering

The next few posts that we look at will explain a few of the many various clustering algorithms that are available for…

2 条评论
Unsupervised Machine Learning With Python: Clustering

2024年3月9日

Unsupervised Machine Learning With Python: Clustering

Machine learning algorithms that are unsupervised lack a supervisor to offer any kind of direction. They closely…

3 条评论
Artificial Intelligence With Python: Logic Programming- Part 2 (Examples)

2024年3月8日

Artificial Intelligence With Python: Logic Programming- Part 2 (Examples)

Hi, everyone! I hope you are all doing well. This article will demonstrate to us a few examples of Logic Programming…

8 条评论
Artificial Intelligence With Python: Logic Programming- Part 1

2024年3月7日

Artificial Intelligence With Python: Logic Programming- Part 1

Hi, everyone. I hope you are all doing well.
Supervised Machine Learning With Python: Regression. Simple Linear Regression

2024年3月6日

Supervised Machine Learning With Python: Regression. Simple Linear Regression

One of the most crucial statistical and machine learning tools is regression. Regression serves as the starting point…

1 条评论
Supervised Machine Learning With Python: Classification: Ensemble Techniques

2024年3月5日

Supervised Machine Learning With Python: Classification: Ensemble Techniques

In essence, this approach is used to adapt current classification algorithms to fit imbalanced data sets. We build…

3 条评论
The Class Imbalance Problem

2024年3月4日

The Class Imbalance Problem

When there are significantly fewer observations in one class than in the other classes, this is referred to as a class…

2 条评论
Evaluating The Performance Of Classification Models

2024年3月3日

Evaluating The Performance Of Classification Models

We need to evaluate the model’s performance after deploying a machine learning method. Datasets and metrics may serve…

3 条评论

See all articles

A Simple Machine Learning Example.

Shivek Maharaj

Data Analyst | Automation Architect | Business success doesn’t follow a blueprint, It follows me | AI Engineer

1. THE DATA PREPARATION PHASE.

2. THE ALGORITHM SELECTION PHASE.

Y=MX+C

领英推荐

3. THE MODEL COMPILATION PHASE.

Y = MX + C

4. THE EVALUATION/PREDICTION STAGE.

Shivek Maharaj的更多文章

社区洞察

其他会员也浏览了

Machine Learning - Cross Validation

Some Statistical Operations For Machine Learning

Why Big Data And Machine Learning Are Important In Our Society

Basics of Machine Learning

The Mathematical Backbone of Machine Learning: Why Math Matters When Building Algorithms

List of Top 10 Algorithms Used in Machine Learning Models

Boosting Techniques Battle: CatBoost vs XGBoost vs LightGBM vs scikit-learn GradientBoosting vs Hierarchical GB

Balancing the Scales : Handling Class Imbalance

Part 2 - Keep it Simple : Machine Learning & Algorithms for Big Boys

Predictive Analytics and Machine Learning: Discovering the Likelihood of a Future Outcome

1. THE DATA PREPARATION PHASE.

2. THE ALGORITHM SELECTION PHASE.

Y=MX+C

领英推荐

3. THE MODEL COMPILATION PHASE.

Y = MX + C

4. THE EVALUATION/PREDICTION STAGE.

Shivek Maharaj的更多文章

Measuring The Clustering Performance

Unsupervised Machine Learning With Python: Clustering. Mean Shift Algorithm

Unsupervised Machine Learning With Python: Clustering. K-Means Clustering

Unsupervised Machine Learning With Python: Clustering

Artificial Intelligence With Python: Logic Programming- Part 2 (Examples)

Artificial Intelligence With Python: Logic Programming- Part 1

Supervised Machine Learning With Python: Regression. Simple Linear Regression

Supervised Machine Learning With Python: Classification: Ensemble Techniques

The Class Imbalance Problem

Evaluating The Performance Of Classification Models

社区洞察

其他会员也浏览了

Machine Learning - Cross Validation

Some Statistical Operations For Machine Learning

Why Big Data And Machine Learning Are Important In Our Society

Basics of Machine Learning

The Mathematical Backbone of Machine Learning: Why Math Matters When Building Algorithms

List of Top 10 Algorithms Used in Machine Learning Models

Boosting Techniques Battle: CatBoost vs XGBoost vs LightGBM vs scikit-learn GradientBoosting vs Hierarchical GB

Balancing the Scales : Handling Class Imbalance

Part 2 - Keep it Simple : Machine Learning & Algorithms for Big Boys

Predictive Analytics and Machine Learning: Discovering the Likelihood of a Future Outcome