A glimpse of Machine Learning

Machine learning is the process of getting a computer to imitate human learning with the help of algorithms. In simple terms, it copies the way humans learn a task.

The algorithm used for this process is known as a machine learning algorithm.

Consider a person learning a language: they first try to understand the basics and then, with repeated practice, master it. Likewise, a machine learning algorithm learns a task by copying this procedure, improving as it sees more examples.

Machine learning algorithms are mainly divided into four types:

Fig 1.0 - Classification of machine learning algorithm

To understand this in more detail, let's first look at the Excel sheet below.

Photo 1: Input and Output

As one can see, the sheet is divided into two parts: Input and Output.

Input is a factor that may or may not affect the output. In the above example, age, education, capital, and hours per week are the factors that may affect an individual's income, which is the output.
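The split between input and output can be sketched in a few lines of Python. This is only an illustration: the rows, the column names, and the income labels below are invented stand-ins for the sheet in Photo 1, and `split_input_output` is a hypothetical helper, not part of any library.

```python
# Invented rows loosely modelled on the sheet in Photo 1 (values are assumptions).
records = [
    {"age": 39, "education": "Bachelors", "capital": 2174, "hours_per_week": 40, "income": "<=50K"},
    {"age": 52, "education": "Masters", "capital": 15024, "hours_per_week": 45, "income": ">50K"},
    {"age": 28, "education": "HS-grad", "capital": 0, "hours_per_week": 38, "income": "<=50K"},
]

INPUT_COLUMNS = ["age", "education", "capital", "hours_per_week"]
OUTPUT_COLUMN = "income"


def split_input_output(rows):
    """Return (X, y): the input values of every row and the matching outputs."""
    X = [[row[col] for col in INPUT_COLUMNS] for row in rows]
    y = [row[OUTPUT_COLUMN] for row in rows]
    return X, y


X, y = split_input_output(records)
print(X[0])  # [39, 'Bachelors', 2174, 40]
print(y)     # ['<=50K', '>50K', '<=50K']
```

Each entry of `X` is one row of input factors, and the entry of `y` at the same position is the output for that row.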

Generally in machine learning, we provide data (including input and output), the machine learns from these examples, and it finally produces output for unknown data. The different types of machine learning algorithms are as follows:

Supervised Learning: In this type of learning, both input (X) and output (Y) are provided to the model, which learns from them to predict the output for unknown data.

Unsupervised Learning: Here the data consists of only input (X) and no output variable (Y); the machine learns by grouping similar data together.

Semi-supervised Learning: In this type of learning, only some of the input (X) data has a corresponding output (Y).

Deep Learning: This is a subset of machine learning in which data is passed through several layers of a neural network. For example, a mixed dataset of images of cats and dogs is provided as input, and the trained network classifies each image as a cat or a dog.
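To make the idea of learning by grouping concrete, here is a minimal sketch of unsupervised learning: a basic k-means loop with two groups on one-dimensional data. The function name `two_means_1d` and the hours-per-week values are assumptions made up for this illustration. Note that no output (Y) is given anywhere; the groups come from the input alone.

```python
def two_means_1d(values, iterations=10):
    """Group 1-D values into two clusters with a basic k-means loop (k = 2)."""
    # Start with the smallest and largest value as the two cluster centers.
    centers = [min(values), max(values)]
    groups = [[], []]
    for _ in range(iterations):
        groups = [[], []]
        for v in values:
            # Assign each value to the nearer center.
            nearest = 0 if abs(v - centers[0]) <= abs(v - centers[1]) else 1
            groups[nearest].append(v)
        # Move each center to the mean of its group (keep it if the group is empty).
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return groups


# Invented input only: hours worked per week, with no income labels attached.
hours = [20, 22, 25, 58, 60, 65]
low, high = two_means_1d(hours)
print(low)   # [20, 22, 25]
print(high)  # [58, 60, 65]
```

A supervised algorithm would receive an income label with every value of `hours`; here the machine has to discover the two groups by itself.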

A dataset in supervised learning can be classified into two types: regression type and classification type.

A dataset with continuous values in the output falls into the regression category, whereas a dataset with discrete labels in the output is categorized as classification type. In Photo 1 above (Input and Output), the output is divided into income of more than 50K and income of less than 50K, hence it is a classification-type dataset. House prices are a great example of a regression-type dataset.

Different types of classification and regression algorithms:

Classification algorithms:

  • Naive Bayes 
  • Logistic regression
  • Decision Tree  
  • Random Forest
  • K-Nearest Neighbours
  • SVMs
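As an illustration of one algorithm from this list, the sketch below implements K-Nearest Neighbours in plain Python: a new point receives the label held by the majority of its k closest training points. The function `knn_predict`, the two input columns (age and hours per week), and all data values are assumptions made up for this example.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest training points."""
    # Pair every training point's Euclidean distance to the query with its label.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Invented training data: [age, hours per week] -> income class.
train_X = [[25, 30], [30, 35], [28, 40], [45, 50], [50, 55], [48, 60]]
train_y = ["<=50K", "<=50K", "<=50K", ">50K", ">50K", ">50K"]

print(knn_predict(train_X, train_y, [47, 52]))  # >50K
print(knn_predict(train_X, train_y, [27, 33]))  # <=50K
```

The output is one of a fixed set of labels, which is what makes this a classification task.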

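Regression algorithms:

  • Simple Linear Regression
  • Multiple Linear Regression
  • Ridge Regression
  • Lasso Regression

Despite its name, logistic regression predicts discrete classes, so it belongs with the classification algorithms above rather than in this list.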
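Simple linear regression, the first entry in the regression list, can be sketched with the ordinary least-squares formulas for one input: the slope is the covariance of x and y divided by the variance of x, and the intercept follows from the means. The function name and the house sizes and prices below are invented for illustration; the prices were chosen to lie exactly on a straight line so the result is easy to check.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both without the 1/n factor).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Invented data: house size (square metres) -> price (in thousands).
sizes = [50, 70, 90, 110]
prices = [150, 210, 270, 330]

slope, intercept = fit_simple_linear_regression(sizes, prices)
predicted_price = slope * 100 + intercept
print(slope, intercept)   # 3.0 0.0
print(predicted_price)    # 300.0
```

Unlike the classification case, the prediction is a continuous value rather than a label.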

**Continuous data – data that can take any value within a range. Example: body weight

**Discrete data – data that takes only a limited set of distinct values. It can also be categorical. Example: male and female

Summary:

Machine learning aims to obtain the maximum possible accuracy from the data provided.

Just as humans master an art through practice, machines learn from experience, that is, from data. Algorithms develop several models, and the model with the maximum accuracy is considered the best model.
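Picking the model with the maximum accuracy can be sketched as follows. The candidate "models" here are deliberately trivial threshold rules on hours per week, and the data, names, and thresholds are all invented for illustration. In practice, accuracy would normally be measured on data the models did not see during training.

```python
def accuracy(predictions, actual):
    """Fraction of predictions that match the actual outputs."""
    correct = sum(p == a for p, a in zip(predictions, actual))
    return correct / len(actual)


def make_threshold_model(threshold):
    """Hypothetical model: predict '>50K' when hours per week reach the threshold."""
    return lambda hours: ">50K" if hours >= threshold else "<=50K"


# Three candidate models that differ only in their threshold.
candidates = {f"threshold_{t}": make_threshold_model(t) for t in (30, 45, 60)}

# Invented evaluation data: hours per week and the known income class.
hours = [20, 35, 40, 50, 55, 65]
income = ["<=50K", "<=50K", "<=50K", ">50K", ">50K", ">50K"]

scores = {
    name: accuracy([model(h) for h in hours], income)
    for name, model in candidates.items()
}
best = max(scores, key=scores.get)
print(best, scores[best])  # threshold_45 1.0
```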

Link - https://analyticaluniverse.blogspot.com/2021/06/a-glimpse-of-machine-learning.html