AutoML - first glance

"Machine Learning and AI attempts to automate manual work...
AutoML attempts to automate Machine Learning process."

Machine Learning appears very fascinating to most of the new comes, but it involves a lot of dirty job inherent in it.

For example if we talk from very high level (1000ft above) the machine learning process would have following steps

  • Collecting the data/information from various sources
  • EDA , cleaning the data, understanding the data
  • Selecting the right family of model
  • Fitting model, predicting and testing it
  • Validating your model
  • And finally deploying the model as a complete solution

 The ML aspirants/engineers find the job of making model, predictions and further steps very fascinating. But in my opinion that's smaller chunk of the real work one has to do. The main work lies in analyzing the data i.e. gathering, cleaning and understanding the data. Which eats up majority of the time and this requires real data science skills.

Many of the steps in process are iterative and you wouldn't even know if the approach you have chosen will work unless you have wasted lot of your time already.

These tasks are very complex for the non-ML experts and appear very daunting for beginners.

So that gives an 'opportunity' to have one OTS easy to use software/library which would help any beginner to achieve the goal in few lines of code. The AutoML comes into the rescue here.

AutoML helps in building a complete pipeline from raw data to deployable machine learning models in few lines of the code.

AutoML in action

So essentially what you see in the black box is all taken care by the AutoML which is the heart of the machine learning. In some tools optionally you should be able to provide Optimization metrics also along with some constraints.

It also gives you ,

  • Simplicity : You don’t have to go deep into the model complexity and algorithm details.
  • OTS solution : It gives you OTS solution which is easy to code and deploy
  • Robustness : Provide the raw data, and that's it.
  • Fault tolerance : Some tool provide you ability to resume if interrupted.

 All the big players like Google and Amazon have already started providing AutoML solutions as

No alt text provided for this image

 

Cloud AutoML

 


No alt text provided for this image

 

Sagemaker, AutoGluon

 

Image Courtesy: Google

For me, AutoML has recently emerged and still under evaluation. Organizations have started using it but at a very low scale right now. This has to mature in a 'true machine learning' way by going through data and experience. So let's wait and watch.

Raja Saurabh Tiwari

 

 

 

要查看或添加评论,请登录

Raja Saurabh Tiwari的更多文章

  • The Hidden Cost of AI

    The Hidden Cost of AI

    Artificial Intelligence (AI) is revolutionizing industries, enhancing automation, and creating new possibilities for…

    3 条评论
  • Agentic AI - My take

    Agentic AI - My take

    Introduction In recent months, Agentic AI has emerged as a focal point in the technology sector, captivating both…

    16 条评论
  • Large Language Models vs Small Language Models

    Large Language Models vs Small Language Models

    Before directly jumping to LLM, a quick recap on AI and Machine Learning. We all have been seeing the below image which…

    2 条评论
  • So what makes a good data science profile

    So what makes a good data science profile

    Let's start with some stats Data science was named the fastest-growing job in 2017 by LinkedIn, and in 2018 Glassdoor…

    3 条评论
  • Don't let your fear win

    Don't let your fear win

    Once Krishna and Balarama got late playing in the forest. They decided to rest in there over the night and thought to…

    1 条评论
  • Data Lake & Data Mesh

    Data Lake & Data Mesh

    Global data creation is projected to exceed 180 zettabytes in the next five years. It was always a struggle to create a…

  • Analytics of Data Scientists in Kaggle

    Analytics of Data Scientists in Kaggle

    Kaggle has recently published a report on the Kaggle users on various aspects. The trend shows analysis of people…

  • Text Analysis - Word Cloud

    Text Analysis - Word Cloud

    Text Analysis : Text analysis one of the richest area in the Machine Learning space. Text analysis is the process of…

  • Machine Learning (Without CODE)

    Machine Learning (Without CODE)

    Machine learning is very fascinating for data science practitioners and everyone and there's a continuous effort…

    2 条评论
  • Statistics vs. Visualization (#Data Science)

    Statistics vs. Visualization (#Data Science)

    Understanding the statistical properties of the data is one of the key aspect of data science or Machine Learning…