You should learn to deploy your Machine Learning models! How you deploy is dictated by the business requirements, so you should not start any ML development before you know how you are going to deploy the resulting model. There are four main ways to deploy ML models:
- Batch deployment - Predictions are computed at a fixed frequency (for example, daily), stored in a database, and cheaply retrieved when needed. The downsides are that predictions cannot use the most recent data and can quickly become outdated. Look at this article on how Airbnb progressively moved from batch to real-time deployments: “Machine Learning-Powered Search Ranking of Airbnb Experiences”. A minimal batch-scoring sketch follows this list.
- Real-time deployment - The "real-time" label describes a synchronous process: a user requests a prediction, the request reaches a backend service through an HTTP API call, and the backend forwards it to an ML service. It is great if you need personalized predictions that use recent contextual information, such as the time of day or the user's latest searches. The problem is that until the user receives the prediction, the backend and ML services are blocked waiting for it to come back. To handle additional parallel requests from other users, you need multi-threaded processes and horizontal scaling by adding servers. Here are simple tutorials on real-time deployments in Flask and Django: “How to Easily Deploy Machine Learning Models Using Flask”, “Machine Learning with Django”. A minimal Flask sketch follows this list.
- Streaming deployment - This allows for a more asynchronous process, where an event triggers the inference. For example, as soon as you land on your Facebook page, the ads-ranking process can be triggered, and by the time you scroll, the ad is ready to be presented. The request is queued in a message broker such as Kafka, and the ML service handles it when it is ready. This frees up the backend service and saves a lot of computation thanks to efficient queueing. The resulting predictions can be queued as well and consumed by backend services when needed. Here is a tutorial using Kafka: “A Streaming ML Model Deployment”. A minimal Kafka sketch follows this list.
- Edge deployment - The model is deployed directly on the client, such as a web browser, a mobile phone, or an IoT device. This gives the fastest inference and also works offline (disconnected from the internet), but the model usually needs to be pretty small to fit on modest hardware. For example, here is a tutorial on deploying YOLO on iOS: “How To Build a YOLOv5 Object Detection App on iOS”. A minimal model-shrinking sketch follows this list.
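Batch sketch: a minimal nightly scoring job. It assumes a pickled scikit-learn model at model.joblib and a users feature table with columns f1, f2, f3 in a local SQLite database; all of these names are hypothetical.

```python
# Minimal batch-scoring sketch (assumed artifacts: model.joblib, features.db).
import sqlite3

import joblib
import pandas as pd


def run_batch_job(db_path: str = "features.db") -> None:
    model = joblib.load("model.joblib")  # periodically retrained model (assumed)
    conn = sqlite3.connect(db_path)
    features = pd.read_sql("SELECT user_id, f1, f2, f3 FROM users", conn)

    # Score every row at once; these predictions go stale until the next run.
    features["prediction"] = model.predict(features[["f1", "f2", "f3"]])

    # Persist so the serving layer can do a cheap key lookup later.
    features[["user_id", "prediction"]].to_sql(
        "predictions", conn, if_exists="replace", index=False
    )
    conn.close()


if __name__ == "__main__":
    run_batch_job()  # typically scheduled daily with cron or Airflow
```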
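Real-time sketch: the Flask route below illustrates the synchronous pattern described above; it is not the linked tutorial's code, and the model path and feature names f1/f2/f3 are assumptions.

```python
# Minimal synchronous serving sketch with Flask (pip install flask joblib).
import joblib
from flask import Flask, jsonify, request

app = Flask(__name__)
model = joblib.load("model.joblib")  # loaded once at startup (assumed artifact)


@app.route("/predict", methods=["POST"])
def predict():
    payload = request.get_json()
    # The request thread blocks here until inference finishes, which is why
    # real-time serving needs multiple workers/servers to handle load.
    features = [[payload["f1"], payload["f2"], payload["f3"]]]
    return jsonify({"prediction": float(model.predict(features)[0])})


if __name__ == "__main__":
    app.run(port=5000)
```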
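Streaming sketch: a consumer/producer loop using the kafka-python client. The topic names, the JSON event schema, and the request_id field are assumptions for illustration.

```python
# Minimal streaming-inference sketch (pip install kafka-python joblib).
import json

import joblib
from kafka import KafkaConsumer, KafkaProducer

model = joblib.load("model.joblib")  # assumed artifact

consumer = KafkaConsumer(
    "inference-requests",  # hypothetical topic fed by the backend
    bootstrap_servers="localhost:9092",
    value_deserializer=lambda m: json.loads(m.decode("utf-8")),
)
producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda m: json.dumps(m).encode("utf-8"),
)

# The broker buffers events, so the model consumes at its own pace and the
# backend is never blocked waiting for a prediction.
for event in consumer:
    features = [[event.value["f1"], event.value["f2"], event.value["f3"]]]
    producer.send(
        "inference-results",  # hypothetical topic the backend consumes later
        {
            "request_id": event.value["request_id"],
            "prediction": float(model.predict(features)[0]),
        },
    )
```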
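Edge sketch: the iOS tutorial above relies on Apple's tooling, but as a generic illustration of shrinking a model for small hardware, here is how a Keras model could be quantized with TensorFlow Lite before being bundled into an app; the model file name is an assumption.

```python
# Minimal edge-deployment sketch: shrink a Keras model with TensorFlow Lite.
import tensorflow as tf

model = tf.keras.models.load_model("model.h5")  # assumed artifact
converter = tf.lite.TFLiteConverter.from_keras_model(model)
# Default optimization quantizes weights, trading a little accuracy
# for a much smaller binary that fits on phones and IoT devices.
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()

with open("model.tflite", "wb") as f:
    f.write(tflite_model)  # ship this file inside the mobile app bundle
```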
Comment (Founder and CEO at Streambased): I feel like all of these depend more on the way in which the data is made available than on the business requirements. With unified batch/streaming approaches like Confluent's Tableflow and Streambased, you may be able to pick and choose the best aspects of each. I can see a future where the model is trained periodically on a larger set of streaming data that is 'downcast' to batch and then iteratively trained in between.