Building a Machine Learning Pipeline – Deployment

Welcome back! I hope you enjoyed the previous two articles on building a machine learning pipeline (Part 1 and Part 2, for readers who missed them). In this article we conclude our theoretical journey of building a machine learning pipeline; the next article will focus on building a proof of concept. Without further ado, let's begin!

Now that we have the data and the appropriate model, it is time to make the model accessible and available for use.


To recap, the deployment phase comprises two aspects:

  1. Deploy to a live environment 
  2. Observe and refine the model 


There are several ways to deploy your model. These will be covered shortly, but first consider the following:

Type of predictions

  1. Batch processing – In this scenario the model runs as a background process, triggered when data becomes available in a data warehouse or is uploaded to file storage such as an Azure Storage Account or AWS S3. Reporting and forecasting engines, for example, generally do not need real-time prediction/classification. A minimal batch-scoring sketch follows this list. 
  2. Real-time / on-demand – We expect our model to provide real-time predictions based on the input. For example, machine learning based recommendation systems, facial recognition, natural language processing, etc. 
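To make the batch option concrete, here is a minimal sketch of the kind of background scoring job such a storage trigger would invoke. It assumes a scikit-learn-style model persisted with joblib; all file names and paths are illustrative, not prescriptive.

```python
# Minimal batch-scoring sketch: load a previously trained model and score a
# file that landed in storage. File names here are placeholders.
import joblib
import pandas as pd

def score_batch(input_path: str, output_path: str,
                model_path: str = "model.joblib") -> None:
    """Score every row in a CSV file and write the predictions back out."""
    model = joblib.load(model_path)      # estimator trained earlier in the pipeline
    features = pd.read_csv(input_path)   # new records awaiting prediction
    features["prediction"] = model.predict(features)
    features.to_csv(output_path, index=False)

# In production this function would be invoked by the storage event itself
# (e.g., an S3 PUT notification) rather than called by hand:
# score_batch("incoming/today.csv", "scored/today.csv")
```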

Ability to observe and refine the model

  1. Being able to retrieve and analyze performance metrics is key to making sure your model is serving the needs of the users and delivering value. 
  2. In the event the model is not performant, there needs to be a fairly simple way to test and update/replace it with minimal downtime and programmatic changes. A/B testing principles can be applied by routing a slice of input data to a potential replacement or next-generation model and generating comparative metrics between it and the current one (see the sketch after this list). 
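As a rough illustration of that A/B idea, the sketch below routes a small slice of traffic to a challenger model and tags each result so per-variant metrics can be compared downstream. The 10% split and the model file names are assumptions for the example, not fixed recommendations.

```python
# Hedged sketch of A/B routing between the current ("champion") model and a
# candidate ("challenger"). Split ratio and file names are illustrative.
import random
import joblib

champion = joblib.load("model_v1.joblib")
challenger = joblib.load("model_v2.joblib")
CHALLENGER_TRAFFIC = 0.10  # assumed fraction of requests sent to the new model

def predict_with_ab_test(features):
    """Route a slice of traffic to the challenger and tag each result so
    downstream metrics can be compared per variant."""
    if random.random() < CHALLENGER_TRAFFIC:
        return {"variant": "challenger",
                "prediction": challenger.predict([features])[0]}
    return {"variant": "champion",
            "prediction": champion.predict([features])[0]}
```

Logging the `variant` tag alongside the eventual outcome is what makes the comparative metrics possible; swapping the challenger in permanently then becomes a configuration change rather than a code change.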

Depending on the use case, one or more of the following deployment solutions may apply:

  1. Direct Injection – Go with this option if you have a custom application, or one that can easily be modified to incorporate new modules (perhaps a mobile application). Injecting the trained model directly into the application as a component keeps processing as close as possible to both the user interaction and the input parameters. Minimizing data movement and reducing latency has benefits, albeit at the cost of interdependence and coupling, since this approach will probably require recoding the model or writing a wrapper so it can be embedded in the application. An in-process sketch follows this list. 
  2. Convert and Import – Converting the model into a portable format like the Predictive Model Markup Language (PMML) or the Portable Format for Analytics (PFA) may make sense if its final resting place is a data mining or analytics engine that can import models coded in PMML or PFA. This is my least favorite option. A PMML export sketch follows this list. 
  3. Deploy as a Service – With more applications moving to the cloud, deploying your model as a service through Google's Cloud ML Engine or AWS SageMaker gives you a pay-per-use, scalable stack. My next blog series will be a proof-of-concept walk-through of the entire machine learning pipeline using AWS SageMaker; a deployment sketch follows this list. 
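To illustrate direct injection, here is a minimal in-process sketch, again assuming a scikit-learn-style model persisted with joblib. The component class, method, and file name are hypothetical; the point is that the model ships inside the application and no network hop is involved.

```python
# Sketch of "direct injection": the trained model is bundled with the
# application and runs in-process. Names here are illustrative.
import joblib

class RecommendationComponent:
    """Application module that wraps the embedded model behind a plain method."""

    def __init__(self, model_path: str = "recommender.joblib"):
        self._model = joblib.load(model_path)  # loaded once at start-up

    def recommend(self, user_features):
        # Prediction happens right next to the user interaction;
        # no data leaves the application.
        return self._model.predict([user_features])[0]
```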
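For convert and import, below is a hedged sketch of a PMML export. It assumes the third-party sklearn2pmml package (which requires a Java runtime) and uses a toy dataset purely for illustration.

```python
# Hedged sketch of exporting a scikit-learn model to PMML, assuming the
# third-party sklearn2pmml package and a Java runtime are available.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn2pmml import sklearn2pmml
from sklearn2pmml.pipeline import PMMLPipeline

X, y = load_iris(return_X_y=True)  # toy data standing in for real training data
pipeline = PMMLPipeline([("classifier", LogisticRegression(max_iter=1000))])
pipeline.fit(X, y)

# Writes a portable XML description of the model that a PMML-aware
# analytics engine can import without any Python on the target side.
sklearn2pmml(pipeline, "model.pmml")
```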
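And for deploy-as-a-service, here is a rough sketch using the AWS SageMaker Python SDK. The bucket, IAM role, entry-point script, and framework version are all placeholders; the upcoming proof-of-concept series will walk through this end to end.

```python
# Rough sketch of deploying a trained scikit-learn model as a managed
# SageMaker endpoint. All identifiers below are placeholders.
from sagemaker.sklearn.model import SKLearnModel

model = SKLearnModel(
    model_data="s3://my-bucket/model/model.tar.gz",   # packaged model artifact
    role="arn:aws:iam::123456789012:role/SageMakerRole",
    entry_point="inference.py",        # script that loads the model and serves predictions
    framework_version="1.2-1",
)

# Provisions the endpoint; SageMaker handles scaling and pay-per-use billing.
predictor = model.deploy(initial_instance_count=1, instance_type="ml.m5.large")
result = predictor.predict([[5.1, 3.5, 1.4, 0.2]])
```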

So that is it for now. Over the last few weeks we have covered the three phases of building a machine learning pipeline. Every phase is equally important, but it all starts with data! As mentioned above, next up is a practical implementation of all the concepts we've read about. 


