MLOps Best Practices

Saket Kishore

Principal Data Scientist: Artificial Intelligence at UKG (Ultimate Kronos Group) | Architect | Gen AI | ex Deutsche Bank, IBM, Oracle

Published: February 1, 2023

Every data scientist can relate to this quote: “…developing and deploying ML systems is relatively fast and cheap, but maintaining them over time is difficult and expensive.” – D. Sculley et al.

Perhaps you have encountered it while trying to solve a problem in one of the many moving parts of your machine learning system: data, model, or code.

While it’s relatively easy to develop a model to achieve business objectives (item classification or predicting a continuous variable) and deploy it to production, operating that model in production comes with a myriad of issues.

Model performance may degrade in production for reasons such as data drift, or you might need to change the preprocessing technique. Either way, new models have to be shipped to production regularly to address performance decline or to improve model fairness.
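To make the idea of data drift concrete, here is a minimal sketch of one common way to quantify it, the Population Stability Index (PSI), which compares how a feature is distributed in a reference sample (for example, training data) and in a recent production sample. The function name, the equal-width binning and the bin count are assumptions made for this illustration, not something prescribed by the article.

```python
import math


def population_stability_index(reference, current, n_bins=10):
    """PSI between two numeric samples, using equal-width bins over the reference range."""
    lo, hi = min(reference), max(reference)
    # Fall back to a width of 1.0 if the reference sample is constant.
    width = (hi - lo) / n_bins or 1.0

    def bin_fractions(values):
        counts = [0] * n_bins
        for v in values:
            idx = int((v - lo) / width)
            # Values outside the reference range are clamped into the edge bins.
            idx = min(max(idx, 0), n_bins - 1)
            counts[idx] += 1
        # A small floor keeps empty bins from producing log(0).
        return [max(c / len(values), 1e-6) for c in counts]

    ref = bin_fractions(reference)
    cur = bin_fractions(current)
    return sum((c - r) * math.log(c / r) for r, c in zip(ref, cur))


# Hypothetical data: the production sample is shifted relative to training.
training_sample = list(range(100))
production_sample = [x + 30 for x in range(100)]
drift_score = population_stability_index(training_sample, production_sample)
```

A PSI of zero means the two binned distributions match, and larger values mean a larger shift. A commonly quoted rule of thumb treats values above roughly 0.2 as worth investigating, but the right alerting threshold depends on the feature and the system.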

Hacking together a solution usually means incurring technical debt, which accumulates as your system ages or becomes more complex. Worse, you could lose time, waste compute resources and cause production issues. This calls for "MLOps".

Some practices you should definitely consider implementing are:

Naming conventions
Code quality checks
Run experiments, and track them
Data validation
Model validation across segments
Resource utilization: remember that your experiments cost money
Monitor predictive service performance
Think carefully about your choice of ML platforms
Open communication lines are important
Score your ML system periodically
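As an illustration of one item on this list, model validation across segments, the sketch below computes accuracy per segment and flags segments that trail the overall accuracy by more than a chosen margin. The function names, the `max_gap` parameter and the sample data are hypothetical and exist only for this example; in practice the metric and the margin would be chosen to fit the use case.

```python
from collections import defaultdict


def accuracy_by_segment(y_true, y_pred, segments):
    """Return {segment: accuracy} for labels grouped by segment."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for truth, pred, seg in zip(y_true, y_pred, segments):
        totals[seg] += 1
        hits[seg] += int(truth == pred)
    return {seg: hits[seg] / totals[seg] for seg in totals}


def weak_segments(y_true, y_pred, segments, max_gap=0.1):
    """Return segments whose accuracy is more than max_gap below overall accuracy."""
    overall = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    per_segment = accuracy_by_segment(y_true, y_pred, segments)
    return {seg: acc for seg, acc in per_segment.items() if overall - acc > max_gap}


# Hypothetical evaluation data: the model does well on "web" and poorly on "mobile".
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 1, 1, 0, 1, 0]
segments = ["web"] * 4 + ["mobile"] * 4

flagged = weak_segments(y_true, y_pred, segments)
```

A single aggregate metric can hide a segment where the model performs badly, which is why a per-segment check like this is often run before a new model is promoted.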

Try them out, and you’ll definitely see some improvement in your work on ML systems. What other factors or practices do you consider important? I would love to hear your thoughts.

Swati Bharti

Digital Marketer

1 year ago

This is a great article that emphasizes the importance of data-centric MLOps throughout the ML lifecycle, which is crucial for successful machine learning projects. Learn more about data-centric MLOps here: https://aitech.studio/aie/mlops-best-practices/
