登录查看更多内容

Marvelous MLOps #19: What do ML engineers deploy: batch use case

Marvelous MLOps

Power up MLOps with Marvelous content

发布日期: 2023年8月31日

In the article Deployment strategies for ML products, we talked about the need for 3 environments with access to production data (DEV, ACC, PRD) and how those environments are used in the ML deployment process. We have touched a bit on what exactly is being deployed, but it is good to come up with some concrete examples.

I will take a very common example from the retail industry, a use case with probably the most impact for any retailer: demand forecast for a warehouse or stores. Typically, we are talking about multiple models here: one for each product category, and there are tens, or hundreds of them.

Steps involved in the deployment

Demand forecast is usually implemented as a batch process, where predictions for coming x days are delivered daily: via SFTP transfer, or via writing to a database. What are the steps involved to make it happen?

Data preprocessing. Usually, there is one big table where new data is processed and added incrementally each day. This table contains features needed for model retraining and model inference.
(Conditional) model retraining. The model can be retrained periodically (for example, every week), or only when significant data drift occurs. Otherwise, the latest artifact is used.
Model inference (generation of predictions).
Delivery of predictions

Read further here: https://marvelousmlops.substack.com/p/what-do-ml-engineers-deploy-batch

??Hakim Elakhrass

post-deployment data science | OSS | co-founder @ nannyML

1 年

#batchforlife, would love to see some deep dives in dev/acc/prod, specifically what are the best practices for integration and acceptance tests

2 次回应

查看更多评论

要查看或添加评论，请登录

Marvelous MLOps #19: What do ML engineers deploy: batch use case

Marvelous MLOps

Power up MLOps with Marvelous content

Steps involved in the deployment

更多精彩文章

社区洞察

其他会员也浏览了

The BigLittle Origin Story : Redefining the Future of RevOps

Key trends to look out for in 2024

10 Proven Techniques to Reduce Latency in Software ??

Ensuring Integrity in Digital Systems: The Importance of Right Shift Testing

June 2024 Edition: Why E2E Fails, Message Queues Testing Guide, Upcoming Events & More for Engineering Leaders

Data Engineering: The ultimate game changer

MLOps Best Practices

Label smoothing: for solving overfitting and overconfidence [code included]

Mastering the Unthinkable: Executing Hundreds of Millions of Workflows Daily

Solving Problems in the World of 4.0

Steps involved in the deployment

Marvelous MLOps #56: Streamlining ML Model Monitoring with Databricks Lakehouse and Inference Tables

2024年11月28日

Marvelous MLOps #55: Traffic Splits Aren’t True A/B Testing for Machine Learning Models

2024年11月4日

Marvelous MLOps #54. Developing on Databricks (without compromises)

2024年10月14日

Marvelous MLOps #53. Top data & AI conferences to attend in 2024

2024年9月3日

Marvelous MLOps #52: How Much ML Should Engineers in Tech Really Know?

2024年8月27日

Marvelous MLOps #51: MLOps with Databricks Roadmap & Course Announcement

2024年8月16日

Marvelous MLOps #50: Dealing with private Python packages in Databricks Asset Bundles, part 1.

2024年8月11日

Marvelous MLOps #49: Handy Databricks Features for Development

2024年7月31日

Marvelous MLOps #48: Lessons learned from migrating models to Unity Catalog

2024年7月24日

Marvelous MLOPs #47: Ain't No Database for All Your Needs

2024年7月17日

社区洞察

其他会员也浏览了

The BigLittle Origin Story : Redefining the Future of RevOps

Key trends to look out for in 2024

10 Proven Techniques to Reduce Latency in Software ??

Ensuring Integrity in Digital Systems: The Importance of Right Shift Testing

June 2024 Edition: Why E2E Fails, Message Queues Testing Guide, Upcoming Events & More for Engineering Leaders

Data Engineering: The ultimate game changer

MLOps Best Practices

Label smoothing: for solving overfitting and overconfidence [code included]

Mastering the Unthinkable: Executing Hundreds of Millions of Workflows Daily

Solving Problems in the World of 4.0