MLOPs monitoring Solution

MLOPs monitoring Solution

Really glad to showcase my work/POC for Model Monitoring using Data Bricks and Microsoft Azure ML.

Problem Statement/s

? We want to enable Model Monitoring for our UCs without hampering our Architecture Blueprint

? We have tried third part libraries like AI Evidently and Alibi Detect but it was too code intrinsic and manual work. Also it didn’t support huge data sets.

? We also have special requirement of comparing two training datasets for drift in dev environments

?MLOPs monitoring framework was required which can send emails/alerts upon drift detection and visually appealing.

Proposed Solution

? We have developed the whole solution using DataBricks and AZML service of Microsoft. This will be a generalized solution for all UCs whereby we can detect the drift between scoring data & training data, graphical insights, application insights & send emails to the stakeholders for their actions.

? I have also developed a generalized template to compare two training datasets as a special requirement from our stakeholders. It will be a similar solution as mentioned above.

?I have also given the inference data back to Data Bricks if required for any visualization in a simple tabular format.

User Impact

?There wasn't any impact to existing users/roles pertaining to any solution/s.

?This implementation is an extension to existing blue print which uses DataBricks, MLFlow and AKS as protagonist solution components.

?We need to setup the AZML service coupled with Data Bricks workspace, whereby Data Drift Monitor will be a visual delight from Microsoft

Final Product:-

The Summary from Drift Monitor
The Drift Trends for several days during inference
Drift Magnitude in terms of features over several days.
The distance between target dataset attribute versus baseline dataset for the desired attribute.
A clear comparison graph based on distribution between baseline data versus real time inference data.
The email post Drift Detection by a monitor.
Application Insights for resource utilization for day to day use of the solution. It can help Solution Architects. Cloud team to tune the resources.

Summary:- This is ready to use, template driven and low code solution for monitoring ML Models. Please feel free to criticize, ask Qs or suggestions for betterment.


Ashutosh Sahoo

IT Product Owner - AI ML Analytics & Data Enablement Platform

2 年

You always manage to distinguish your work by having it done so well. Great effort and outcome. Excellent work Vishal ??

Erik Rasmussen

Regional Sales Director, Enterprise - Denmark????, Norway???? & Iceland ???? @Databricks

2 年

Great work Vishal Garg and thanks for sharing ??

Really appreciate your efforts for coming out with ideas substantiated by facts ! Kudos Vishal ??

要查看或添加评论,请登录

Vishal Garg的更多文章

  • Snowpipe in action for Realtime ingestion

    Snowpipe in action for Realtime ingestion

    In addition to my post on LinkedIn https://www.linkedin.

  • Spark via Kubernetes using MapR as Data Storage Layer

    Spark via Kubernetes using MapR as Data Storage Layer

    Some real good implementation in my Data Platform. We have used NFSV3 to expose the HDFS/MFS for data storage and used…

    3 条评论
  • Data Tiering using Mapr

    Data Tiering using Mapr

    The industry started with a single server Data Base Management Systems with scalability limited to single server. The…

    2 条评论
  • Hadoop Multi Data Centre Migration

    Hadoop Multi Data Centre Migration

    MapR Cluster Migration Via Multi Data Centre Setup Task Despeciation: - It was really challenging when I started this…

    3 条评论

社区洞察

其他会员也浏览了