Model Performance Analysis
[Image from the Convolutional Neural Networks course at deeplearning.ai]

1. Introduction

Successfully training a Machine Learning model through the lifecycle of a data science project is a great feeling – but, unless you are doing a research project or an academic exercise, you are not actually done. In production ML systems, you enter a new phase of development in which you must analyse your model's performance more deeply and from different directions. It should be underscored that a deeper analysis means evaluating your model not only on the entire dataset but also on various slices of the data. I gave an example in this post from the CAE world, which I personally encountered whilst working on a regression problem.

As another example, suppose you build an ML model to predict the demand for different models of automobiles. It then becomes important to look at the model's performance across different vehicle models, accessories, features offered, colours, and so on – i.e., you might want to evaluate how the model performs on each of these slices individually. At a higher level, there are two main ways to analyse model performance: Black Box evaluation and Model Introspection.
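To make the idea concrete, here is a minimal sketch of slice-based evaluation for such a demand-prediction model, using pandas and scikit-learn. The column names (colour, y_true, y_pred) and the toy data are purely illustrative assumptions, not from an actual project:

```python
import pandas as pd
from sklearn.metrics import mean_absolute_error

# Toy test-set predictions; "colour" is one possible slicing column.
df = pd.DataFrame({
    "colour": ["red", "red", "blue", "blue", "white"],
    "y_true": [10, 12, 30, 28, 15],
    "y_pred": [11, 12, 20, 22, 15],
})

def evaluate_by_slice(frame: pd.DataFrame, slice_col: str) -> pd.DataFrame:
    """Overall metric plus the same metric recomputed per slice."""
    rows = [{"slice": "OVERALL",
             "mae": mean_absolute_error(frame["y_true"], frame["y_pred"]),
             "n": len(frame)}]
    for value, group in frame.groupby(slice_col):
        rows.append({"slice": f"{slice_col}={value}",
                     "mae": mean_absolute_error(group["y_true"], group["y_pred"]),
                     "n": len(group)})
    return pd.DataFrame(rows)

print(evaluate_by_slice(df, "colour"))
# A slice whose MAE is far worse than the overall MAE (here, "blue")
# flags a subset of the data worth investigating.
```

The point of the sketch is that a single aggregate metric can look healthy while one slice performs badly; recomputing the same metric per slice surfaces exactly that.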

2. Black Box Evaluation vs Model Introspection

In Black Box Evaluation [i.e., input–output evaluation], you quantify the model's performance through metrics and losses without going into the details of its internal workings. Model Introspection techniques, on the other hand, are useful when you want to understand how a model works internally – e.g., you might experiment with different architectures to understand how data flows through each layer of your model.

In contrast to Black Box evaluation, in Model Introspection you are interested not just in the model's final results but also in the details of each layer.

Tools for evaluating Model Performance:

There are different tools available for evaluating model performance, as described above, such as:

• TensorBoard

• TensorFlow Model Analysis [TFMA]

Using TensorBoard, you can monitor the loss and accuracy at every iteration and thereby closely follow the training process itself. I have found the What-If Tool, which integrates with TensorBoard, very powerful; it can be run on various platforms including Jupyter Notebooks, Colab and Cloud AI Platform Notebooks. The What-If Tool can be helpful during data collection, model creation and post-training evaluation, as discussed above. It supports TensorFlow models out of the box and can also support models built with other frameworks. I will talk more about the What-If Tool in subsequent posts of this series.
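As an illustration, here is a minimal sketch of wiring the TensorBoard callback into a tf.keras training run; the toy data and tiny architecture are placeholder assumptions, not part of the original discussion:

```python
import numpy as np
import tensorflow as tf

# Placeholder data and model, purely for illustration.
x_train = np.random.rand(256, 20).astype("float32")
y_train = np.random.randint(0, 2, size=(256,))

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(20,)),
    tf.keras.layers.Dense(16, activation="relu"),
    tf.keras.layers.Dense(2, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# The TensorBoard callback logs loss and accuracy for every epoch of
# training and validation to `log_dir` for the dashboard to display.
tb_callback = tf.keras.callbacks.TensorBoard(log_dir="logs/fit",
                                             histogram_freq=1)

model.fit(x_train, y_train,
          validation_split=0.2,
          epochs=5,
          callbacks=[tb_callback])

# View the dashboard with:  tensorboard --logdir logs/fit
```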

Example of Model Introspection:

As one may recall, the operations within a CNN can be bundled into two main blocks: a) a Feature Learning Block and b) a Task Learning Block. In the Feature Learning Block, the inputs to the CNN (say, a series of images) are processed through a series of convolutional layers, during which the network learns the features corresponding to those images. This post shows how one can visualise the features a convnet learns for an image at each layer using the Keras APIs.
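The gist of that visualisation, as I understand it, is to build a second Keras model that exposes the outputs of the convolutional layers. A minimal sketch, using a stand-in convnet and a random image in place of a trained model and real data:

```python
import numpy as np
import tensorflow as tf

# A stand-in convnet; in practice you would load your trained model.
convnet = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(64, 64, 3)),
    tf.keras.layers.Conv2D(8, 3, activation="relu", name="conv1"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(16, 3, activation="relu", name="conv2"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation="softmax"),  # task-learning block
])

# Build an "activation model" whose outputs are the feature maps of
# every convolutional layer (the feature-learning block).
conv_layers = [l for l in convnet.layers
               if isinstance(l, tf.keras.layers.Conv2D)]
activation_model = tf.keras.Model(inputs=convnet.input,
                                  outputs=[l.output for l in conv_layers])

# Push one (here random) image through and inspect each layer's maps.
image = np.random.rand(1, 64, 64, 3).astype("float32")
activations = activation_model.predict(image)
for layer, maps in zip(conv_layers, activations):
    print(layer.name, maps.shape)   # e.g. conv1 (1, 62, 62, 8)
```

Each returned array can then be plotted channel by channel to see what the layer responds to, which is exactly the kind of internal detail Model Introspection is after.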

3. Model Performance Analysis: Performance Metrics vs Optimization Objectives

I have discussed Performance Metrics in this article, highlighting the evaluation metrics for regression and classification problems.

As for optimization algorithms, I have very briefly touched upon the optimization landscape.

Frameworks such as TensorFlow (and most others) have options for tracking performance metrics, like accuracy, and optimization objectives, such as loss, after each epoch of training and validation.
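In tf.keras, for instance, the distinction shows up directly in compile(): the loss is the optimization objective, while metrics are the performance metrics, and fit() records both per epoch. A minimal sketch with toy data (the model and data are placeholder assumptions):

```python
import numpy as np
import tensorflow as tf

# Toy binary-classification data, purely for illustration.
x = np.random.rand(200, 10).astype("float32")
y = np.random.randint(0, 2, size=(200,))

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(10,)),
    tf.keras.layers.Dense(8, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam",
              loss="binary_crossentropy",  # optimization objective
              metrics=["accuracy"])        # performance metric

history = model.fit(x, y, validation_split=0.25, epochs=3, verbose=0)

# history.history maps each tracked quantity to a per-epoch list:
# "loss", "accuracy", "val_loss", "val_accuracy".
for epoch, (loss, acc) in enumerate(
        zip(history.history["loss"], history.history["accuracy"]), start=1):
    print(f"epoch {epoch}: loss={loss:.4f}, accuracy={acc:.4f}")
```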
