Modern Model Accuracy Analysis
Alexander Berkovich
Principal AI/ML Engineer @ Akridata | Computer Vision Expert | Saving time on visual data curation.
Introduction
Improving a model’s accuracy is a tricky task, with an open question — where do you focus your resources?
The data-centric AI approach focuses on the data — improve the data quality where needed, focus the training cycles where the model suffers and you should be on the right path. But how do you identify these pain points? Confusion matrices, P-R curves, accuracy histograms and other metrics are standard practices in determining a model’s accuracy. Each one provides a slice through the model and the data, emphasizing potential key points to focus on.
Can we go one step forward? YES!
Modern accuracy analysis
Akridata 's Data Explorer offers an interactive platform, where the accuracy metrics are connected directly to the data, saving valuable time in analyzing model’s accuracy, understanding what caused inaccuracies and allowing DS teams to target the next training cycle exactly where the model misfires.
Classification, Object Detection, Segmentation
Data Explorer is an AI platform that saves hours on visual data curation and lowers overall development costs, by giving the researchers and data scientists a simple way to manage their data, reducing annotation spend and eliminating wasted training cycles.
For each branch of computer vision — classification, object detection and segmentation, Data Explorer provides accuracy metrics, that are interactively controlled by relevant parameters and offer an immediate insight into the data behind them.
IOU & Confidence Threshold
Lets start with a few definitions:
The platform allows us to control the IOU Threshold and the Confidence Threshold at different granularities. By default we get a uniform distribution across the [0, 1] range for either threshold, as seen below:
However, if a specific range is more suitable, that can be set, as seen below:
Both thresholds are relevant for Object Detection and Segmentation problems, while for the Classification case only the Confidence Threshold plays a role.
Precision-Recall Curve
A Precision-Recall Curve plots the trade-off between precision and recall at different thresholds, and is a useful measure when prioritizing the number of false positives or capturing as many true positives as possible.
A PR-Curve for each object is provided, where we see how its shape is affected by choosing a different threshold, allowing the DS team to balance between Precision and Recall values. A PR-Curve for “Bird” is displayed below:
A PR-Curve is provided for all problem types — Classification, Object Detection and Segmentation.
Confidence — IOU Histogram
A different way of slicing a model’s accuracy, is to look at the Confidence-IOU histogram. For each object, we can see the number of elements in a specific cell, as in the example below:
Examples in the top left corner may flush-out a problem — a model’s output with high confidence, but a low IOU might indicate a serious accuracy concern, or an error on the labeling part. Below is an example that demonstrates this — top right cell of the histogram contains four instances of model predictions with a high Confidence score (> 0.8), while a very low IOU with labeled data (< 0.2). When the images related to the cell are viewed, each shows examples of missing labels, as seen on the right of the image.
领英推荐
A Confidence-IOU histogram is provided for Object Detection and Segmentation problems, while in the Classification case, a bar plot of Confidence vs Number of Samples is shown. As seen below, the bar plot is shown per class, and similar to the histogram above, each bin provides direct access to the images:
Confusion Matrix
A confusion matrix visualizes a model’s accuracy, counting the number of correct and incorrect predictions, given certain IOU & Confidence thresholds.
As with the histogram above, each cell in the confusion matrix can be clicked to view the instances it represents, providing direct feedback on model and labeling accuracy. In the example below, a combination of the thresholds, the confusion matrix and a cell’s content are visible, for an Object Detection case:
A confusion matrix is provided for all problem types — Classification, Object Detection and Segmentation.
Segmentation mask
While the Confusion Matrix above shows an example for an Object Detection type, Segmentation masks are visible too, in similar fashion, as on the image below:
Summary
In this blog we saw how Data Explorer provides the next step in model accuracy analysis through interactivity and bridging the gap between statistical results and the data behind them.
Combined together, Data Explorer saves researchers and DS teams hours on result analysis, directing their next training cycle exactly where the model needs improvement, and ultimately lowers development cost and duration.
Message me at [email protected] for a demo of Data Explorer, visit us at akridata.ai or click here to register for a free account.
Computer Vision Engineer | Machine Learning Engineer
1 年Really interesting ??