
Quick Understanding: Instance segmentation vs. Semantic segmentation in Image Analysis

Rohan Chikorde

VP - AIML at BNY Mellon | 17k+ followers | AIML Corporate Trainer | University Professor | Speaker

Published: March 12, 2020

Explaining the differences between traditional image classification, object detection, semantic segmentation, and instance segmentation is best done visually.

When performing traditional image classification, our goal is to predict a set of labels that characterize the contents of an input image (top-left).

Object detection builds on image classification, but this time allows us to localize each object in an image. The image is now characterized by:

Bounding box (x, y)-coordinates for each object
An associated class label for each bounding box
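To make the difference in outputs concrete, here is a minimal sketch of what a classifier and a detector might each return for the same image. The `Detection` class, the coordinates, and the label names are illustrative assumptions, not the output format of any particular library.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """One detected object: a class label plus its bounding box (hypothetical type)."""
    label: str
    x_min: int
    y_min: int
    x_max: int
    y_max: int

    def area(self) -> int:
        # Box area in pixels, treating the max coordinates as exclusive.
        return (self.x_max - self.x_min) * (self.y_max - self.y_min)


# Classification: labels for the whole image, with no locations.
classification_output = {"cube", "sphere"}

# Object detection: one entry per object, so two cubes give two boxes.
detection_output = [
    Detection("cube", 10, 20, 60, 70),
    Detection("cube", 40, 30, 90, 80),
    Detection("sphere", 120, 25, 160, 65),
]

# The detector reports the same classes, plus where each object is.
labels_found = {d.label for d in detection_output}
cube_boxes = [d for d in detection_output if d.label == "cube"]
```

Note that the two cube boxes overlap: a bounding box says roughly where an object is, but not which pixels belong to it.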

An example of semantic segmentation can be seen in the bottom-left. Semantic segmentation algorithms require us to associate every pixel in an input image with a class label (including a class label for the background).
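As a rough illustration, a semantic segmentation result can be stored as a 2D array with one class id per pixel. The sketch below assumes NumPy and uses made-up class ids (`0` for background, `1` for cube, `2` for sphere) on a tiny toy "image".

```python
import numpy as np

# Hypothetical class ids; 0 is reserved for the background.
CLASS_NAMES = {0: "background", 1: "cube", 2: "sphere"}

# A semantic segmentation output has the same height and width as the image,
# with exactly one class id stored per pixel.
semantic_map = np.zeros((6, 8), dtype=np.uint8)
semantic_map[1:4, 1:4] = 1  # a 3x3 cube region
semantic_map[2:5, 5:7] = 2  # a 3x2 sphere region

# Count how many pixels were assigned to each class.
ids, counts = np.unique(semantic_map, return_counts=True)
pixels_per_class = {CLASS_NAMES[int(i)]: int(c) for i, c in zip(ids, counts)}
```

Every pixel ends up in exactly one class, which is why the background needs its own label.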

Pay close attention to the semantic segmentation visualization: each object is indeed segmented, but every “cube” object has the same color.

While semantic segmentation algorithms can label every object in an image, they cannot differentiate between two objects of the same class.

This behavior is especially problematic when two objects of the same class partially occlude each other: we have no idea where the boundary of one object ends and the next begins. The two purple cubes demonstrate this, since we cannot tell where one cube stops and the other starts.
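The occlusion problem can be reproduced on a toy semantic map. The sketch below assumes NumPy and uses a small hand-written flood fill (`count_regions`, a hypothetical helper) to count connected regions of cube pixels. Region counting is only a simple post-processing heuristic, used here to show what information the semantic map has lost.

```python
from collections import deque

import numpy as np


def count_regions(mask: np.ndarray) -> int:
    """Count 4-connected regions of True pixels with a breadth-first flood fill."""
    height, width = mask.shape
    seen = np.zeros_like(mask, dtype=bool)
    regions = 0
    for r in range(height):
        for c in range(width):
            if not mask[r, c] or seen[r, c]:
                continue
            # Found an unvisited foreground pixel: start a new region.
            regions += 1
            seen[r, c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and mask[ny, nx] and not seen[ny, nx]:
                        seen[ny, nx] = True
                        queue.append((ny, nx))
    return regions


CUBE = 1  # hypothetical class id for "cube"

# Two cubes that are far apart: region counting recovers both.
separated = np.zeros((6, 10), dtype=np.uint8)
separated[1:4, 1:4] = CUBE
separated[1:4, 6:9] = CUBE

# Two cubes that overlap: the semantic map holds a single cube-colored blob.
occluding = np.zeros((6, 10), dtype=np.uint8)
occluding[1:4, 1:5] = CUBE
occluding[2:5, 3:7] = CUBE

regions_separated = count_regions(separated == CUBE)
regions_occluding = count_regions(occluding == CUBE)
```

With the separated cubes the heuristic finds two regions, but the occluding cubes merge into one, so nothing in the semantic map says where one cube ends and the other begins.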

Instance segmentation algorithms, on the other hand, compute a pixel-wise mask for every object in the image, even if the objects share the same class label (bottom-right). Here you can see that each of the cubes has its own unique color, implying that our instance segmentation algorithm not only localized each individual cube but also predicted its boundaries.
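One common way to picture an instance segmentation result is a stack of boolean masks, one per object, each paired with a class label. The sketch below assumes NumPy, and the mask shapes and occlusion order are invented for illustration. It also shows that collapsing the masks into a semantic map throws the per-object identity away.

```python
import numpy as np

height, width = 6, 10

# One boolean mask per object, even when the objects share a class label.
cube_a = np.zeros((height, width), dtype=bool)
cube_a[1:4, 1:5] = True
cube_b = np.zeros((height, width), dtype=bool)
cube_b[2:5, 3:7] = True
cube_b &= ~cube_a  # assume cube_a is in front, so it hides part of cube_b

instance_masks = np.stack([cube_a, cube_b])  # shape: (num_instances, H, W)
instance_labels = ["cube", "cube"]

# Paint each instance with its own id (1, 2, ...) so each cube gets its own "color".
instance_map = np.zeros((height, width), dtype=np.uint8)
for instance_id, mask in enumerate(instance_masks, start=1):
    instance_map[mask] = instance_id

# Collapsing to a semantic map discards the ids: every cube pixel becomes class 1.
semantic_map = instance_masks.any(axis=0).astype(np.uint8)

pixels_per_instance = [int(mask.sum()) for mask in instance_masks]
```

Here `instance_map` keeps the two cubes apart along their shared boundary, while `semantic_map` only records that those pixels are "cube".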

References:

https://arxiv.org/abs/1704.06857

PyImageSearch
