Quick Understanding: Instance segmentation vs. Semantic segmentation in Image Analysis
https://arxiv.org/abs/1704.06857

Quick Understanding: Instance segmentation vs. Semantic segmentation in Image Analysis

Explaining the differences between traditional image classification, object detection, semantic segmentation, and instance segmentation is best done visually.

No alt text provided for this image

When performing traditional image classification our goal is to predict a set of labels to characterize the contents of an input image (top-left).

Object detection builds on image classification, but this time allows us to localize each object in an image. The image is now characterized by:

  1. Bounding box (x, y)-coordinates for each object
  2. An associated class label for each bounding box

An example of semantic segmentation can be seen in the bottom-left. Semantic segmentation algorithms require us to associate every pixel in an input image with a class label (including a class label for the background).

Pay close attention to our semantic segmentation visualization — notice how each object is indeed segmented but each “cube” object has the same color.

While semantic segmentation algorithms are capable of labeling every object in an image they cannot differentiate between two objects of the same class.

This behavior is especially problematic if two objects of the same class are partially occluding each other — we have no idea where the boundaries of one object ends and the next one begins, as demonstrated by the two purple cubes, we cannot tell where one cube starts and the other ends.

Instance segmentation algorithms, on the other hand, compute a pixel-wise mask for every object in the image, even if the objects are of the same class label (bottom-right). Here you can see that each of the cubes has their own unique color, implying that our instance segmentation algorithm not only localized each individual cube but predicted their boundaries as well.


References:

https://arxiv.org/abs/1704.06857

Pyimagesearch

Shibu Nair

Analytics & Planning Manager at Majid Al Futtaim

4 年

Spot on Rohan Chikorde . Thanks for sharing.

要查看或添加评论,请登录

Rohan Chikorde的更多文章

  • Key Steps to Learn Machine Learning in 2024

    Key Steps to Learn Machine Learning in 2024

    Welcome to the exciting world of machine learning! Whether you're a complete beginner or have some programming…

    3 条评论
  • From Content to Art: An Introduction to Neural Style Transfer using Python and TensorFlow

    From Content to Art: An Introduction to Neural Style Transfer using Python and TensorFlow

    In this blog post on Neural Style Transfer - a technique that allows you to combine the content of one image with the…

  • Dask vs Spark

    Dask vs Spark

    #Apache Spark is a popular distributed computing tool for tabular datasets that is growing to become a dominant name in…

    1 条评论
  • How to Handle Large Data for Machine Learning

    How to Handle Large Data for Machine Learning

    Many times, data scientist or analyst finds difficulty to fit large data (multiple #GB/#TB) into memory and this is a…

    2 条评论
  • Configure Deep Learning Architecture

    Configure Deep Learning Architecture

    Deep Learning can used in wide range of domains – Ecommerce, Supply Chain, Transportation, Medicine etc. and there are…

    4 条评论
  • Recurrent Neural Networks (#RNN) and #LSTM- Deep Learning

    Recurrent Neural Networks (#RNN) and #LSTM- Deep Learning

    What do you do if the patterns in your data change with time? In that case, your best bet is to use a recurrent neural…

  • Deep Learning vs Traditional Machine Learning... Which one I should use?

    Deep Learning vs Traditional Machine Learning... Which one I should use?

    Over the past several years, deep learning has become the go-to technique for most AI type problems, overshadowing…

  • Use Cases of Deep Learning

    Use Cases of Deep Learning

    Deep Learning (DL) has become more than just a buzzword in the Artificial Intelligence (AI) community – it is reshaping…

    11 条评论
  • Simplifying Deep Learning - Part II

    Simplifying Deep Learning - Part II

    Outline of Deep Belief Nets Algorithm An RBM can extract features and reconstruct input data, but it still lacks the…

    1 条评论
  • Simplifying Deep Learning - Part I

    Simplifying Deep Learning - Part I

    If you are looking out simplify deep learning so as to make sense out of technical details, then here you go…

    7 条评论

社区洞察

其他会员也浏览了