Is AI labor intensive?

Contrary to the popular perception that AI will disrupt labor and employment, AI is itself extremely labor intensive.

It is not blasphemous.

Ask anyone who has worked on computer vision AI projects, and they will vouch for data labeling being the MOST important factor for success.

Computer vision AI projects range from object detection and facial detection to tumor diagnosis and self-driving cars.

Accurate and efficient labelling ("annotation") for each type of business problem is critical to a project's success.

As per McKinsey's report towards the end of last year, amongst the various AI functions, robotic process automation, computer vision, and machine learning are most commonly deployed. 

Types of Annotations

The annotations vary based on the type of use case being addressed.

a) Classification

This is the most commonly used type, where humans label images manually by marking whether an image contains a car, a tree, or whichever specific objects the use case is being developed for. Tech giants like Facebook and Google use this regularly, getting their own users to label captcha images.

b) Outline annotation

In this type of annotation, the labeller records x, y coordinates by drawing shapes around the outline of the object of interest. The approaches covered here are:

  • Bounding Box

In this approach, 2D boxes are drawn around objects, either manually or by tools, and then labeled to produce training data for Deep Learning use cases.

This approach is quite cost effective, and multiple objects can be labeled in one image. But a box fits irregular shapes poorly: it cannot usefully label a meandering river, for example.
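As a rough illustration of what a bounding-box label might look like as data, here is a minimal Python sketch. The `BoxAnnotation` schema, its field names, and the coordinates are assumptions made up for this example, not the format of any particular labelling tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoxAnnotation:
    """One labeled 2D box in pixel coordinates (hypothetical schema)."""
    label: str
    x_min: int
    y_min: int
    x_max: int
    y_max: int

    def area(self) -> int:
        """Box area in pixels."""
        return (self.x_max - self.x_min) * (self.y_max - self.y_min)


def boxes_by_label(boxes):
    """Group the boxes drawn on one image by their label."""
    grouped = {}
    for box in boxes:
        grouped.setdefault(box.label, []).append(box)
    return grouped


# Several objects annotated in a single image (made-up coordinates).
image_boxes = [
    BoxAnnotation("car", 10, 20, 110, 80),
    BoxAnnotation("car", 150, 30, 230, 90),
    BoxAnnotation("tree", 300, 0, 340, 120),
]
grouped = boxes_by_label(image_boxes)
```

Each box needs only a label and two corner points, which is part of why this style of labelling is cheap to produce.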

  • Polygon

The polygon approach is more exact but also more time consuming, as shown by Figure-Eight in one of their blogs. This type of annotation is also used for labelling medical image scans, e.g. lung cancer nodules in an MRI scan.

source: Figure-Eight
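
To make the precision difference concrete, the sketch below compares the area of a polygon outline with the area of the smallest axis-aligned box around it. It uses the standard shoelace formula; the L-shaped outline and the function names are assumptions invented for illustration.

```python
def polygon_area(points):
    """Area of a simple (non-self-intersecting) polygon, shoelace formula."""
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]  # wrap around to close the polygon
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def enclosing_box_area(points):
    """Area of the smallest axis-aligned box containing all vertices."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))


# An L-shaped object outline (made-up coordinates).
outline = [(0, 0), (4, 0), (4, 1), (1, 1), (1, 4), (0, 4)]

# Fraction of the enclosing box that the object actually covers.
fill_ratio = polygon_area(outline) / enclosing_box_area(outline)
```

For this outline the polygon covers less than half of its enclosing box, so a box label would mark mostly background. The price is that the labeller clicks six vertices instead of two corners.
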
  • Dot

This type of annotation is used for counting jobs and for gesture or facial recognition tasks. For example, to count the number of people in a stadium from drone footage, the labeller needs to put a dot on each individual in the image.

For gesture and facial recognition, the dots are placed on the face to ascertain a person's emotion or level of interest, for instance in a prop or product in a retail store.
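
For the counting case, dot annotations reduce to a list of labeled points, and the count is simply the number of dots per label. A minimal Python sketch, assuming a hypothetical record layout with `x`, `y`, and `label` keys and made-up coordinates:

```python
from collections import Counter

# One dot per individual placed by the labeller (made-up coordinates).
dots = [
    {"x": 12, "y": 40, "label": "person"},
    {"x": 57, "y": 44, "label": "person"},
    {"x": 90, "y": 38, "label": "person"},
    {"x": 130, "y": 70, "label": "vendor_cart"},
]


def count_by_label(dots):
    """Count how many dots were placed for each label."""
    return Counter(dot["label"] for dot in dots)


counts = count_by_label(dots)
```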

c) Pixel Labeling

This technique is used to label each and every pixel in an image. The image is first divided into multiple segments, and polygons are then drawn around the objects of interest. A very detailed level of information is labeled, down to color-coded RGB pixels. Because of this high level of precision, the technique is reportedly the most expensive and time consuming.
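
One common way to store a pixel-level label is a mask with the same height and width as the image, where each cell holds a class ID. The sketch below assumes that representation; the class IDs, class names, and the tiny 4x6 mask are all invented for illustration.

```python
from collections import Counter

# Hypothetical mapping from class ID to class name.
CLASS_NAMES = {0: "background", 1: "road", 2: "car"}

# A tiny 4x6 "image": every pixel carries exactly one class ID.
mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 2, 2, 0, 0, 0],
    [1, 2, 2, 1, 1, 1],
    [1, 1, 1, 1, 1, 1],
]


def pixel_counts(mask):
    """Count labeled pixels per class name."""
    counts = Counter(class_id for row in mask for class_id in row)
    return {CLASS_NAMES[class_id]: n for class_id, n in counts.items()}


counts = pixel_counts(mask)
```

Even this toy mask requires 24 individual labels, which hints at why pixel labeling of full-resolution images is so costly.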

How is the world addressing this?

A plethora of solutions and initiatives have been undertaken to address the annotation challenge.

ImageNet has around 15 million images for exactly this purpose and is a popular starting point for Vision AI. But it suffers from large label ‘boxes’, few label types and high error rates.

Large companies use ‘subtle’ crowdsourcing, either through captcha image labeling (Facebook) or through human labeling (Google). Startups such as Figure-Eight, Handl, and LabelBox use a hybrid approach to labelling.

In China, call center executives are queueing up to do manual labelling:

A full-time data-tagger at BasicFinder can earn 6,000 to 7,000 yuan a month, along with accommodation and social benefits. In the first three quarters of 2018, the disposable income per capita in Beijing was 46,426 yuan, around 5,158 yuan a month, according to local government statistics. (Source: xinhuanet)

On the API front, Amazon Rekognition provides storage and non-storage APIs for images and videos. The Google Vision API covers six categories: label detection, text detection, face detection, landmark detection, logo detection, and safe search detection.



