Laptop Vision ML versus Cloud Vision ML

Frederic Molina

Staff- Field Solutions Architect, AI Infrastructure at Google

Published: February 14, 2021

<abstract> The quality of an ML system depends heavily on the quality of its training dataset. Here is a simple demonstration: cloud providers' Vision APIs for object detection tend to return better results than off-the-shelf pre-trained object detection models. </abstract>

Pre-trained object detection models can be a great way to upskill or to prototype solutions, but when the ambition is to move to production at scale, the pre-trained models that cloud providers expose through APIs look like a more complete solution. Before using a pre-trained model, make sure the predictions you expect fall within the space covered by the dataset the model was trained on.

This means that if you want to detect dogs with a pre-trained detection model, make sure it has been trained on plenty of dog samples.

I was testing a course on object detection with TensorFlow that used a model pre-trained on COCO (Common Objects in Context), a dataset with 80 object categories.

While it was interesting to load the pre-trained model and host it in a virtual machine running a Flask app, the model's performance on my chosen samples was barely satisfying, so I decided to compare it with the Google Cloud Vision API using its testing page.
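For readers who want to reproduce the local side of the comparison, here is a minimal sketch of the step that turns raw detector output into readable labels. It assumes the model returns a dictionary with `detection_classes` and `detection_scores` entries, as TensorFlow Object Detection API models usually do once the batch dimension is stripped. The `LABELS` subset, the `readable_detections` helper, the 0.5 threshold, and the sample scores are all illustrative assumptions, not the course's actual code.

```python
# Hypothetical subset of a COCO label map (class id -> name). Use the label
# map shipped with your model instead of this hand-written one.
LABELS = {1: "person", 18: "dog", 65: "bed"}


def readable_detections(output, labels, min_score=0.5):
    """Keep detections scoring at least min_score and attach a readable label."""
    results = []
    for class_id, score in zip(output["detection_classes"],
                               output["detection_scores"]):
        if score < min_score:
            continue  # low-confidence detections are dropped
        name = labels.get(int(class_id), f"unknown({int(class_id)})")
        results.append((name, round(float(score), 2)))
    return results


# Made-up output for one image, shaped like a detector's result dictionary.
fake_output = {
    "detection_classes": [65.0, 1.0, 18.0],
    "detection_scores": [0.91, 0.32, 0.07],
}
print(readable_detections(fake_output, LABELS))
```

With a threshold like this, a person present in the picture but scored at 0.32 would simply not be reported, which is one way a local model can appear to "miss" objects.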

You just have to upload an image file and you get the detected objects, along with other image features such as sensitive categories (if any are detected), dominant colors, sentiment predicted on faces, etc.

Here is how the results compare.

Using the model pre-trained on the COCO dataset:

It detects the bed.

With the Google Cloud Vision API, the bed is also detected, along with the person, the flower, and clothing.

Using other samples

With the model pre-trained on the COCO dataset:

3 persons are detected, which is great.

And we can see here again that the Cloud Vision API pushes it a little bit further.

As we can see, the persons are also detected, along with fashion-themed objects such as the necklace.

(For the reggaeton lovers: yes, this is Bad Bunny, the same one singing Dákiti. Listen to Dákiti on YouTube.)

Continuing with cartoon images

With the model pre-trained on the COCO dataset:

It detects a person and crops the head of the character... Poor Petit Ours Brun, we understand why he looks angry :(

The Google Cloud Vision API, meanwhile, provides more details:

Animal, Footwear, Shoe. OK, but if you're like me, you would also like to know that the image looks like a drawing.

Building a model that distinguishes drawings from photographs is something that can be done easily, as my university pal Thulfakar Hammodi could testify!

In the "Labels" field of the GCP Vision API response, "Cartoon" and "Illustration" are specified.
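As a rough illustration of how that "Labels" information could be used, the sketch below reads a Vision API style JSON response and flags images that look like drawings. The field names (`localizedObjectAnnotations`, `labelAnnotations`, `name`, `description`, `score`) follow the Vision API's REST response format; the sample response, its scores, the `DRAWING_LABELS` set, and both helper functions are assumptions made up for this example.

```python
import json

# Invented response fragment for illustration: the names match the results
# described in the article, the scores are made up.
SAMPLE_RESPONSE = json.loads("""
{
  "localizedObjectAnnotations": [
    {"name": "Animal", "score": 0.80},
    {"name": "Footwear", "score": 0.70},
    {"name": "Shoe", "score": 0.60}
  ],
  "labelAnnotations": [
    {"description": "Cartoon", "score": 0.95},
    {"description": "Illustration", "score": 0.85}
  ]
}
""")

# Hypothetical choice of labels that hint at a drawing rather than a photo.
DRAWING_LABELS = {"cartoon", "illustration", "drawing"}


def detected_objects(response):
    """Return the names of the localized objects, in response order."""
    return [obj["name"] for obj in response.get("localizedObjectAnnotations", [])]


def looks_like_drawing(response, min_score=0.5):
    """True if any sufficiently confident label is in DRAWING_LABELS."""
    for label in response.get("labelAnnotations", []):
        if label["description"].lower() in DRAWING_LABELS and label["score"] >= min_score:
            return True
    return False


print(detected_objects(SAMPLE_RESPONSE))
print(looks_like_drawing(SAMPLE_RESPONSE))
```

Reading the labels this way gives the "this is a drawing" signal without training a separate drawing-versus-photo classifier.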

So what does that mean?

A model pre-trained on 80 categories, using pictures cropped on the detected object, will be useful for detecting objects of those categories in pictures cropped the same way.

The job of a machine learning object detection model is to recognise objects that look like the objects in its training dataset. If we take a look at the COCO dataset, we can understand why my samples were not perfectly covered.
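A quick way to check this before committing to a pre-trained model is to compare the labels you need against the categories it was trained on. The sketch below does that for the objects mentioned in this article. The category names are the 80 COCO detection classes as they are commonly listed, so verify them against the label map of the model you actually use; the `coverage` helper is a hypothetical name introduced for this example.

```python
# The 80 COCO detection categories as commonly listed; check the label map
# shipped with your model, since naming can differ slightly between releases.
COCO_CATEGORIES = {
    "person", "bicycle", "car", "motorcycle", "airplane", "bus", "train",
    "truck", "boat", "traffic light", "fire hydrant", "stop sign",
    "parking meter", "bench", "bird", "cat", "dog", "horse", "sheep", "cow",
    "elephant", "bear", "zebra", "giraffe", "backpack", "umbrella", "handbag",
    "tie", "suitcase", "frisbee", "skis", "snowboard", "sports ball", "kite",
    "baseball bat", "baseball glove", "skateboard", "surfboard",
    "tennis racket", "bottle", "wine glass", "cup", "fork", "knife", "spoon",
    "bowl", "banana", "apple", "sandwich", "orange", "broccoli", "carrot",
    "hot dog", "pizza", "donut", "cake", "chair", "couch", "potted plant",
    "bed", "dining table", "toilet", "tv", "laptop", "mouse", "remote",
    "keyboard", "cell phone", "microwave", "oven", "toaster", "sink",
    "refrigerator", "book", "clock", "vase", "scissors", "teddy bear",
    "hair drier", "toothbrush",
}


def coverage(wanted, known=COCO_CATEGORIES):
    """Split the wanted labels into those the model knows and those it does not."""
    wanted = [w.lower() for w in wanted]
    covered = [w for w in wanted if w in known]
    missing = [w for w in wanted if w not in known]
    return covered, missing


covered, missing = coverage(["person", "bed", "flower", "clothing", "necklace"])
print("covered:", covered)
print("missing:", missing)
```

The person and the bed are covered, while the flower, the clothing, and the necklace are not among the training categories, which matches what the local model returned on my samples.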

The cloud providers' Vision object detection models are trained on massively more data and categories, which is why they provide more detailed answers.

In conclusion, this demonstrates that a machine learning system is only as good as the curation of its training dataset. The better the quality of the data you use to train your model, the better the machine learning system you will get.

Have a nice day!

TensorFlow Object Detection API on GitHub

Image sources:

Flower as a skirt: https://www.mediafactory.org.au/gloria-tanuseputra/2016/03/21/photography/
Bad Bunny: https://www.vanityfair.com/style/2019/08/week-in-fashion-bad-bunny-vmas
Petit Ours Brun est grognon, by Marie Aubiniais (author) and Danièle Bour (illustrations), Bayard Jeunesse

