A radical new technique lets AI learn with practically no data
Source: MS TECH / PIXABAY


Machine learning typically requires a lot of training data. If you want an AI model to recognise a cat, you need to show it hundreds, if not thousands, of images of different cats.

This process is expensive and time-consuming.

The human brain works very differently: it usually needs only a few examples of an object before it can recognise that object for life.

Sometimes children don’t need any examples to identify something! When shown photos of a horse and a rhino, then told that a unicorn is something in between, children can recognise the mythical creature in a picture book the first time they see it. 

This remarkable learning pattern is the holy grail for machine learning.

Rhino and unicorn illustration. Source: MS TECH / PIXABAY

A new paper from the University of Waterloo in Ontario suggests that AI models should be able to do something similar: accurately recognise more objects than the number of examples they were trained on.

They call this process “less than one-shot” learning, and it could be a game changer for machine learning.

How “less than one-shot” learning works

The essence of “less than one-shot” learning is to take giant data sets and compress them into much smaller ones.

In the study, the team took the MNIST database, which contains 60,000 training images of handwritten digits from 0 to 9, and compressed it down to just 10 carefully chosen images.

Sample images from the MNIST dataset. Source: WIKIMEDIA

The 10 images were chosen so that, together, they contained roughly as much information as the full set. When a model was trained on just these 10 images instead of the full 60,000, it achieved nearly the same accuracy as a model trained on the complete data set!
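To see what training on a compressed data set looks like in practice, here is a toy Python sketch. It is not the paper's method: it uses scikit-learn's small bundled digits data set rather than MNIST, and it uses one class-mean image per digit as the 10 “compressed” examples rather than the carefully optimised images from the study, so the accuracy gap will be larger than the one the researchers report.

# Toy sketch: compare a nearest-neighbour classifier trained on the full
# digits data set against one trained on just 10 class-mean "prototype" images.
import numpy as np
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_digits(return_X_y=True)  # 8x8 handwritten digits, classes 0-9
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y)

# Baseline: train on the full training set.
full_model = KNeighborsClassifier(n_neighbors=1).fit(X_train, y_train)
print("full-set accuracy:", full_model.score(X_test, y_test))

# Compressed: one image per class, the mean of all training images of that digit.
prototypes = np.stack([X_train[y_train == d].mean(axis=0) for d in range(10)])
prototype_labels = np.arange(10)
tiny_model = KNeighborsClassifier(n_neighbors=1).fit(prototypes, prototype_labels)
print("10-image accuracy:", tiny_model.score(X_test, y_test))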

How could this happen?

The researchers realised that the trick was to blend digits together and give the blended images “soft labels,” drawing on the idea of the horse and rhino sharing the features of a unicorn.

“If you think about the digit 3, it kind of also looks like the digit 8 but nothing like the digit 7,” says Ilia Sucholutsky, a PhD student at Waterloo and lead author of the paper. 

“Soft labels try to capture these shared features. So instead of telling the machine, ‘This image is the digit 3,’ we say, ‘This image is 60% the digit 3, 30% the digit 8, and 10% the digit 0.’”
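To show how soft labels can let a model separate more classes than it has training examples, here is a minimal sketch of a soft-label nearest-neighbour rule in one dimension. The two training points, their label mixtures, and the inverse-distance weighting are illustrative assumptions rather than values from the paper.

# Two training points, three classes: each point carries a probability
# distribution ("soft label") over classes A, B and C.
import numpy as np

train_x = np.array([0.0, 1.0])
soft_labels = np.array([
    [0.6, 0.4, 0.0],   # point at 0.0: mostly class A, partly class B
    [0.0, 0.4, 0.6],   # point at 1.0: mostly class C, partly class B
])

def predict(query):
    # Weight each training point's soft label by inverse distance
    # and return the class with the highest combined score.
    dists = np.abs(train_x - query) + 1e-9   # avoid division by zero
    weights = 1.0 / dists
    scores = weights @ soft_labels
    return "ABC"[int(np.argmax(scores))]

for q in [0.0, 0.5, 1.0]:
    print(q, "->", predict(q))   # 0.0 -> A, 0.5 -> B, 1.0 -> C

Even though class B never appears as a “hard” label, it wins in the middle of the line, so two examples carve out three decision regions.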

Is this the holy grail of machine learning?

What the researchers demonstrate in their paper is a purely mathematical exploration that applies only to classification problems. However, it is a promising step forward; much more work is needed before the idea extends to more complicated kinds of learning.

Using the human brain as a model for learning is an interesting approach, given the huge capacity we have for learning quickly.

While this approach is not yet ready for use in most corporate scenarios, it could trigger the next wave of machine learning development. The jury is still out.


This article was inspired by:


Comments

Wahab Ahmad

CEO | Serial Entrepreneur | Web3 Gaming | P2E Games | Web3 Blockchain | Unreal Engine | Roblox | Fortnite | Vision Pro | Unity 3D | Metaverse | AR & VR | Generative AI Web & Mobile Apps | Cloud Security | Devops

4y

Totally agreed

Neil Hodgson

CTO @ GAN Integrity | People | Data | Product | Outcomes

4y

Agree it's promising, however... > The 10 images selected were carefully chosen to try to contain an equivalent amount of information to the full set. Which means we still need humans to understand how to 'carefully choose' training examples. It shows it's possible though.
