How to perform automated data augmentation

Every day we see machine learning gain more and more prominence across economic, social, and even cultural sectors. One consequence is that our models not only become more robust and sophisticated but also require more and better training data. Collecting relevant data has created a vast market that many of the most important companies of the last century have leveraged, but that does not mean it is an easy task: in many cases the amount of information available is simply not enough. In this article, we discuss some of the ways this problem can be solved.

Why Data Augmentation?

Modern machine learning models, such as deep neural networks, may have billions of parameters and require massive labeled training datasets that are often unavailable. Data augmentation, the technique of artificially expanding labeled training datasets, has quickly become critical for combating this data scarcity. Today it is the secret sauce in nearly every state-of-the-art model for image classification, and it is becoming increasingly common in other modalities such as natural language understanding.
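To ground this, a typical hand-designed image augmentation pipeline might look like the following minimal sketch (assuming PyTorch's torchvision is available; the specific transforms and parameter values are illustrative choices, not a recommended recipe):

```python
import torchvision.transforms as T

# A hand-designed augmentation pipeline: every transform and parameter
# (rotation range, shift fraction, flip probability) is a human-chosen
# heuristic rather than something learned from data.
augment = T.Compose([
    T.RandomRotation(degrees=5),                        # small rotations
    T.RandomAffine(degrees=0, translate=(0.05, 0.05)),  # small pixel shifts
    T.RandomHorizontalFlip(p=0.5),
    T.ToTensor(),
])

# Applying `augment` repeatedly to the same PIL image yields different,
# label-preserving training examples, artificially expanding the dataset.
```

Automated data augmentation asks whether these choices can be learned instead of hand-tuned, which is the subject of the rest of this article.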

Practical Methods of Learnable Data Augmentations

Learnable data augmentation is promising in that it allows us to search for more powerful parameterizations and compositions of transformations. Perhaps the biggest difficulty in automating data augmentation is searching over the space of transformations, which can be prohibitive given the large number of transformation functions and associated parameters. How can we design learnable algorithms that explore the space of transformation functions efficiently and effectively, and find augmentation strategies that outperform human-designed heuristics? In response to this challenge, we highlight a few recent methods below.
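To make the search space concrete, here is a minimal sketch of how a space of parameterized transformation functions and a random policy sampler might be represented (plain Python with Pillow; the TF names, parameter ranges, and the uniform sampler are illustrative assumptions, and a learnable method would replace the sampler with a trained model):

```python
import random
from PIL import Image

# Each transformation function (TF) is a parameterized operation plus the
# range its parameter is searched over. Real search spaces contain many
# more TFs, which is what makes naive search prohibitive.
TF_SPACE = {
    "rotate":    {"apply": lambda img, v: img.rotate(v), "range": (-15, 15)},
    "translate": {"apply": lambda img, v: img.transform(
                      img.size, Image.AFFINE, (1, 0, v, 0, 1, 0)),
                  "range": (-4, 4)},
}

def sample_policy(length=2):
    """Sample one candidate augmentation: a sequence of (TF, parameter)
    pairs. A learnable method replaces this uniform sampling with a
    trained generator."""
    policy = []
    for _ in range(length):
        name = random.choice(list(TF_SPACE))
        lo, hi = TF_SPACE[name]["range"]
        policy.append((name, random.uniform(lo, hi)))
    return policy

def apply_policy(img, policy):
    for name, value in policy:
        img = TF_SPACE[name]["apply"](img, value)
    return img
```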

Transformation Adversarial Networks for Data Augmentations (TANDA)

To address this problem, TANDA proposes a framework that models data augmentations as sequences of Transformation Functions (TFs) provided by users; for example, these might include "rotate 5 degrees" or "shift by 2 pixels". At its core, the framework consists of two components: first, learning a TF sequence generator that produces useful augmented data points, and second, using that generator to augment training sets for a downstream model. In particular, the TF sequence generator is trained to produce realistic images by having to fool a discriminator network, following the GAN framework. The underlying assumption is that the transformations either lead to realistic images or to indistinguishable garbage images that are off the manifold. The generator's objective is to produce sequences of TFs such that the augmented data point can fool the discriminator, whereas the discriminator's objective is to produce values close to 1 for data points in the original training set and values close to 0 for augmented data points.
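The following sketch shows the shape of that training loop in PyTorch. It is a simplification under assumed interfaces (`seq_generator.sample` returning augmented images together with the log-probability of the sampled TF sequence is our own invention, not TANDA's actual API), and because TF sequences are discrete, the generator update is written as a REINFORCE-style policy gradient rather than direct backpropagation:

```python
import torch
import torch.nn.functional as F

def tanda_step(seq_generator, discriminator, real_batch, opt_g, opt_d):
    """One simplified TANDA-style step. `seq_generator.sample` is an
    assumed interface: it samples a TF sequence, applies it to the
    batch, and returns (augmented_images, log_prob_of_sequence)."""
    augmented, log_prob = seq_generator.sample(real_batch)
    ones = torch.ones(real_batch.size(0), 1)
    zeros = torch.zeros(real_batch.size(0), 1)

    # Discriminator: values close to 1 for original data points,
    # close to 0 for augmented ones.
    d_loss = (F.binary_cross_entropy(discriminator(real_batch), ones) +
              F.binary_cross_entropy(discriminator(augmented.detach()), zeros))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Generator: reward TF sequences whose outputs fool the
    # discriminator (policy gradient, since sequences are discrete).
    with torch.no_grad():
        reward = torch.log(discriminator(augmented) + 1e-8).mean()
    g_loss = -(log_prob.mean() * reward)
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```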

Data Augmentations for Model Patching

Most machine learning research today still solves fixed tasks. In the real world, however, deployed machine learning models can fail due to unanticipated changes in data distribution. This raises the pressing question of how we can move from model building to model maintenance in an adaptive manner. In our latest work, we propose model patching, the first framework that exploits data augmentation to mitigate the performance issues of a flawed model in deployment.

Class-conditional Learned Augmentations for Model Patching (CLAMP)

The conceptual framework of model patching consists of two stages (a minimal sketch follows the list):

  • Learn inter-subgroup transformations between different subgroups. These transformations are class-preserving maps that allow semantically changing a datapoint's subgroup identity (for example, adding or removing colorful bandages).
  • Retrain to patch the model with augmented data, encouraging the classifier to be robust to these variations.
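Under assumed interfaces, the two stages compose roughly as in this sketch (`subgroup_transforms`, holding the learned stage-1 maps, and `retrain_fn`, standing in for the stage-2 robust training loop, are both hypothetical names):

```python
def model_patch(model, dataset, subgroup_transforms, retrain_fn):
    """Stage 1 output: subgroup_transforms[(src, dst)] is a learned,
    class-preserving map from subgroup `src` to subgroup `dst`.
    Stage 2: retrain the classifier on original plus augmented data."""
    augmented = []
    for x, label, subgroup in dataset:
        for (src, dst), transform in subgroup_transforms.items():
            if src == subgroup:
                # e.g. add or remove colorful bandages while keeping
                # the class label fixed.
                augmented.append((transform(x), label, dst))
    return retrain_fn(model, list(dataset) + augmented)
```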

We propose CLAMP, an instantiation of our first end-to-end model patching framework. It combines a novel consistency regularizer with a robust training objective inspired by recent work on Group Distributionally Robust Optimization (GDRO), extending GDRO to a class-conditional training objective that jointly optimizes for the worst-subgroup performance within each class. CLAMP is able to balance the performance of subgroups within each class, reducing the performance gap by up to 24x. On the ISIC skin cancer detection dataset, CLAMP improves robust accuracy by 11.7% compared to the robust training baseline. Through visualization, we also show that CLAMP successfully removes the model's reliance on the spurious feature (colorful bandages), shifting its attention to the true feature of interest, the skin lesion.
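As an illustration of that class-conditional objective, the following sketch computes the worst-subgroup loss within each class and averages across classes (the function name and grouping logic are our own minimal rendering, not the paper's code):

```python
import torch

def class_conditional_worst_subgroup_loss(per_example_loss, labels, subgroups):
    """For each class, average the loss within each of its subgroups,
    take the worst (maximum) subgroup loss, then average over classes.
    Optimizing this pushes the model to close per-class subgroup gaps."""
    class_losses = []
    for c in labels.unique():
        in_class = labels == c
        subgroup_losses = [
            per_example_loss[in_class & (subgroups == g)].mean()
            for g in subgroups[in_class].unique()
        ]
        class_losses.append(torch.stack(subgroup_losses).max())
    return torch.stack(class_losses).mean()
```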

Bibliography

Automating Data Augmentation: Practice, Theory and New Direction | SAIL Blog (stanford.edu)
