NVIDIA Mixed Precision - Loss & Accuracy - Part 2
Andrew Antonopoulos
Senior Solutions Architect at Sony Professional Solutions Europe
Part 1 explained how NVIDIA's mixed precision can help reduce power consumption. However, we also need to consider accuracy and loss, the two most well-known and discussed metrics in machine learning.
Accuracy is a method for measuring a classification model's performance. It is the fraction of predictions in which the predicted value equals the true value. Accuracy is often graphed and monitored during the training phase, though the reported value usually refers to the overall or final model accuracy.
A loss function, also known as a cost function, takes into account the probabilities or uncertainty of a prediction based on how much it deviates from the true value.
Loss is a summation of the errors made for each sample in the training or validation set. The goal during the training process is to minimise this value. Unlike accuracy, loss may be used in both classification and regression problems.
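To make the distinction concrete, here is a minimal sketch with made-up numbers for a small 3-class batch, using categorical cross-entropy as the loss (a common choice for classification, assumed here for illustration): accuracy only checks whether the top prediction is correct, while the loss also reflects how confident each prediction is.

```python
import numpy as np

# Made-up labels and predicted probabilities for a 3-class problem.
y_true = np.array([0, 2, 1, 2])          # true class labels
y_prob = np.array([                      # predicted class probabilities
    [0.8, 0.1, 0.1],                     # confident and correct
    [0.5, 0.1, 0.4],                     # wrong: argmax is class 0
    [0.3, 0.5, 0.2],                     # correct, but less confident
    [0.1, 0.2, 0.7],                     # correct
])

# Accuracy: fraction of samples whose most probable class matches
# the true label.
accuracy = (y_prob.argmax(axis=1) == y_true).mean()

# Loss: mean categorical cross-entropy, i.e. -log of the probability
# the model assigned to the true class, so confident mistakes cost more.
loss = -np.log(y_prob[np.arange(len(y_true)), y_true]).mean()

print(f"accuracy={accuracy:.3f}  loss={loss:.3f}")  # accuracy=0.750  loss=0.547
```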
Most of the time, accuracy increases as loss decreases, but this is not always the case: accuracy and loss have different definitions and measure different things.
A new test was performed using the following hyper-parameters for the benchmarking:
Benchmarking
After the benchmarking model completed training, the accuracy graph was the following:
and the loss graph:
Additionally, the following image presents the same results for each epoch (up to the 25th):
The same dataset and hyper-parameters were used for the experiment, but the main difference between the two tests is the usage of mixed precision.
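The article does not include the training code, but assuming a TensorFlow/Keras setup (the model below is a placeholder, not the benchmarked network), enabling mixed precision is typically a one-line global policy change made before the model is built:

```python
import tensorflow as tf

# Compute in float16 where safe, keep variables in float32 for
# numerical stability; this is what distinguishes the two tests.
tf.keras.mixed_precision.set_global_policy("mixed_float16")

# Layers created after the policy is set pick it up automatically.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation="relu", input_shape=(784,)),
    # Keep the final softmax in float32 so the output probabilities
    # are not distorted by float16 rounding.
    tf.keras.layers.Dense(10, activation="softmax", dtype="float32"),
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```

With Keras's built-in optimisers, loss scaling (needed to keep small float16 gradients from underflowing) is applied automatically under this policy.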
Experiment
For the experiment, the accuracy graph was the following:
and the loss graph:
The detailed accuracy and loss per epoch were the following:
Additionally, when we fit an ML model with a validation split, the data is divided into two parts for every epoch: training data and validation data. The model is trained on the training data and validated on the validation data, reporting loss and accuracy for the training split, and validation loss and validation accuracy for the validation split.
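As a sketch of that behaviour, assuming Keras's validation_split argument with synthetic stand-in data (the article's real dataset and model are not shown), the fit history exposes exactly those four per-epoch metrics:

```python
import numpy as np
import tensorflow as tf

# Synthetic stand-in data; the article's actual dataset is not shown.
x_train = np.random.rand(1000, 784).astype("float32")
y_train = np.random.randint(0, 10, size=(1000,))

model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation="relu", input_shape=(784,)),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# validation_split=0.2 holds out 20% of the data: every epoch reports
# loss/accuracy on the training split and val_loss/val_accuracy on the
# held-out split, i.e. the four curves compared in the tables above.
history = model.fit(x_train, y_train, epochs=5, batch_size=64,
                    validation_split=0.2, verbose=0)

print(history.history["loss"][-1], history.history["accuracy"][-1])
print(history.history["val_loss"][-1], history.history["val_accuracy"][-1])
```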
By comparing the results for the training data, we can see that the loss is slightly better when using mixed precision (0.094318 for mixed precision and 0.094806 for 32-bit floating point). Conversely, accuracy is slightly better when using a 32-bit floating point (0.996538 for 32-bit floating point and 0.996030 for mixed precision).
However, with the validation data, both the validation loss and validation accuracy are better when using 32-bit floating point.
The learning rate (the last column in the above tables) directly influences model convergence, stability, and overall performance metrics such as accuracy and loss. In both cases, the same learning rate scheduler was used. At the beginning of every epoch, the callback gets the updated learning rate value from the schedule function, given the current epoch and current learning rate, and applies the updated rate to the optimiser.
To achieve maximum performance, "learning rate decay" was used, which means decreasing the learning rate as training progresses. In this way, we get a faster learning algorithm without the risk of it failing to converge to a minimum loss value, as shown in the sketch below.
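A minimal sketch, assuming Keras's LearningRateScheduler callback and an exponential decay rule (the article does not give the exact schedule or rates used; the data and model are toys just to drive the callback):

```python
import numpy as np
import tensorflow as tf

# Assumed decay rule: hold the initial rate for the first epochs,
# then decay it exponentially.
def schedule(epoch, lr):
    return lr if epoch < 3 else lr * float(tf.math.exp(-0.1))

# At the beginning of every epoch, Keras calls schedule(epoch, lr)
# with the current epoch and learning rate, and applies the returned
# value to the optimiser, which is the behaviour described above.
lr_callback = tf.keras.callbacks.LearningRateScheduler(schedule, verbose=1)

# Toy data and model.
x = np.random.rand(256, 20).astype("float32")
y = np.random.randint(0, 2, size=(256,))
model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu", input_shape=(20,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer=tf.keras.optimizers.Adam(1e-3),
              loss="binary_crossentropy")

history = model.fit(x, y, epochs=6, callbacks=[lr_callback], verbose=0)

# The applied rate is recorded per epoch, matching the learning-rate
# column in the tables above.
print(history.history["lr"])
```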