A Primer on Nvidia-Docker — Where Containers Meet GPUs
GPUs are critical for training deep learning models and neural networks. While they may not be needed for simple models based on linear regression or logistic regression, complex models built around convolutional neural networks (CNNs) and recurrent neural networks (RNNs) rely heavily on GPUs. Computer vision models based on frameworks such as Caffe2 and TensorFlow, in particular, depend on the GPU.
In supervised machine learning, a set of features and labels is used to train a model. Deep learning algorithms don’t even need explicitly engineered features to produce trained models. They essentially “learn” from existing datasets designated for training, testing, and evaluation.
Neural networks perform complex computations on tens of thousands of matrices before the final model emerges. When an image is fed to a CNN, it gets translated into a matrix of real numbers. Depending on the color depth and size of the image, the neural network generates multiple such matrices. These matrices are added to and multiplied with other matrices during forward propagation and backward propagation until appropriate weights are derived.
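To make that matrix arithmetic concrete, here is a minimal sketch of one forward-propagation step through a single dense layer in NumPy; the image contents, layer width, and weight values are illustrative stand-ins, not parameters from a real model:

```python
import numpy as np

# A 28x28 grayscale image becomes a matrix of real numbers;
# flatten it into a 784-element input vector.
image = np.random.rand(28, 28)        # stand-in for real pixel data
x = image.reshape(-1)                 # shape: (784,)

# One dense layer of a toy network: the weights and biases here
# are random stand-ins, not a trained model.
W = np.random.randn(128, 784) * 0.01  # weight matrix
b = np.zeros(128)                     # bias vector

# Forward propagation: a matrix-vector product plus a nonlinearity.
hidden = np.maximum(0, W @ x + b)     # ReLU activation, shape: (128,)

# During training, backward propagation computes gradients of a loss
# with respect to W and b, and the weights are updated repeatedly
# until they converge. GPUs accelerate exactly these matrix operations.
```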
A trained model can run on CPUs for inference. Since inference is not as computationally intensive as training, GPUs are strictly optional when running models for inference.
CPUs are not designed to deal with such a rapid rate of computation. While an individual CPU core is faster at sequential number crunching, CPUs are not built to parallelize mathematical operations. That’s where GPUs play a crucial role: each GPU core may not have the horsepower of a CPU core, but thousands of them working together can perform massively parallel calculations.
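As a rough illustration of that difference, here is a minimal sketch that runs the same matrix multiplication on the CPU with NumPy and on the GPU with the CuPy library; it assumes a CUDA-capable GPU with CuPy installed, and the matrix size is an arbitrary choice:

```python
import time
import numpy as np
import cupy as cp  # assumes a CUDA-capable GPU and CuPy installed

n = 4096
a_cpu = np.random.rand(n, n).astype(np.float32)
b_cpu = np.random.rand(n, n).astype(np.float32)

# CPU: multiply the matrices on the host.
start = time.time()
np.matmul(a_cpu, b_cpu)
print(f"CPU: {time.time() - start:.3f}s")

# GPU: copy the matrices to device memory and multiply them there.
a_gpu = cp.asarray(a_cpu)
b_gpu = cp.asarray(b_cpu)
start = time.time()
cp.matmul(a_gpu, b_gpu)
cp.cuda.Device(0).synchronize()  # wait for the asynchronous GPU kernel to finish
print(f"GPU: {time.time() - start:.3f}s")
```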
Read the entire article at The New Stack
Janakiram MSV is an analyst, advisor, and architect. Follow him on Twitter, Facebook and LinkedIn.