EXCITING SESSIONS FROM NVIDIA GTC FALL 2021
Frank Denneman
Distinguished Engineer | Chief Technologist for AI | AI and Advanced Services | VMware Cloud Foundation Division | Broadcom
Over the last few weeks, I watched many sessions of the NVIDIA Fall version of GTC. I created a list of interesting sessions for a group of people internally at VMware, but I thought the list might interest some outside VMware. It’s primarily focused on understanding NVIDIA’s product and services suite and not necessarily deep diving into technology or geeking out on core counts and speeds and feeds. If you found exciting sessions that I haven’t listed, please leave them in the comments below.
Data Science
Reason to watch:?A 55-minute overview of the state of RAPIDS (the OS framework for data science), upcoming features, and improvements
Inference
Please note: Triton is part of the NVIDIA AI Enterprise stack (NVAIE)
Reason to watch:?A 50-minute session. Hugging Face is the dominant play for NLP Transformers. Hugging Face is pushing the open platform \ democratizing ML message.
Reason to watch:?A?50-minute session covering ARM NN architecture and deploying models on far edge technology (Jetson\Pi’s) using NVIDIA Triton Inference Server
Reason to watch:?50 minutes overview of Triton architecture, features, customer case studies, and onyx runtime integration. It covers ONNX RT, which provides optimization for target platforms (inference).
Reason to watch:?45 minutes session covering Triton on AWS SageMaker and two customers sharing their deployment overview and their lessons learned.
No-Code Approach
Reason to watch:??A 3-minute overview of TAO, an model adaptation framework that can fine-tune pre-trained models by feeding proprietary (smaller) datasets and optimizing it for the inference hardware architecture.
Reason to watch:?A?50-minute session that covers NVIDIAs approach of Transfer Learning. NVIDIA provides a Pre-trained model, while customers optimize the model, NVIDIA TAO assists with future optimization for inference deployment. Fleet command to orchestrate the deployment of the model at the edge.
Reason to watch:?A 35-minute session that explores TAO in more detail and provides a demo of the TAO (GUI-based) solution.
NVIDIA LaunchPad
Reason to watch:?A 30-minute overview of the NVIDIA Cloud AI platform delivered through their partnership with Equinix. (Rapid testing and prototyping AI)
How to Quickly Pilot and Scale Smart Infrastructure Solutions with LaunchPad and Metropolis [A31622]
Reason to watch:?A 30-minute overview of how to use Metropolis (Computer Vision AI Application Framework) and LaunchPad to accelerate POCs. (Zero Touch Testing and System Sizing). The session covers Metropolis Certification for system design (TS: 11:30) and the bare metal access.
NVIDIA and Cloud Integration
Reason to watch:?A 40-minutes overview of NCG integration with Google Vertex AI (Google End-to-End ML AI Platform)
领英推荐
Reason to watch:?A 15-minutes overview of Azure Percept. Azure Percept is an edge AI solution for IoT devices, now available on the Azure HCI stack.
Reason to watch:?A 45-minute technical overview of the NVIDIA and VMware partnership demonstrating the key elements of the NVIDIA AI-Ready Enterprise Platform in detail.
NVIDIA EGX
Reason to watch:?A 50-minute overview of NVIDIA’s cloud-native platform (Kubernetes based, edge AI platform)
Retail
Reason to watch:?A 55-minute session providing great insights into real use-case of Everseen technology deployed at Kroger
Reason to watch:?A 25-minute session providing insights into the challenges of deploying and using autonomous store technology.
DPU
Reason to watch:?A 40-minute session covering the GPU+DPU converged accelerators (A40X and A100X), their architecture and their DOCA (Data Center on a Chip Architecture) and CUDA architecture and programming environment.
Reason to watch:?A 50-minute session offering an excellent explanation of the different vGPU modes (native vs MIG mode). Starting from the 37 minute time stamp the presenters dive into the use of Network Function virtualization on smartNICs.
Reason to watch:?A 40-minute session covering DOCA architecture in detail.?
Developer / Engineer Type?Sessions
Reason to watch:?A 50-minutes overview of what’s new in the CUDA toolkit.
Reason to watch:?A 50-minutes deep dive session on the end-to-end pipelines for vision-based AI
Reason to Watch:?A 50-minutes overview of DeepStream solution. DeepStream helps to develop and deploy vision-based AI
Reason to watch:?A 40-minute deep dive session on how Salesforce worked on a framework to drastically reduce?CPU-GPU communication