EXCITING SESSIONS FROM NVIDIA GTC FALL 2021

Frank Denneman

Distinguished Engineer | Chief Technologist for AI | AI and Advanced Services | VMware Cloud Foundation Division | Broadcom

Published: December 9, 2021

Over the last few weeks, I watched many sessions from the Fall edition of NVIDIA GTC. I created a list of interesting sessions for a group of people internally at VMware, but I thought the list might interest some people outside VMware as well. It’s primarily focused on understanding NVIDIA’s product and services suite, not on deep diving into technology or geeking out on core counts and speeds and feeds. If you found exciting sessions that I haven’t listed, please leave them in the comments below.

Data Science

Accelerating Data Science: State of RAPIDS [A31490]

Reason to watch: A 55-minute overview of the state of RAPIDS (the open-source framework for data science), upcoming features, and improvements.

Inference

Please note: Triton is part of the NVIDIA AI Enterprise stack (NVAIE)

How Hugging Face Delivers 1 Millisecond Inference Latency for Transformers in Infinity [A31172]

Reason to watch: A 50-minute session. Hugging Face is the dominant player for NLP Transformers and is pushing the open platform / democratizing ML message.

Scalable, Accelerated Hardware-agnostic ML Inference with NVIDIA Triton and Arm NN [A31177]

Reason to watch: A 50-minute session covering the Arm NN architecture and deploying models on far-edge technology (Jetson / Pis) using NVIDIA Triton Inference Server.

Deploy AI Models at Scale Using the Triton Inference Server and ONNX Runtime and Maximize Performance with TensorRT [A31405]

Reason to watch: A 50-minute overview of the Triton architecture, features, customer case studies, and ONNX Runtime integration. It covers ONNX Runtime (ONNX RT), which provides optimizations for the target inference platforms.

NVIDIA Triton Inference Server on AWS: Customer success stories and AWS deployment methods to optimize inference throughput, reduce latency, and lower GPU or CPU inference costs. [SE31488]

Reason to watch: A 45-minute session covering Triton on AWS SageMaker, with two customers sharing their deployment overviews and lessons learned.

No-Code Approach

NVIDIA TAO: Create Custom Production-Ready AI Models without AI Expertise [D31030]

Reason to watch: A 3-minute overview of TAO, a model adaptation framework that can fine-tune pre-trained models by feeding them proprietary (smaller) datasets and optimizing them for the inference hardware architecture.

AI Life-cycle Management for the Intelligent Edge [A31160]

Reason to watch: A 50-minute session that covers NVIDIA’s approach to transfer learning. NVIDIA provides a pre-trained model, customers optimize the model, and NVIDIA TAO assists with subsequent optimization for inference deployment. Fleet Command orchestrates the deployment of the model at the edge.

A Zero-code Approach to Creating Production-ready AI Models [A31176]

Reason to watch: A 35-minute session that explores TAO in more detail and provides a demo of the TAO (GUI-based) solution.

NVIDIA LaunchPad

Simplifying Enterprise AI from Develop to Deploy with NVIDIA LaunchPad [A31455]

Reason to watch: A 30-minute overview of the NVIDIA Cloud AI platform delivered through their partnership with Equinix (rapid testing and prototyping of AI).

How to Quickly Pilot and Scale Smart Infrastructure Solutions with LaunchPad and Metropolis [A31622]

Reason to watch: A 30-minute overview of how to use Metropolis (a computer vision AI application framework) and LaunchPad to accelerate POCs (zero-touch testing and system sizing). The session covers Metropolis Certification for system design (timestamp 11:30) and bare-metal access.

NVIDIA and Cloud Integration

From NGC to MLOps with NVIDIA GPUs and Vertex AI Workbench on Google Cloud (Presented by Google Cloud) [A31680]

Reason to watch: A 40-minute overview of NGC integration with Google Vertex AI (Google’s end-to-end ML/AI platform).

Automate Your Operations with Edge AI (Presented by Microsoft Azure) [A31707]

Reason to watch: A 15-minute overview of Azure Percept. Azure Percept is an edge AI solution for IoT devices, now available on Azure Stack HCI.

Fast Provisioning of Kubernetes Clusters for the AI/ML Developer on VMware: Technical Details (Presented by VMware) [A31659]

Reason to watch: A 45-minute technical overview of the NVIDIA and VMware partnership demonstrating the key elements of the NVIDIA AI-Ready Enterprise Platform in detail.

NVIDIA EGX

Exploring Cloud-native Edge AI [A31166]

Reason to watch: A 50-minute overview of NVIDIA’s cloud-native platform (a Kubernetes-based edge AI platform).

Retail

The One Retail AI Use Case That Stands Out Above the Rest [A31548]

Reason to watch: A 55-minute session providing great insights into a real use case of Everseen technology deployed at Kroger.

One of the World’s Top Retailers is Bringing AI-Powered Convenience to a Store Near You [A31359]

Reason to watch: A 25-minute session providing insights into the challenges of deploying and using autonomous store technology.

DPU

Real-time AI Processing at the Edge [A31164]

Reason to watch: A 40-minute session covering the GPU+DPU converged accelerators (A30X and A100X), their architecture, and their DOCA (Data Center Infrastructure-on-a-Chip Architecture) and CUDA architecture and programming environment.

NVIDIA AI Enterprise with VMware vSphere: Combining NVIDIA GPU’s Superior Performance, NVIDIA AI Software, and Virtualization Benefits for AI Workflows (Presented by VMware, Inc.) [A31694]

Reason to watch: A 50-minute session offering an excellent explanation of the different vGPU modes (native vs. MIG mode). Starting at the 37-minute mark, the presenters dive into the use of network function virtualization on SmartNICs.

Programming the Data Center of the Future Today with the New NVIDIA DOCA Release [A31069]

Reason to watch: A 40-minute session covering the DOCA architecture in detail.

Developer / Engineer Type Sessions

CUDA New Features and Beyond [A31399]

Reason to watch: A 50-minute overview of what’s new in the CUDA toolkit.

Developing Versatile and Efficient Cloud-native Services with Deepstream and Triton Inference Server [A31202]

Reason to watch: A 50-minute deep-dive session on end-to-end pipelines for vision-based AI.

Accelerating the Development of Next-Generation AI Applications with DeepStream 6.0 [A31185]

Reason to watch: A 50-minute overview of the DeepStream solution. DeepStream helps to develop and deploy vision-based AI.

End-to-end Extremely Parallelized Multi-agent Reinforcement Learning on a GPU [A31051]

Reason to watch: A 40-minute deep-dive session on how Salesforce built a framework to drastically reduce CPU-GPU communication.
