NVIDIA: Cosmos World Foundation Model Platform for Physical AI
https://ibl.ai

Overview

NVIDIA's Cosmos World Foundation Model (WFM) platform targets Physical AI. Cosmos follows a pre-training and post-training paradigm: both diffusion and autoregressive models are pre-trained on a large, curated video dataset (about 20M hours of video) to produce generalist WFMs.

The generalist models are then fine-tuned (post-trained) for specialized Physical AI tasks such as robotic manipulation and autonomous driving. The platform also includes a video tokenizer for efficient processing and a guardrail system for safety.

The report presents results that it describes as state of the art across a range of benchmarks and applications.


Podcast

Apple: https://podcasts.apple.com/us/podcast/ibl-ai/id1771542512

Spotify: https://open.spotify.com/show/65ngdXwe3EK8DjahjB26JA

YouTube: https://www.youtube.com/playlist?list=PLW0-4yErlU3XBy7xrvVk3LKmfWAclpWRQ

Full Report:

https://d1qx31qr3h6wln.cloudfront.net/publications/NVIDIA%20Cosmos_3.pdf

