The Most Important Part of Sora
Image generated by DALL·E 3

The real substance of OpenAI’s Sora release is this paragraph in their research write-up.

“Emerging simulation capabilities

We find that video models exhibit a number of interesting emergent capabilities when trained at scale. These capabilities enable Sora to simulate some aspects of people, animals and environments from the physical world. These properties emerge without any explicit inductive biases for 3D, objects, etc.—they are purely phenomena of scale.”

To clarify, this means three things:

  1. The bigger the scale, the more emergent phenomena occur. In other words, throwing more data and compute at training yields more unexpected, higher-quality capabilities.
  2. Multimodal training is effective at creating better-quality models with increased capabilities. A new paper out of Microsoft found a similar phenomenon.
  3. Among the most important emergent capabilities is that the models are gaining a deep understanding of real-world objects, 3D space, and realistic physics.

A human-like multimodal world model paired with an effective action model is a huge leap toward a usable AI agent. It allows for far more than digital workflows, since embodied AI is only a small hardware step away.

Tesla may be ahead of everyone

With Tesla’s recent move to an end-to-end neural network for its Full Self-Driving (FSD) models, I suspect the company found a few years ago, when it switched FSD to vision only, that the emergent world-model phenomenon was the most effective path forward. Its cars are just the first step.

Embodying that model in its Optimus robots is where Tesla sees its largest opportunity. In other words, the lack of a robust world model is what keeps everyone from having a robot in their house to do the dishes. With hundreds of millions of miles of high-quality video, sensor data, and its custom Dojo chips, Tesla has a hell of a moat.

Tesla may actually be ahead of everyone in the quality of its world and action models. That would help explain Elon’s recent demand to double his Tesla stake, as well as the company’s confidence in the Optimus timeline.

No limits to the emergence

No one has yet found a limit at which AI quality and capability taper off with more scale. That means more high-quality data and more chips. A lot more chips. Altman isn’t joking about raising $7 trillion for chip manufacturing.

With better world models emerging through more data, compute, and multimodal training inputs, scale is the game. The prize is a huge portion of the value of global labor and the potential to increase global productivity by many orders of magnitude.

This is serious business.

What an interesting time to be in technology. I suspect many technologists, especially older ones, will be tempted to skip learning what these platforms can do. That would be a huge mistake.
