World beating world models
thanks for the image DALL-E

World beating world models

For the large players in AI development (think OpenAI’s GPT, Meta’s Llama, Anthropic’s Claude, Tesla's FSD) the race for market share has a very specific direction: Build a foundational model with the best internal World Model.

Let me explain what I mean by World Model.

The goal of the AI industry is to develop Artificial General Intelligence (AGI). This means an artificial system that is broadly intelligent across many different domains.? It could learn a new language, plan a vacation to Japan, architect a new house, write a novel, edit a photo, make coffee, paint a picture, direct a feature length film, and if embodied, do the dishes at your house.? The tasks a person can do.

The current limitation to this level of general intelligence is a functioning world model, in other words a broad and meaningful understand of the world as a whole.?How wind blows, what a painting looks like, what a house looks like, the stresses a roof puts on walls, how a person is different than a dog, what an airplane is, how a website is navigated, and what film shot on a 35mm camera looks like.

This sort of world model is the baseline we use in our daily lives to be able to work through the process of planning and executing tasks.? We know a roof has to sit on top of walls or it will fall down.? We know putting blended bologna in coffee is going to be disgusting.? We know putting a dress in the dishwasher isn’t going to turn out well.? Why?? Because we have an effective and efficiently learned world model as our baseline for reasoning.

This is not yet the case with AI models.? The latest large language models (GPT 4 Turbo and Claude Opus) and vision models (Sora and FSD) are showing signs of the of real and effective world model.? It may still be just an illusion of probabilities, but there are hints of a deeper comprehension about our everyday world.

If, or maybe once, these neural networks have generalized world models, the scope of what they can accomplish becomes vast.? While there are many additional capabilities needed to accomplish this vast array of work, like hierarchical planning, a robust general world model is the foundation that all other feats of intelligence require.

This is the current goal, and scaling the training infrastructure is showing that more is better, and better by a wide margin.? Scale has become so important that the race for better and more hardware is a limit even the largest companies in the world struggling to overcome.? At this point the willingness to spend is practically unlimited.?

Meta for example published they used two clusters of 24,000 GPUs to train their latest model, Llama 3.? By the end of 2024 their aim is to grow that their GPU infrastructure by adding 360,000 Nvidia H100 GPUs and growing their overall capacity to the equivalent of 600,000 GPU. ? Putting that in terms of cost it would take over $25 billion to build out similar capacity using Nvidia H100s.

This enormous outlay of cash is not isolated to Meta.? All serious companies looking to build AI models are burning as much cash on compute as they can.

Getting a robust general world model will be game changing for progress toward AGI and the upside is worth far more than the $100s of billions being spent.

要查看或添加评论,请登录

Shane Kempton的更多文章

  • Why your business must be a software first company

    Why your business must be a software first company

    The New Economic Engine Software has revolutionized the global economy, becoming the most efficiently produced and…

    3 条评论
  • The Death of the Software Engineer has been Greatly Exaggerated

    The Death of the Software Engineer has been Greatly Exaggerated

    Day after day, I witness small teams of great engineers consistently release nearly flawless software with as much…

    1 条评论
  • The Robots are Coming

    The Robots are Coming

    Altman and Elon must have had a falling out. The recent announcement of Open AI's partnership with Figure is another of…

  • Part 2: How Will AI Impact Business?

    Part 2: How Will AI Impact Business?

    Introduction AI’s impact has been and will continue to be vast. In the second article of this series (part one here)…

  • The Most Important Part of Sora

    The Most Important Part of Sora

    The real substance of Open AI’s Sora release is this paragraph in their research write up. “Emerging simulation…

    1 条评论
  • Part 1: A Brief History of AI and Why Businesses Should Care

    Part 1: A Brief History of AI and Why Businesses Should Care

    Technologists have been creating innovative ways to offload intelligence into machines, however minor it may have been,…

  • Something New, Something Old

    Something New, Something Old

    A Seamless Shift: P2’s First CEO Transition Phase 2 recently passed through an enormous milestone in our company’s…

    1 条评论

社区洞察

其他会员也浏览了