The journey to AGI
Created using OpenAI's DALL-E 2 image generator

Current AI systems are sometimes referred to as "narrow AI" because they are optimized to solve one specific task. Large language models like OpenAI's GPT-3, which works only with text, are an example. To move toward AI with human-level capabilities, these systems will have to handle many different tasks -- a capability usually referred to as artificial general intelligence, or AGI.

One step on this journey is to introduce "multimodal" AI systems, most recently accomplished by DeepMind with the introduction of its Gato system. MIT Technology Review writes that Gato "learns multiple different tasks at the same time, which means it can switch between them without having to forget one skill before learning another."

This is accomplished by representing data of different kinds in the same system -- text, images, sound, even actions. A simple example of this last category is movement in a computer game: move up, move down, go left. So instead of compartmentalizing these skills into separate models, a single model can handle all of them.
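To make this concrete, here is a minimal, purely illustrative Python sketch -- not DeepMind's actual Gato code; the vocabulary layout and token ranges are invented -- of the core trick: serializing text, image data, and game actions into one flat token sequence that a single sequence model could consume.

# A toy illustration (hypothetical, not Gato's real tokenizer): each modality
# gets its own range in one shared vocabulary.
TEXT_OFFSET = 0        # tokens 0..9999 reserved for text
IMAGE_OFFSET = 10_000  # tokens 10000..10255 for 8-bit image pixel values
ACTION_OFFSET = 10_256 # tokens 10256..10258 for game actions

ACTIONS = {"up": 0, "down": 1, "left": 2}

def tokenize_text(text: str) -> list[int]:
    # Toy stand-in for a real subword tokenizer: one token per character.
    return [TEXT_OFFSET + (ord(c) % 10_000) for c in text]

def tokenize_image(pixels: list[int]) -> list[int]:
    # Discretize 0-255 pixel values directly into the image token range.
    return [IMAGE_OFFSET + p for p in pixels]

def tokenize_action(action: str) -> list[int]:
    # Map a discrete game action to its reserved token.
    return [ACTION_OFFSET + ACTIONS[action]]

# One training sequence can now interleave modalities -- e.g. an instruction,
# a game frame, and the action taken -- which is the "single model, many
# skills" idea in miniature.
sequence = (
    tokenize_text("move toward the coin")
    + tokenize_image([0, 17, 255, 128])
    + tokenize_action("left")
)
print(sequence)  # one flat list of ints a single transformer could consume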

For example, OpenAI currently has four different systems for four different types of content: GPT-3 for text, DALL-E for images, Codex for programming, and Whisper for speech. A multimodal approach would combine all four content types into a single model; a user would then specify in the prompt which skill should be used. Such a system should then also be able to "learn" new skills.
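As a thought experiment, a unified interface might look something like the Python sketch below. The MultimodalModel class, its method names, and its canned outputs are all hypothetical, invented for illustration; OpenAI offers no such combined endpoint today.

class MultimodalModel:
    """One model, many skills: the prompt itself declares the desired output."""

    def generate(self, prompt: str, output_type: str) -> str:
        # In a real system a single network would emit tokens in the requested
        # modality; here we simply dispatch to a stub per skill.
        handlers = {
            "text": self._text,      # the role GPT-3 plays today
            "image": self._image,    # the role DALL-E plays today
            "code": self._code,      # the role Codex plays today
            "speech": self._speech,  # the role Whisper plays today
        }
        if output_type not in handlers:
            raise ValueError(f"unknown output type: {output_type}")
        return handlers[output_type](prompt)

    def _text(self, prompt: str) -> str:
        return f"[text completion for: {prompt}]"

    def _image(self, prompt: str) -> str:
        return f"[image for: {prompt}]"

    def _code(self, prompt: str) -> str:
        return f"[program for: {prompt}]"

    def _speech(self, prompt: str) -> str:
        return f"[speech for: {prompt}]"

model = MultimodalModel()
print(model.generate("a cat playing chess", output_type="image"))

The point of the sketch is the single entry point: today a user must pick one of four separate products, whereas a multimodal system would fold that choice into the prompt itself.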

Where are we going: On the journey toward artificial general intelligence, the near-term milestone will be multimodal AI systems that can handle and move between different information types. The next step will be to use those multimodal systems to quickly learn new skills; composing music, for example, could be the next step for a multimodal system that has already mastered text, images, and sound. We should see these multimodal systems within the next year -- not just in a lab, but available for anyone to use (just as GPT-3, Codex, and DALL-E are all available today). Researchers believe this eventually leads to a system that can learn any skill, and is thus a path to AGI.

Tom Short

What’s next?

Great info on AGI and the multimodal approach to it. In the multimodal-model approach to AGI (if I understand it correctly), it seems to me that one of the hardest parts would be dealing with visual inputs. Somehow babies figure out very quickly what's new in their environment and what isn't, and, even more important, which new stuff is important to pay attention to and which can be ignored. I could well imagine an AGI being very good at detecting new stuff but struggling to determine what is important and what isn't, leading to 'info overload', or at least a slower learning curve. Then again, an AGI can handle millions of times more transactions, and faster, than a human ever could, so maybe there's a meta-learning loop an AGI could use to develop that discernment.
