Mega, Mamba, Liquid...
I had the opportunity to attend and contribute to an outstanding event on AI and business held at the MIT Media Lab on April 18th. Our host, John Werner, is amazingly energetic, dynamic, and intelligent, and he pulled off an event for thousands of people with a small handful of hardworking volunteers.
In his commentary before, during, and after the event, John was quite transparent about his motivations: Boston, and MIT in particular, should be front and center as the place where AI innovation happens over the next few years, and he hoped that bringing the community together at the Media Lab would let a few connections and relationships blossom. As one person at the conference put it, "...let's pull some of those MIT grads off the train to Silicon Valley and keep them here!"
While there were a huge number (100+) of impressive local startups at the event, exhibit "A" for John's argument that AI innovation is happening in Boston was the company Liquid. He even gave them a main stage panel discussion to educate the audience about the interesting approach they are bringing to challenge the current dominance of transformer-based foundation models. Considered a "spinoff" from MIT, they are largely local to Boston, although they also have a Palo Alto, CA presence. The research work that inspired the company tackled the same questions that inspired the developers of the transformer (GPT) architecture, but from a different starting point.
“Liquid’s approach to advancing the state-of-the-art in AI is grounded in the integration of fundamental truths across biology, physics, neuroscience, math, and computer science. We believe that trans-disciplinary approaches will unlock the greatest levels of acceleration towards the most efficient breakthroughs.” – Joseph Jacks (Founder and General Partner at OSS Capital)
What is that approach? Well, it started with the brain of the roundworm C. elegans, which has just 302 neurons and about 7,000 connections (versus roughly 100 billion neurons in the human brain). The goal is to create smaller, simpler, but nonetheless powerful foundation models that can perform important tasks while using much less computing and power.
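To make that concrete, here is a minimal sketch of the kind of "liquid" time-constant neuron dynamics that the underlying research describes, where each neuron's effective time constant shifts with its input. All sizes, weights, and constants below are toy values for illustration, not Liquid's actual models.

```python
import numpy as np

# Illustrative liquid time-constant (LTC) style update: the state decays with a
# time constant that depends on the current input, so a tiny network can show
# rich, adaptive dynamics. Toy parameters only.

def ltc_step(x, inputs, W_in, W_rec, tau, A, dt=0.01):
    """One Euler step of dx/dt = -(1/tau + f) * x + f * A."""
    f = np.tanh(W_in @ inputs + W_rec @ x)   # input-dependent gate
    dxdt = -(1.0 / tau + f) * x + f * A      # state-dependent time constant
    return x + dt * dxdt

rng = np.random.default_rng(0)
n_neurons, n_inputs = 32, 8                  # tiny network, in the spirit of C. elegans
x = np.zeros(n_neurons)
W_in = rng.normal(0, 0.5, (n_neurons, n_inputs))
W_rec = rng.normal(0, 0.5, (n_neurons, n_neurons))
tau = np.ones(n_neurons)                     # base time constants
A = np.ones(n_neurons)                       # bias toward which the gate pulls the state

for t in range(100):                         # drive with a toy sinusoidal input
    u = np.sin(0.1 * t) * np.ones(n_inputs)
    x = ltc_step(x, u, W_in, W_rec, tau, A)
print(x[:5])
```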
The bigger story here, beyond Liquid, is that a growing number of research initiatives are experimenting with alternatives to the transformer architecture. Another example is state-space models, which were originally developed to address problems in signal processing. Mamba, developed from research at Carnegie Mellon and Princeton, is a linear-time sequence model. It scales better than transformers for longer sequences and can extrapolate to sequences much longer than it was trained on. The core of this approach is summarizing past information into a compact current state and then using that representation to drive predictions.
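A bare-bones discrete state-space recurrence shows the idea: the hidden state is the running summary of everything seen so far, and each output is read from that state in a single linear-time pass. Real models like S4 and Mamba learn these matrices, discretize a continuous system, and (in Mamba's case) make the parameters input-dependent; the values below are toy assumptions.

```python
import numpy as np

# Toy state-space recurrence: h_t = A h_{t-1} + B x_t, y_t = C h_t.
# The state h compresses the past; prediction reads from that compression.

def ssm_scan(x, A, B, C):
    h = np.zeros(A.shape[0])
    ys = []
    for x_t in x:                    # one pass over the sequence, O(L)
        h = A @ h + B * x_t          # fold the new token into the running state
        ys.append(C @ h)             # output driven by the compressed history
    return np.array(ys)

A = np.diag([0.9, 0.5, 0.1])         # stable decay rates for the state
B = np.array([1.0, 1.0, 1.0])
C = np.array([0.3, 0.5, 0.2])
x = np.sin(np.linspace(0, 6.28, 50)) # a toy 1-D input sequence
print(ssm_scan(x, A, B, C)[:5])
```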
Meanwhile, transformer architectures aren't standing still: a variety of ever-larger models, sometimes referred to as Mega models, are being trained on huge data sets with the goal of providing an underlying "utility" layer that is good at all but the most specialized tasks.
In the long run we should expect to see application architectures emerge that use different approaches to solve different parts of a problem. You may have a pre-processing algorithm providing the "front-end" interface, one that maintains consistency, can be managed for safety, and presents the right tone and personality for a given activity. Behind this you may have a panel of models that interact with one another to answer a given request, including specific tools focused on deterministic tasks like calculation, code execution, or integration with other information systems.
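A hypothetical sketch of that layering might look like the following: a front-end layer handles safety and normalization, then routes each request to a deterministic tool or a general model. Every function name here is invented for illustration; it is not any particular vendor's API.

```python
# Toy compound-architecture dispatch: front-end -> panel of specialists.

def front_end(request: str) -> str:
    """Pre-processing layer: enforce a simple safety rule and normalize the ask."""
    if "forbidden" in request.lower():
        raise ValueError("request rejected by safety filter")
    return request.strip()

def calculator(request: str) -> str:
    # Deterministic tool for arithmetic; a real system would parse more carefully.
    expr = request.split(":", 1)[1]
    return str(eval(expr, {"__builtins__": {}}))   # toy only: never eval untrusted input

def general_model(request: str) -> str:
    return f"[stub LLM answer to: {request}]"      # stand-in for a hosted model call

def route(request: str) -> str:
    """Panel dispatch: pick the specialist whose task matches the request."""
    request = front_end(request)
    if request.startswith("calc:"):
        return calculator(request)
    return general_model(request)

print(route("calc: 6 * 7"))          # handled deterministically -> 42
print(route("Summarize the event"))  # handled by the general model
```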
As the Berkeley AI Research (BAIR) lab recently explained, even the common platforms we are using today, like ChatGPT, are actually compound AI systems incorporating many components. ChatGPT might be characterized as an LLM plus a web browser plugin for retrieving timely content, a code interpreter plugin for executing Python, and the DALL-E image generator. Another example BAIR provides: Google DeepMind's AlphaGeometry, a combination of a fine-tuned LLM and a symbolic math engine.
One thing is certain -- every advance that solves one set of questions simply unearths a new set of questions. And there is an enormous amount of both human and machine intelligence being applied to these questions, resulting in rapid (dare I say exponential?) advances in the capabilities of our machine intelligence inventions. Have a problem that you don't think can be solved by today's AI? Definitely take another look tomorrow.