They (LLMs) are all the same - myth or fact?
Credits to Maxime Labonne for this image - https://x.com/maximelabonne/status/1816416043511808259


This is the third article in the AI series, and it addresses something that keeps coming up in discussions.

FACT: The performance of all major LLMs is currently converging. In fact, the image above shows that the gap between closed- and open-source foundation models is also shrinking.

MYTH: With every new model release, we see the gap between the performance of the models decrease; hence foundation models are going to be commoditized.

The myth stems from the following assumptions:

  1. We are running out of data to train on, hence no one has an edge
  2. Data is the only blocker, and we are at the end of what can be squeezed out of the Transformer model

The reality is far from this. To understand why, let's look under the hood at the different forces at work. Three voices that have alluded to them are:

  1. Leopold Aschenbrenner (former OpenAI engineer), in his manifesto. The full 165 pages here.
  2. Eric Schmidt (former Google CEO) - his recent talk at Stanford (they took down the video, but parts of it are available again)
  3. Ilya Sutskever's view on the Transformer model

Here's the summary:

  1. We are NOT running out of data. See below
  2. The true blockers are: Capital & Power

Data isn't a blocker

  • Re-training on the same data works wonders for improving model performance (Leopold)
  • Training on artificial, structured data works pretty well, i.e., when annotated data is fed to models instead of raw data from the internet, model performance is better and a lot less data is required
  • The capital required for new models is growing exponentially. $1 billion and $100 billion models are being seriously considered; they are no longer thought experiments.
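To make the "exponential capital" point concrete, here is a back-of-envelope sketch. The starting cost and the 10x growth factor per model generation are illustrative assumptions, not reported figures:

```python
# Back-of-envelope: what roughly 10x cost growth per model generation looks like.
# The starting cost (~$100M) and the 10x factor are assumptions for illustration.
def projected_costs(start_cost_usd: float, growth_factor: float, generations: int) -> list[float]:
    """Return the projected training cost for each successive model generation."""
    return [start_cost_usd * growth_factor**i for i in range(generations)]

costs = projected_costs(start_cost_usd=100e6, growth_factor=10, generations=4)
for gen, cost in enumerate(costs, start=1):
    print(f"Generation {gen}: ${cost / 1e9:.1f}B")
```

Under these assumptions, four generations take you from $0.1B to $100B - which is why the $1 billion and $100 billion figures are no longer thought experiments.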

Why Power & Capital

  • The reasons above require an enormous amount of power: think about training GPT-4 10-100 times on slight variations of reasoned data
  • Those NVIDIA clusters are hungry babies! :)
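A quick sketch of why those clusters are "hungry babies". The cluster power draw, run length, and repeat count below are all illustrative assumptions, not measured figures:

```python
# Back-of-envelope: energy consumed by repeated large training runs.
# All three constants are assumptions chosen for illustration.
CLUSTER_POWER_MW = 25   # assumed sustained draw of a large GPU cluster
RUN_DAYS = 90           # assumed length of one training run
REPEATS = 100           # "train GPT-4 100 times on variations of the data"

hours = RUN_DAYS * 24
energy_per_run_gwh = CLUSTER_POWER_MW * hours / 1000  # MW * h = MWh; /1000 -> GWh
total_gwh = energy_per_run_gwh * REPEATS

print(f"One run: {energy_per_run_gwh:.0f} GWh; {REPEATS} runs: {total_gwh:.0f} GWh")
```

Even with these modest assumptions, 100 runs lands in the thousands of gigawatt-hours - the scale where power contracts, not GPUs, become the constraint.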

When you put all of the above into perspective, it becomes clear that this story hasn't fully played out yet.

And as we get closer to 2029, expect:

  • Divergence in model performance
  • Capital dictating consolidation among companies
  • The war for AI supremacy moving from companies to nation states (currently being played out in the shadows)


#AI #AItrends #AI-for-CIOs

Here are the links to the previous articles in this series:

2nd Article - Fourth Revolution

1st Article - AI : Where are the use cases

Nikhil Kodilkar

Director Strategic Group | Lead by example

6 months

Something to confirm this hypothesis: Microsoft recently signed one of the biggest power deals ever - ~$800 million/yr for 20 years, i.e. ~$16 BILLION, for one nuclear reactor.

Tausif Sheikh

Software Engineer and Chat Bot Developer - Full Stack (Node JS, React, React Native, AI, ML)

6 months

Very informative. Thank you, Nikhil.
