登录查看更多内容

#14 The Future of o(a)1

Roger Lam

Learning the ins and outs of ML/AI and sharing along the way.

发布日期: 2024年9月17日

+ 关注

Few things to write about this week:

OpenAI's o1 model, Terrance Tao on Future Iterations, and ecosystem shifts

Product x Engineering Management and new professional challenges

PyTorch Conf!

OpenAI o1

OpenAI came out with a new line of models - the o1. In their technical breakdown, they call out two methods: chain of thought and reinforcement learning.

Chain of thought is when you prompt the model to "think step by step". We've seen examples of this working with current models that weren't specifically trained to respond in that format. Now they trained specifically to give higher quality answers in that format.

Reinforcement learning is an interesting addition and they didn't give much detail on how they were using it. Reinforcement learning is what gave us AlphaZero - the model that beat world champions at chess and go - and OpenAI's Dota 2 team.

they really just use "reinforcement learning" three times in the technical breakdown

Looking for other's takes, I found James Chiang's post.

His theory is they're doing some sort of tree search in parallel - similar to Microsoft’s rStar method (that I'm learning about for the first time). This is why each response can take up to 2 minutes to return! But I would bet on that also improving over time.

Terence Tao is actually optimistic

If you're not familiar, Terence Tao is a renowned mathematician. A joke I've heard is if you're ever stuck on a research project, get Terry interested and he'll solve it over lunch. Something like that.

He's willing to try new methods like using machine assisted provers.

His latest take is on o1 - and some people have taken some quotes out of context. So I wanted to link it here and talk a little about it.

"mediocre, but not completely incompetent, grad student" ouch

People fixated on the mediocre and incompetent grad student part. Pretty harsh word (which he clarified). The interesting part is that he's extrapolating to "one or two further iterations" until they get to a "competent grad student". It's good and will get better - as a tool.

Can you imaging bringing the brightest minds in every discipline to train the n-th generation of models?

领英推荐

Everything New Coming to ODSC East 2025

Open Data Science Conference (ODSC) 3 个月前

Issue #300 - The ML Engineer ??

Alejandro Saucedo 6 个月前

Python AI and Machine Learning for Production &…

Free Online Courses With Printable Certificates 1 年前

Terry does clarify his words. I love how he expands on the many characteristics that make people successful and impactful.

Insider Look at OpenAI on the Latent Space pod

The Latent Space pod had Michelle Pokrass of OpenAI on talking about the latest 4o changes and gave an insider look at how they take in customer feedback and iterate.

What I took away from the convo was that OpenAI is trying to be ~the~ developer platform for AI. Structure Output was something that you would import a library for like Instructor. Chain of thought you can think of as now being embedded in your model.

And OpenAI's corporate structure is likely changing.

It's a scary time to be a developer tool and a first mover.

How will Meta / Google / Anthropic respond?

This might be the first new big architectural change since mixture of experts. I can see open datasets being even harder to accumulate if more refinement of data is needed for CoT.

Meta wants to be the open AI platform - like they've done with React, PyTorch, Llama, and more.

Google is the sleeping giant in reinforcement learning.

Will Anthropic copy OpenAI's lead and release a similar model to o1?

The Manager's Path vs Founder Mode

I liked how these podcasts and posts came out around the same time.

Camille Fournier was on Lenny's pod talking about the relationship between product and engineering leadership.

Shreyas Doshi was on The Skip pod giving a grounded take on the Founder Mode essay by Paul Graham.

Both great. Having recently stepped into eng management responsibilities, it's making more sense how organizations operate differently depending on scale and scope just as how engineers and systems operate at different scales and scope.

As yet another person who aspires to build their own destiny, not much to say except to keep learning and building.

Hope you're happy, healthy, and hopeful and have a good rest of the week. DM me if you're going to PyTorch conf this week!!!

Roger Lam Weekly

441 位关注者

要查看或添加评论，请登录

Roger Lam的更多文章

#23 Building Scientists and Software Engineers

2025年2月20日

#23 Building Scientists and Software Engineers

A biweekly cadence feels more natural as things are settling around ML / AI. It's still an exciting field but much of…

1 条评论
#22 Mixture of Perspectives on DeepSeek

2025年2月6日

#22 Mixture of Perspectives on DeepSeek

You've likely heard of DeepSeek, the Chinese AI company that has put OpenAI and Nvidia on notice. There was a ton of…
#21 Fire, Birds, and Career

2025年1月19日

#21 Fire, Birds, and Career

Hi everyone - hope all is well. It's been a bit since the last newsletter but in the meantime I posted slide on:…
#20 Engineering Addiction and Class

2024年12月14日

#20 Engineering Addiction and Class

Hey everyone, thanks following along. Hope all is well.
#19 Amazon Nova = Commoditization?

2024年12月7日

#19 Amazon Nova = Commoditization?

Hey! Hope all is well. I'm digging the lull of the holidays.
#18 It's always a good time to build

2024年11月23日

#18 It's always a good time to build

Hey - Hope all is well. First, election fatigue and now recovering from a bad cold.
#17 A Wild AI Appears! (in Japan and Korea)

2024年11月3日

#17 A Wild AI Appears! (in Japan and Korea)

I'm back from vacation and was pretty unplugged with limited data on the e-sim and a whole lot of walking so this is…
All Meta and meta planning

2024年10月11日

All Meta and meta planning

Meta Movie Gen is coming for Sora There are so many releases coming from Meta. Llama 3.
#15 What does PyTorch and the Holy Roman Empire have in common?

2024年9月29日

#15 What does PyTorch and the Holy Roman Empire have in common?

I had the chance to go to PyTorch Conf in SF a week ago and really appreciated the level of depth. Lots of talks and…

1 条评论
#13 Coding with Claude and having fun

2024年9月10日

#13 Coding with Claude and having fun

Been a busy couple weeks but happy to get back into it. Coding with Claude I've been toying with an idea of making…

See all articles

#14 The Future of o(a)1

Roger Lam

Learning the ins and outs of ML/AI and sharing along the way.

OpenAI o1

Terence Tao is actually optimistic

领英推荐

Insider Look at OpenAI on the Latent Space pod

How will Meta / Google / Anthropic respond?

The Manager's Path vs Founder Mode

Roger Lam Weekly

441 位关注者

Roger Lam的更多文章

社区洞察

其他会员也浏览了

Artificial Intelligence #34 - Foundations of Coding for artificial intelligence - part two

Featuring: Poornaditya Mishra

Artificial Intelligence #91: How could domain experts learn Artificial Intelligence? Bias Variance tradeoff as a pedagogy

The Ultimate Guide to Becoming an AI Developer: Courses You Should Take

Some on AI in Education

Model Debugging: Sensitivity Analysis, Adversarial Training, Residual Analysis

Common AI Prompt Engineering Interview Question 10: What programming languages and libraries do you use for AI development?

I Implemented GPT-2 (124M) Base-model From Scratch Using PyTorch and trained it: Here's The summary of the whole process.

What is Transfer Learning?

OpenAI o1

Terence Tao is actually optimistic

领英推荐

Insider Look at OpenAI on the Latent Space pod

How will Meta / Google / Anthropic respond?

The Manager's Path vs Founder Mode

Roger Lam Weekly

441 位关注者

Roger Lam的更多文章

#23 Building Scientists and Software Engineers

#22 Mixture of Perspectives on DeepSeek

#21 Fire, Birds, and Career

#20 Engineering Addiction and Class

#19 Amazon Nova = Commoditization?

#18 It's always a good time to build

#17 A Wild AI Appears! (in Japan and Korea)

All Meta and meta planning

#15 What does PyTorch and the Holy Roman Empire have in common?

#13 Coding with Claude and having fun

社区洞察

其他会员也浏览了

Artificial Intelligence #34 - Foundations of Coding for artificial intelligence - part two

Featuring: Poornaditya Mishra

Artificial Intelligence #91: How could domain experts learn Artificial Intelligence? Bias Variance tradeoff as a pedagogy

The Ultimate Guide to Becoming an AI Developer: Courses You Should Take

Some on AI in Education

Model Debugging: Sensitivity Analysis, Adversarial Training, Residual Analysis

Common AI Prompt Engineering Interview Question 10: What programming languages and libraries do you use for AI development?

I Implemented GPT-2 (124M) Base-model From Scratch Using PyTorch and trained it: Here's The summary of the whole process.

What is Transfer Learning?