OpenAI's secret: Q*

Azeem Azhar

Making sense of the Exponential Age

Published: November 27, 2023

This was originally published in the Sunday edition of my newsletter yesterday.

Just when you think the OpenAI drama is over, it bursts open again. Sam Altman is back as CEO, and now there’s talk that his firing was due to a secret project, Q*. According to Reuters:

Several staff researchers wrote a letter to the board of directors warning of a powerful artificial intelligence discovery that they said could threaten humanity.

Reportedly, this secret AI can do grade-school maths on problems it hasn’t seen before. This has previously been a challenge for LLMs like GPT-4. While ‘grade-school maths’ doesn’t seem like much, with exponential progress of the type we’ve witnessed, grade-school maths could quickly become PhD maths. (The Information has more details on what this breakthrough could be. Others speculate it is about a different approach to reinforcement learning that can operate “model-free”, in complex or highly evolving environments. Yann LeCun suggests it could be about planning. But equally, this could be no more than breathless hype, diverting attention from OpenAI’s recent troubles.)

Let’s imagine that this “grade-school maths” thing is real. So far, LLMs have built capabilities quickly. A recent study by Anthropic, Cohere, and NYU researchers pitted humans, GPT-4 and GPT-3.5 against a range of difficult graduate science problems. Highly skilled non-experts, pursuing PhDs in unrelated disciplines, were given access to Google to help them. They got 34% of questions right, better than GPT-3.5 but worse than GPT-4. PhD students in their own domains, naturally, did the best, scoring 65%. So, today’s state-of-the-art token predictor is nowhere near expert (PhD-level) performance, but it is better than your smarter-than-average bear with a search engine, and it will likely get better.

Perhaps Altman had been giving hints. One day before his ouster, he said:

Four times now in the history of OpenAI, the most recent time was just in the last couple weeks, I’ve gotten to be in the room, when we sort of push the veil of ignorance back and the frontier of discovery forward, and getting to do that is the professional honour of a lifetime

This week won’t be the end of the drama. To help contextualise things and what might come next, I spoke with Karen Hao, a contributing writer at The Atlantic, who has been researching the company closely.

Alexander Peschkoff

Founder & CEO - making it happen.

1 yr

It's not hard to fool AI with a simple twist to a standard task.

Michael Spencer

A.I. Writer, researcher and curator - full-time Newsletter publication manager.

1 yr

What's with the Verge saying it never happened? That they received no such letter. Sounds like a PR leak to me. Not enough clarity to even cover it, this company is clearly very manipulative with the sort of news they release.

1 reply

José María Puerta González

Senior Corporate Brand & Communications Manager @ ávoris

1 yr

I'm terribly sorry, but I can't help it, if they call it Q*, it doesn't start well.

3 replies

Marc Cavazza

1 yr

There could be some confusion around 'planning' in relation to problem solving, not least because most of the literature on LLMs and planning is not very positive towards LLMs; see the latest summary from Kambhampati.

1 reply
