OpenAI's secret: Q*

OpenAI's secret: Q*

This was originally published in the Sunday edition of my newsletter yesterday. You can now become a member for the next year with full access to research, essays, and conversations that will deepen your understanding of AI with a one-time discount for your first year. The offer ends in just a few hours. Upgrade to Exponential View Premium with 33% off


Just when you think the OpenAI drama is over, it bursts open again. Sam Altman is back as CEO, and now there’s talk that his firing was due to a secret project, Q*. According to Reuters:?

Several staff researchers wrote a letter to the board of directors warning of a powerful artificial intelligence discovery that they said could threaten humanity.

Reportedly, this secret AI can do grade-school maths on problems it hasn’t seen before. This has previously been a challenge for LLMs like GPT-4. While ‘grade-school maths’ doesn’t seem like much, with exponential progress of the type we’ve witnessed, grade-school maths could quickly become PhD maths. (The Information has more details on what this breakthrough could be. Others speculate it is about a different approach to reinforcement learning that can operate “model-free”, in complex or highly evolving environments. Yann LeCun suggests it could be about planning. But equally, this could be no more than breathless hype, diverting attention from OpenAI’s recent troubles.)

Let’s imagine that this “grade school maths” thing is real. So far, LLMs have built capabilities quickly. A recent study by Anthropic, Cohere, and NYU researchers pitted humans, GPT-4 and GPT-3.5 against a range of difficult graduate science problems. Highly skilled non-experts, pursuing PhDs in unrelated disciplines, were given access to Google to help them. They got 34% of questions right, better than GPT-3.5 but worse than GPT-4. PhD students in their own domains, naturally, did the best, scoring 65%. So, today’s state-of-the-art token predictor is nowhere near expert (PhD-level) performance, but it is better than your smarter-than-average-bear with a search engine–and will likely get better.

Perhaps, Altman had been giving hints. One day before his ouster, he said:

Four times now in the history of OpenAI, the most recent time was just in the last couple weeks, I’ve gotten to be in the room, when we sort of push the veil of ignorance back and the frontier of discovery forward, and getting to do that is the professional honour of a lifetime

This week won’t be the end of the drama. To help contextualise things and what might come next, I spoke with Karen Hao, a contributing writer at The Atlantic, who has been researching the company closely.




Alexander Peschkoff

Founder & CEO - making it happen.

1 年

It's not hard to fool AI with a simple twist to a standard task.

回复
Michael Spencer

A.I. Writer, researcher and curator - full-time Newsletter publication manager.

1 年

What's with the Verge saying it never happened? That they received no such letter. Sounds like a PR leak to me. Not enough clarity to even cover it, this company is clearly very manipulative with the sort of news they release.

José María Puerta González

Senior Corporate Brand & Communications Manager @ ávoris ???? ?? ????

1 年

I'm terribly sorry, but I can't help it, if they call it Q*, it doesn't start well.

  • 该图片无替代文字

There could be a confusion around 'planning' in relation with problem solving. Not least because most of the literature on LLM and Planning is not very positive towards LLM, see the latest summary from Kambhapati.

要查看或添加评论,请登录

Azeem Azhar的更多文章

  • ?? What's the deal with Manus AI?

    ?? What's the deal with Manus AI?

    Six things you need to know to understand the hype The online discourse around Manus AI typically falls into three…

    8 条评论
  • AI’s productivity paradox

    AI’s productivity paradox

    I want to play a game of counterfactuals..

    10 条评论
  • Why the AI surge isn't like 1999

    Why the AI surge isn't like 1999

    Economist Paul Krugman sees parallels between the late-90s tech bubble and today’s AI frenzy. In my conversation with…

    4 条评论
  • What OpenAI’s Deep research means for search

    What OpenAI’s Deep research means for search

    Originally published in Exponential View on 4 February OpenAI released yet another add-on on to its growing suite of AI…

    4 条评论
  • ??DeepSeek: everything you need to know right now.

    ??DeepSeek: everything you need to know right now.

    My WhatsApp exploded over the weekend as we received an early Chinese New Year surprise from DeepSeek. The Chinese AI…

    38 条评论
  • ?? Stargate & DeepSeek R-1 – What matters

    ?? Stargate & DeepSeek R-1 – What matters

    In the past week, a lot was written about the US government’s “Stargate” partnership with OpenAI AND DeepSeek R-1…

    12 条评论
  • Davos Daily, Day 1

    Davos Daily, Day 1

    The energy here is different this year, so I’ll share my daily takes from the Forum to help you understand what it’s…

    6 条评论
  • ?? Join me live on AI, deep tech & geopolitics

    ?? Join me live on AI, deep tech & geopolitics

    Hi all, I am going live in two hours from DLD — one of Europe’s most important annual events focused on the…

    5 条评论
  • Five contrarian ideas about genAI in the workplace

    Five contrarian ideas about genAI in the workplace

    ChatGPT alone sees over 300 million weekly users—roughly 7% of all mobile phone owners worldwide. Nearly a third of…

    13 条评论
  • ?? AGI in 2025?

    ?? AGI in 2025?

    We can't ignore Sam's bet that..

    11 条评论

社区洞察

其他会员也浏览了