It's been a few days since OpenAI released their new o1 set of models. It's not called 'GPT5', so is it actually a big change? YES!! Here's my quick(ish) take.
Most of the models so far have been using very similar methodologies and training data. If you track the general performance of these models on benchmarks, they are all reaching about the same 'GPT-4' level of capability. Meta's Llama, Anthropics's Claude, Google's Gemini ... all broadly similar.
OpenAI's o1 is something new and the befuddlement around how they've exactly done it means that the architecture and approach is a bit different too.
The way I think about this is related to human 'System 1' and 'System 2' thinking as popularised in the book 'Thinking Fast & Slow'.
Basically, our system 1 thinking uses shortcuts and estimates to be super-fast whereas System 2 is more thoughtful and methodical.
System 1 is great at lots of things - quickly spotting possible danger in the shadows for instance - but it also can make a bunch of errors, and I line this up with the idea of current LLMs suffering from 'hallucinations'.
Here's my favourite example of a System 1 hallucination. Read this puzzle quickly and grab the first answer that's in your head:
"A bat and a ball cost $1.10 in total. The bat costs $1.00 more than the ball. How much does the ball cost?"
Most people immediately say that it's obviously 10c ... but, you're wrong! Our system 1 thinking actually simplifies this question for the sake of speed and gets it wrong. The answer is actually 5c - write it down and you'll see.
So, OpenAI's model is the first AI model I've seen that specifically tackles the idea of System 2 thinking. It doesn't simply spit out the most obvious or probable answer. It thinks, it checks again, and thinks again, until it's pretty sure that it's covered all the bases and hasn't jumped to conclusions.
So, this is actually a very meaningful shift and important step towards more useful and reliable AI.
As always, I'll wrap up with an encouragement to everyone to get across this stuff, to try it out, and to get skilled up. Our world will change dramatically over the next decade or so and it's started already.
All the best.
https://openai.com/o1/
GTM Leadership @ OpenAI | 4x Founding GTM | Advisor
2 个月https://openai.com/o1/