Transcript
Podcast Host (0:05)
Hi listeners and welcome back to no Priors. Today I'm speaking with Brandon McKenzie and Eric Mitchell, two of the minds behind OpenAI's O3 model. O3 is the latest in the line of reasoning models from OpenAI. Super powerful with the ability to figure out what tools to use and then use them across multi step tasks. We'll talk about how it was made, what's next, and how to reason about reasoning. Brandon and Eric, welcome to no Priors.
Brandon McKenzie (0:27)
Thanks for having us.
Eric Mitchell (0:28)
Yeah, thanks for having us.
Interviewer/Co-host (0:29)
Do you mind walking us through O3? What's different about it? What it was in terms of a breakthrough in terms of a focus on reasoning and you're adding memory and other things versus this, a core foundation model at LLM and what that is.
Eric Mitchell (0:41)
So O3 is our most recent model in this O series line of models that are focused on thinking carefully before they respond. And these models are in sort of some vaguely general sense smarter than models that don't think before they respond. Similarly to humans, it's easier to be more accurate if you think before you respond. I think the thing that is really exciting about O3 is that not only is it just smarter if you make an apples to apples comparison to our previous O series models, you know, it's just better at like giving you correct answers of math problems or factual questions about the world or whatever. This is true and it's great. And we, you know, we'll continue to train models that are smarter, but it's also very cool because it uses a lot of tools that, that, that enhance its ability to do things that are useful for you. So yeah, like you can train a model that's really smart, but like if it can't browse the web and get up to date information, the there's just a limitation on how much useful stuff that model can do for you if the model can't actually write and execute code. There's just a limitation to the sorts of things that an LLM can do efficiently, whereas a relatively simple Python program can solve a particular problem very easily. So not only is the model, it's on its own smarter than our previous O series models, which is great, but it's also able to use all these tools that further enhance its abilities. And whether that's doing research on something where you want up to date information, or you want the model to do some data analysis for you, or you want the model to be able to do the data analysis and then kind of review the results and adjust course as it sees fit instead of you having to be so sort of prescriptive about like each step along the way the model is sort of able to take these like high level requests like do some due diligence on this company and you know, maybe run some reasonable like forecasting models on so and so thing and then you know, write a summary for me. Like the model will kind of like infer a reasonable set of actions to do on its own. So it gives you kind of like a higher level interface to doing some of these more complicated tasks.
