
Loading summary
Big Technology Podcast Host
GPT5 is here and OpenAI COO Brad Lightkap is with us to break down the new model's capabilities, what it means for the AI business, and what's next for this promising technology. Brad, it's so great to see you. Thank you for joining us on an emergency episode of Big Technology Podcast.
Brad Lightkap
My pleasure, thanks for having me.
Big Technology Podcast Host
All right, so briefly, I just want you to talk a little bit about what GPT5 is. So maybe within like 60 seconds or so, can you talk about what it is and how it improves on previous OpenAI models?
Brad Lightkap
Yeah. So GPT5 is our next generation flagship model. It does something really interesting, which is it actually combines into one model the ability to dynamically choose whether to think hard about a problem and reason about it to give you an answer or not. And so you'll remember previously you had to go deal with the model picker in ChatGPT, everyone's favorite thing. You had to select a model that you wanted to use for a given task and then you'd run the process, asking a question, getting an answer. Sometimes you choose a thinking model, sometimes you wouldn't. And that was, I think, a confusing experience for users. GPT5 Abstracts all of that, so it makes that decision for you and it's actually a smarter model. So you're going to get a better answer in all cases, regardless of whether you're using the thinking mode or not. And it's vastly improved on things like writing, coding, health. It's much more accurate, is much faster, and so all around we think a better experience.
Big Technology Podcast Host
And now for those of us who've been following the hype, I think we probably imagine you would lead with this is an explosive increase in intelligence versus there's a switcher on the model that will go to reasoning or non reasoning when it makes the most sense. So can you explain what's the disconnect there? And why lead with the usability versus the intelligence increase?
Brad Lightkap
Yeah, because intelligence really is a function of how much time the model is going to be thinking. And so depending on how much you want to allocate thinking time to a problem, you're going to get a better answer. Typically, the longer it thinks, the better an answer it can give you. So when we test the model on certain benchmarks and evals and we allow it to think, it will dramatically outperform any of our existing models by far. Even though if you don't allow any thinking time, you still get a typically net better answer than you would for one of our non thinking models, like GPT 4.1. So it is a dramatic improvement in intelligence. It should be, I think, a better quality model across pretty much all dimensions. But that reasoning time and being able to use the reasoning time dynamically to think we think actually is the important part. It makes it for a much better user experience.
Big Technology Podcast Host
Now, I'm going to parse your words a little bit. You said that it is a dramatic improvement over previous models. Sam in a press call said that GPT5 is a pretty significant step over 4.0. Simon Wilson, who's been using your model for a little bit, says it doesn't feel like a dramatic leap ahead from what other LLMs, from other LLMs, but it exudes competence, it rarely messes up and frequently impresses me. I'm just setting this up because I'm curious whether we could say or whether you would say that this model is an exponential increase in capabilities or an incremental increase in capabilities.
Brad Lightkap
You know, it's hard to measure it that way. I think we're now kind of into this regime of having to measure intelligence across a lot of different dimensions, which isn't a way to dodge the question so much as it is to explain why GPT5 is such a special model. And so obviously it's better at the core things that you'd expect it to be better at. It scores better on things like Sweebench. It scores better on all the kind of academic evals that we put it through. This one in particular, we actually made a real emphasis to have it score better on certain health benchmarks. So it's better at medical reasoning and other health related things. But there's a lot of things that go into what makes a model good now, because you have a lot of dimensions to play with depending on kind of how that model's trained and how it can think about problems. So if it's faster, for example, we think that's actually indicative of it being better. If it can give you a better answer per unit of time, thinking, we think that's an improvement. That is an important vector to measure. Also, if it can do things like structured thinking, problem solving, tool use, all these things are things we actually measure and they're kind of invisible to users. You know, if you're just using ChatGPT, you don't necessarily appreciate each of these things happening under the hood. But all those things are better for GPT5 than they were for our previous models. Right.
Big Technology Podcast Host
And the reason why I'm asking is because I think a lot of people have pointed to the leaps from GPT original GPT to GPT 2 G GPT 2 to GPT 3, GPT 3 to GPT 4. And one of the things people have seen is just a general increase in capabilities across the board. There were no caveats of like, and maybe there's a reason for those caveats, but there were no caveats of, you know, there's intelligence increases in this place and that place it was, we trained a bigger model. I'm pretty sure this is what it was and it's better across the board. So have things changed?
Brad Lightkap
They've changed, yeah. From a technical perspective, I think when you go from GPT 2 to GPT 3, 3 to 4, these were really just exploits of what was and is the scaling paradigm of training larger pre training, bigger and bigger models, training larger models. It's kind of one vector of training and you get a better model as a result. And that continues to hold true. But we now have this kind of other category of training which is post training and being able to use test time, compute in more interesting ways than we used to is almost kind of a second stage of training. And so we think that that actually gives us a little bit of a boost, a force multiplier on our ability to push the model toward new intelligence levels and also be able to train into it a lot of the things that you want an intelligent model to be able to do. So using tools, for example, is something that rethink is really important for overall intelligence. GPT2 and 3 couldn't really do that as well. GPT4 could do it in a more nascent way. And now GPT5 you get that baked in with the benefit of these kind of multi step and longer horizon reasoning processes. So yeah, we want to abstract that from users. Obviously we don't think that you as a ChatGPT user should have to stop and think about that. And in some sense I think the model picker being a point of frustration for people was an expression of the fact that people don't necessarily want to have to make those decisions. Every time they talk to an AI model, they kind of want the model to make those decisions for them. And so that's why we think GPT5 is a big step.
Big Technology Podcast Host
And going back to that, increasing pre training, increasing the scale of pre training, delivering predictable improvements in model performance. Yes, now post training is in the picture, it's making models better in really impressive ways. But are you of the belief, and is OpenAI of the belief now that there are diminishing returns from pre training Given that we're now talking about different forms of training these models, not at all.
Brad Lightkap
Our scaling laws still hold empirically. There's no reason to believe that there's any kind of diminishing return on pre training and on post training. We're really just starting to scratch the surface of that new paradigm. The O series of models, which were kind of the previous reasoning models, were really just the beginning of us starting to explore what's possible in that post training regime. And I think that's going to be kind of the dominant theme here for the next year or two, is continuing to scale in that dimension and continuing to see the gains that you get there simply because they're so significant. And so now we're pushing on two axes for how to improve models, and we think that's going to tighten and condense the rate of innovation.
Big Technology Podcast Host
OpenAI believe that the vast majority of improvements from here are going to be coming from scaling or from algorithms.
Brad Lightkap
I think it'll be a combination. It's always a combination, right? It's always algorithms, scale, compute and data. Right. And so we push on all three. And they all play a really important role, I think, in how we look at the future. And then the hard part, obviously is having them come together. So being able to train larger models requires typically that you want to train on more data, obviously with more compute. And so that's a delicate balance between those things because just scaling up doesn't necessarily mean in all cases that you're going to get kind of the same corresponding rate of improvement. You have to be able to bring those other pieces also. So it's not like we push one button or the other. We actually make a really conscientious effort to try and kind of pull all of those together.
Big Technology Podcast Host
Okay. And you're not calling it AGI. And I have to say I've lost a bet on this show because. Because I was listening to Sam on the Theo Von Show. He says, he said GPT5 is smarter than us in almost every way. And I said, all right, well, that sounds like what you would imagine AGI would be. And then, you know, GPT5 comes out yesterday or as the release happens, Sam says, I kind of hate the term AGI because everyone at this point uses it to mean a slightly different thing. But this is clearly a model that is generally intelligent. Help me understand what's going on, because it seems like maybe he wants to call it AGI, but you're not yet. So why is this not AGI?
Brad Lightkap
Well, it is a hard thing to define the joke here is you ask five people what AGI is, you'll get seven answers. And I think the way we kind of look at it is it's a cumulative process, right? It's a system. And I think you have to define kind of what is it that that system is and what do you expect it to be able to do? And for me at least, that's a system that is reliably able to learn new things that are kind of out of distribution by virtue of its ability to reason, to think, to solve problems, to use tools, to come up with new ideas. And so do I think we're at a system that I would call AGI? No, but I think we start to see the traces and the pieces of that overall system for generalized learning start to come together in models like GPT5 and I suspect in its successors. I don't know if we'll have a point where we are like, okay, we've crossed from a non AGI world into an AGI world. And even if there were, I'm not sure we'd actually realize it necessarily until after the fact, because one of the things we've learned working with the models that we have is the capability overhang is significant. I think when Sam refers to the intelligence of the models and having a PhD in your pocket, we haven't yet really exploited that as a thing in some sense. I think you could pause AI progress right here for 10 years and you'd still have about a decade worth of new products to get built, of new ways that people figure out how to use the models, even at a GPT5 level model, in interesting products and interesting processes. And one of the kind of interesting things is I think as the models get smarter, they almost demand more from a product building perspective in terms of how you actually plug them into the system. I always kind of roughly analogize it to like you could have a really, really smart intern, and at the end of the day they're only capable of doing a few things for you. They can take notes in meetings, they can write summaries, they can pull basic analyses together. But if you bring a PhD to work, that person has a tremendous capability set that they may not be totally effective on the job on day one, but your job is to really figure out how to expose them to enough context, enough information, give them the right tools to make them really effective later on. And that process actually takes longer to get them to their full effectiveness than it would an intern. And I think it's going to be similar with AI models. And so it is a continuous process and I don't think it will be linear. But where we are today, I would say we're probably not quite yet at something I would call an AGI level system.
Big Technology Podcast Host
Yeah. And it brings up such an interesting question, which is, does it really make sense to try to make the models smarter from here, or is it about trying to build those ancillary capabilities? I think Sam mentioned this on the media call, but GPT3, he said, was high school level intelligence, GPT4, maybe the level of a college student, and GPT5 an expert. So I guess, I wonder, for OpenAI, is the quest to add more intelligence to the mix or is it to focus on capabilities other than smarts? Some of the things that you mentioned, like memory and continual learning, it's going.
Brad Lightkap
To be, I think all those things. Certainly there are some unsolved problems. You mentioned a few here, and I would agree with those that, you know, you'd expect a really smart person to, you know, it kind of comes by default that our models still struggle with. And so there's open research there that we still have to do, I think, to be able to kind of close the loop on what I would call the full spectrum of intelligence. But intelligence, like we were talking about earlier in the podcast, expresses in a lot of different ways. And part of it is just your pure iq. It's your knowledge of how things work and your ability to recall information. But then it's also your ability to reason about how to use other tools to solve problems. It's your ability to be reflective and to look back on your own chain of thought, your own line of thinking, and actually course correct when you feel like, you know, I actually went down the wrong path and maybe I didn't come up with the right strategy to solve this problem. And so that's one of the cool things we see is GPT5 on those vectors we can actually reliably measure as better than the previous systems we had. And for us, I think one of the real world things that we really want to understand is how do they actually perform in the real world? How do developers use these models? How do enterprises use these models to actually apply them to existing problems, real world problems, and see if the next models kind of do better than the last models. And so that's for us, I think the real world benchmark is increasingly becoming important as a sign of intelligence relative to the academic benchmarks and how big.
Big Technology Podcast Host
Of a priority is continual learning within.
Brad Lightkap
OpenAI, we have a lot of Priorities, I think certainly that's among them. But we feel really good on our research.
Big Technology Podcast Host
Middle, low priority.
Brad Lightkap
It's hard to, you know, the cool thing about OpenAI is the way that we kind of, you know, has, I think, have like systematized being able to do research. And this has really been true from the early days of the company I joined OpenAI in 2018 is we, we take this kind of highly exploratory approach to research. And so we're very much not tops down, I think, in how we, how we approach research where there's one idea and everyone kind of just gloms on to that one idea and we kind of do one thing at a time. What we really do is a lot of open ended exploration in small teams. We explore different paths and see if those lead to new ideas that we then kind of cycle back into the kind of core idea, the main line of ideas, if they work. And if they don't, we kind of, we recombine those teams into other ideas that seem to be working and then allow other, you know, new ideas to offshoot from there. And so it really is kind of feeling around in the dark a little bit. And when you find that kind of patch of grass that you're like, okay, we might be on the right path here, you kind of bring everyone to that point and then kind of let everyone feel around a little more. And I think that's kind of how it has to work. I think it's really hard a priori to know these things in advance. I think you can have intuition and I think our researchers tend to have kind of better intuition than the average, but it really is still scientific exploration.
Big Technology Podcast Host
Now I want to talk about whether how your plus subscribers or how the people who are using these chatbots will feel, using ChatGPT will feel the improvements. You know, there's an interesting comment from Ethan Moloch, the Wharton professor who is also experimenting with GPT5. He says, I think it's a big step forward, but not an unexpected one if you've been following the curve. He says, these models got gold at the Math Olympiad this week. I'm losing track of what massive advances mean. All the models are improving very quickly right now. Their question is if you have a model that's capable of graduate level or college level biology, and then it goes to graduate level biology. The average chatbot user may not feel that even though it's, even though it's gotten much smarter. So I guess I'm curious how you think this will be reflected the Increased smarts will be reflected in the average user's ChatGPT experience and the plus users experience who've been using these reasoning models for a while. Is it going to feel any different for them?
Brad Lightkap
Yeah, I saw something on On X that was akin to what you're describing, which someone basically kind of said. I think for the, you know, upper echelon of, of ChatGPT users who are probably in the paid tiers, who are, you know, active on a daily basis and are really kind of expert level using these systems, it's going to feel like an improvement, but maybe a more subtle improvement. But for the average user, for the free user, and we're bringing GPT5 to our free tier, it will feel like a dramatic increase. If you actually look at kind of the way free users have used ChatGPT, most of them have actually not experienced the power of the reasoning models. They mostly are using GPT4O and they mostly are kind of using it for this very kind of turn based, kind of very quick back and forth almost search like ways that I think don't actually kind of express the full capability of the model. And so for a lot of people this will be the first time using a model that has reasoning capability. And not only will it be the first time using it with reasoning, but it'll be the first time that they're experiencing a model making a decision about how long to think about a problem and how good of an answer to give relative to how hard the question is. And so we expect that for the average user it will feel dramatically different. Maybe for the kind of upper echelon of power user it may not feel as different. So I would agree with that. And I think that's a natural thing. I think that's actually a good thing that if you've been following the kind of rate of AI progress and you're kind of exploiting the frontier at every point, yes, it probably is dizzying, but it starts to feel more continuous than if you've kind of, you know, you're using what is basically kind of the, the best model from a year or two ago.
Big Technology Podcast Host
Right. I think you're so spot on about the average user is using it as like a search version of search. And they're like, well, what should I use? When they speak to me, they're like, what should I use AI for? I'm like just upload stuff and start talking to it about the things you upload. And I had a friend who like was uploading pictures of his son's football practice and asking it for tips about like, for coaching tips. And he was like fairly blown away that this thing is giving some like real analysis of positioning. I mean, I wouldn't use it as a football coach, but I do think that as the average user gets into these capabilities, it's going to be fairly mind blowing.
Brad Lightkap
Yeah, it's, you know, there's, everyone's got a little bit of a different entry point and that's the cool thing about it is like it's really personal for everybody. You know, we focused on health a lot with this release because that was one of the consistently common things that we heard from people as a starting point for how they've used powerful AI was when they're navigating a health journey. And so we really wanted to make an effort on making sure that if people are going to be using AI systems for health related things that we could serve them the best possible model. And so that was a big push for training GPT5.
Big Technology Podcast Host
Yeah, you brought up health a couple times. Do you want this to replace a gp? I mean, a lot of people are really underserved with healthcare, but I kind of worry about handing them a model that can hallucinate and saying this is the substitute.
Brad Lightkap
Now I don't think it'll replace gps, but what I think it helps people do is have more agency in their journey, a little bit more control over their, you know, the process of managing care. It gives people also just an awareness of the conditions. So, you know, we hear stories all the time of people managing conditions that, you know, they didn't really understand because no one actually took the time to explain it to them. And that's not because anyone did anything wrong. It's just because the health system, healthcare system, as it's designed, doesn't allow for there to be time to allow people to understand what it is that they're managing. And so even just giving people that baseline of education of like, you know, this is the condition you're managing, it's this common, it's going to express in this ways you're going to feel these types of symptoms. That's a huge unlock just in people's psychology for what it means to be managing a disease. And I think you still have to kind of work with a GP for care or a specialist for care. But having something that can kind of handhold you through that journey, I think for a lot of people is really comforting and in a lot of cases has actually proven to be helpful. Obviously we want to make sure that model is as accurate as possible. So being able to kind of push the model capability in that domain specifically has been a big area of focus. But we think now with GPT5 and obviously with future models, we've seen consistently the rates of accuracy and the rates of hallucination go up and down respectively. GPT5, I think, depends on how you measure it, but it's four to five times more accurate than its predecessors. And so, and that may be more accentuated in health, I don't know off the top of my head. So we have a lot of control, I think, and are pushing in the right direction on being able to make them reliable and accurate.
Big Technology Podcast Host
It's pretty interesting. We're talking about things so far beyond the chatbot. Like of course there's the chat function, but there's coding, there's health, and then of course there's enterprise or the way that businesses use these models. And businesses are notoriously slow, slow at implementing this technology and I'm sure there's so many approvals and reviews and it's tough to get things out the door. But I do think that when you have better models, this is sort of my belief, when you have better models, you sort of are able to push that forward much faster and much more effectively. So talk a little bit about what a better model in GPT5 will enable on the enterprise front or business front.
Brad Lightkap
Yeah, no, I would agree with your assessment there. I think in many ways I always kind of say we haven't yet seen the ChatGPT moment. I think in business for AI, I think AI was an amazing tool for consumers where your search space, so to speak, is more narrow and you've got a more constrained problem, you've got obviously a much more narrow context that you're processing. And I think you can kind of take things turn by turn with very few kind of external dependencies and you really just kind of let the model's pure intelligence shine. Businesses are a different category of difficulty. So you've got complex business processes, you've got a lot of multi user dependencies, you've got a lot of context that you have to process, you've got a lot of tools that have to be brought to bear. Those tools have to be used in succession in certain ways with certain guardrails and there's not as much fault tolerance for when they don't work. And so kind of goes back to what we were talking about earlier. I think you look at models like GPT5 and the impact that they're going to have in business, it is that baseline of capability that's moved up. It's their ability to use tools, to think in a structured way, to solve problems, to kind of recursively correct their own mistakes, to do long context retrieval, things like that, that actually these little things do matter on the edge. And you don't feel them every day in ChatGPT as an individual user, but you will start to feel them as a developer or an enterprise. And so we see this anecdotally too. I mean, we've worked with large enterprises and small startups and the entire spectrum in between on testing these models and GPT5 specifically before release. And we get a lot of feedback from companies like Uber and Amgen and Harvey and Cursor, lovable Jetbrains. I mean, all companies that have use cases that are highly, highly sensitive to the model's ability to reliably call tools, to deal with long context, to problem solve and reason effectively. And so it's a rising tide, I think, across the enterprise. And it's just really going to be on the developers we work with to, to be able to kind of, you know, understand the difference and the improvement and then implement them in the applications that they're building.
Big Technology Podcast Host
Yeah, it is interesting to know that you've been, you have been already working with many companies and letting them use GPT5 already. So has there been a sort of unified. We couldn't do this with the previous models, but we can do it now with GPT5, or is it sort of spread out in terms of the capabilities that it's now enabling?
Brad Lightkap
I would say it's been, you know, rising tide across the board. So everyone who's kind of benchmarking and all the companies that we work with typically now are pretty accustomed to evaluating and benchmarking performance across all the models that they use. But everyone has kind of reported, you know, much higher, kind of consistently higher performance on those evals. There are a few areas in particular we've seen spikes, so. So one is coding for sure. I mentioned companies like Cursor, Jetbrains, Windsurf, Cognition and others that we work with who anecdotally have all said that GPT5 now feels like the most capable coding model, whether that's in an interactive coding environment or more of an agentic coding environment. And then also one of the things that we see consistently now is its ability to reason and problem solve in very technical domains is significantly improved. And so Harvey's a great example of that. Where you've got Harvey AI working with legal firms and law firms is very, very reliant on its ability to reliably, accurately and consistently portray cases, that it's looking at legal analysis to provide that kind of level of structured thinking you want when you're doing legal analysis. And so I expect we'll see that carry over. I mean, financial services is a very interesting area. Heavy on data analysis, heavy on research, heavy on planning. Those are all areas that we've seen improvement in. And so as we continue to kind of see GPT5 permeate the market, we'll get more and more of that feedback and can continue to improve on those use cases.
Big Technology Podcast Host
And how about pricing? Because it's half the cost of an input. An input token is half the cost. Then GPT4O output token is the same. Are these lower costs going to help enable more use cases? And on that note, I mean, how does lowering costs sync with the fact that you've raised like 48 billion this year or announced 48 billion in funding? Is it really possible to lower costs and deliver on the expectations that the investors are expecting on that front?
Brad Lightkap
Yeah, so we've, you know, in OpenAI's history, every time we've cut cost, we've seen typically some corresponding increase in consumption that usually outweighs the cost cut. And so for as long as that trend holds, we will continue to cut costs on models. We know that there's this complicated dance that developers have to do between latency, model quality and intelligence and price. And I think what we've tried to do here basically is take the market's feedback on all three of those fronts and really place these models, these GPT5 models, not just the standard model, but also the mini model and the nano model on this frontier of quality, cost and latency that kind of optimizes for what we think the market needs to be successful. And so we tried to find a really attractive price target at a very attractive average latency. And then obviously with the kind of built in model quality and intelligence you get with GPT5. And so we will continue to push that frontier. And I think the more we push that frontier, typically the more we just see people want to use it for more things. And so for that equation to exist, we're very fortunate and it motivates us to try and make them better.
Big Technology Podcast Host
Are you ever going to be profitable?
Brad Lightkap
I hope so.
Big Technology Podcast Host
Okay, we'll take it. All right, Brad, before we wrap, let me be the first to ask you, when is GPT6 coming?
Brad Lightkap
Well, you're not the first to ask. I could tell you But I already. Yeah, no, don't. Twitter is quick on the trigger on that one. But, no, I mean, look, like I said, we think GPT5 is extraordinarily capable. We think there will be better models in the future. We know there will be better models in the future. For now, we're just focused on how do we get this in people's hands, how do we support the companies that are building with us using this model, and then we're still in the science of it. I think that's the exciting part, is we're in the first inning of it, and we ourselves are just understanding the paradigm we're in. And so this is, I think, an important first step. And you kind of have to understand where you are to understand where you're going. And hopefully the learning from this will make GPT6 much better.
Big Technology Podcast Host
Well, Brad, it's so great to have you on, especially today on GPT5 launch day. So whenever GPT6 comes, we'll have to do it again. Thank you so much for joining.
Brad Lightkap
We look forward to it.
Big Technology Podcast Host
All right, folks, GPT5 is out. You can try it on chat.com and it's going to roll out to everybody, so give it a look. And we'll be back to talk more about it tomorrow, where Ron, John, Roy and I will break down the week's news, especially what the Latest is on GPT5. Thanks, everybody, for listening and we'll see you next time on Big Technology Podcast.
Big Technology Podcast Summary: OpenAI COO Brad Lightcap on GPT-5's Capabilities, Importance, and the Future of AI
Podcast Information
Timestamp: [00:00]
Host: Alex Kantrowitz welcomes Brad Lightcap, COO of OpenAI, to discuss the launch of GPT-5. The conversation aims to dissect GPT-5's advancements, its significance in the AI industry, and the future trajectory of artificial intelligence.
Timestamp: [00:17]
Brad Lightcap: Introduces GPT-5 as OpenAI's next-generation flagship model. Highlights its unique ability to dynamically decide when to "think hard" about a problem, enhancing user experience by eliminating the need for manual model selection.
“GPT5 abstracts all of that, so it makes that decision for you and it's actually a smarter model. So you're going to get a better answer in all cases...” [00:29]
Key Improvements:
Timestamp: [01:30]
The host probes why OpenAI emphasizes usability improvements over raw intelligence gains. Brad explains that intelligence is tied to the model's reasoning time, and GPT-5 optimizes this dynamically for better answers and user experience.
“Intelligence really is a function of how much time the model is going to be thinking...” [01:53]
Timestamp: [03:19]
Brad Lightcap discusses whether GPT-5 represents an incremental improvement or an exponential leap. He emphasizes that measuring intelligence is multifaceted, covering various dimensions like speed, accuracy, tool use, and structured problem-solving.
“...there's a lot of dimensions to play with depending on... how it can think about problems.” [04:33]
Timestamp: [05:07]
Explaining the shift from merely scaling model size (as seen from GPT-2 to GPT-4) to incorporating post-training techniques. This dual approach acts as a force multiplier, enhancing intelligence and enabling capabilities like tool usage and multi-step reasoning.
“We now have this kind of other category of training which is post training...” [05:07]
Scaling Laws:
Timestamp: [09:31]
The conversation shifts to whether GPT-5 qualifies as AGI. Brad Lightcap argues that while GPT-5 exhibits many AGI-like traits, such as reasoning and problem-solving, it doesn't fully embody AGI. He emphasizes the complexity and evolving nature of defining AGI.
“...we start to see the traces and the pieces of that overall system for generalized learning...” [09:31]
Timestamp: [12:04]
Discussion on whether OpenAI is prioritizing increasing intelligence or focusing on ancillary capabilities like memory and continual learning. Brad affirms that both aspects are crucial, highlighting ongoing research to close gaps in areas where models still struggle.
“...it's about trying to build those ancillary capabilities.” [12:04]
Key Areas of Focus:
Timestamp: [16:38]
Brad Lightcap differentiates the impact of GPT-5 on average users versus power users. While advanced users may notice subtle improvements, average users, including free-tier subscribers, will experience dramatic enhancements, especially in reasoning capabilities.
“...for the average user it will feel like a dramatic increase.” [16:38]
Implications:
Timestamp: [19:31]
The discussion delves into GPT-5's role in healthcare. Brad Lightcap clarifies that GPT-5 is not intended to replace general practitioners but serves as a tool to empower patients. It offers educational support and aids in managing health conditions by providing accurate information and reasoning.
“...have more agency in their journey... It's a huge unlock just in people's psychology...” [19:45]
Key Points:
Timestamp: [22:17]
Brad Lightcap highlights GPT-5's transformative potential for businesses. Unlike consumer applications, enterprises require models that can handle complex processes, multi-user dependencies, and integrate various tools reliably. GPT-5's enhanced reasoning and problem-solving abilities make it a valuable asset for businesses.
“...their ability to use tools, to think in a structured way, to solve problems...” [22:17]
Impact on Enterprises:
Timestamp: [26:33]
Discussion on GPT-5’s pricing strategy, which includes halving the cost of input tokens while maintaining output token prices. Brad Lightcap explains that cost reductions typically lead to increased consumption, supporting further innovation and broader use cases.
“...every time we've cut cost, we've seen typically some corresponding increase in consumption...” [27:03]
Economic Considerations:
Timestamp: [28:08]
When asked about GPT-6, Brad Lightcap remains non-committal but acknowledges future improvements. He emphasizes the focus on fully leveraging GPT-5’s capabilities and understanding the current paradigm before moving forward.
“We think there will be better models in the future... Understanding the paradigm we're in.” [28:21]
Outlook:
Timestamp: [29:06]
Alex Kantrowitz wraps up the episode by celebrating the launch of GPT-5 and teasing future discussions on its impact. Brad Lightcap expresses enthusiasm for continued collaboration and innovation.
Notable Quotes:
Brad Lightkap on GPT-5's Dynamic Reasoning:
“GPT5 abstracts all of that, so it makes that decision for you and it's actually a smarter model...” [00:29]
On Measuring Intelligence:
“Intelligence really is a function of how much time the model is going to be thinking...” [01:53]
On GPT-5’s Role in Healthcare:
“...have more agency in their journey... It's a huge unlock just in people's psychology...” [19:45]
On Enterprise Applications:
“...their ability to use tools, to think in a structured way, to solve problems...” [22:17]
On Future Models:
“We think there will be better models in the future... Understanding the paradigm we're in.” [28:21]
Final Thoughts: GPT-5 represents a significant evolution in OpenAI's language models, emphasizing dynamic reasoning, enhanced accuracy, and broader applicability across various domains, including healthcare and enterprise solutions. While not yet classified as AGI, GPT-5 lays the groundwork for future advancements, balancing increased intelligence with usability to deliver a superior user experience. The podcast underscores OpenAI's commitment to continuous innovation, cost efficiency, and addressing complex real-world challenges through AI.