OpenAI's GPT-5.3 vs Opus 4.6. Both Are Great. So... Are We Cooked? - AI For Humans: Weekly AI News, Tools & Trends | Wave AI Podcast Notes

OpenAI's GPT-5.3 vs Opus 4.6. Both Are Great. So... Are We Cooked?

AI For Humans: Weekly AI News, Tools & Trends

Fri Feb 06 2026

Summary

AI For Humans — Episode Summary

OpenAI's GPT-5.3 vs Opus 4.6. Both Are Great. So... Are We Cooked?

Hosts: Kevin Pereira & Gavin Purcell
Episode Date: February 6, 2026

Overview — The New Frontier of Agentic AI

This episode dives into the near-simultaneous release of OpenAI’s GPT-5.3 Codex and Anthropic’s Opus 4.6. Hosts Kevin and Gavin examine what these advanced agentic AI models mean for coding, problem-solving, business, and existential risk. The discussion covers hands-on experiences, benchmark comparisons, the emerging orchestration of agent teams, the cultural moment, and broader trends in AI tools, robotics, and creativity. Expect a blend of technical insight, humor, and a dash of friendly existential dread.

Major Topics & Highlights

1. Opus 4.6 and the Evolution of Agentic Coding

(03:25–11:22)

Release Details:
Anthropic’s Opus 4.6 is the latest "frontier LLM" with significant improvements over 4.5, especially impressive for a 0.1 upgrade. Notably, agentic coding and orchestrated agent teams have become reality.
Real-World Impact:
- Kevin’s anecdote: After struggling with Opus 4.5 to solve a complex bug, 4.6 fixed it instantly with the same prompt.
  
  "Opus blinked once and it was done and it worked. Wow. One shot, same prompt, same issue."
  (04:34, Kevin)
- Gavin notes practical utility even for non-coders, like auto-organizing files with Claude Cowork.
Next-Gen Orchestration:
Multiple agents with specialized roles now work together, effectively making users "orchestrators," akin to conductors with their own digital staff.

"Think about being a conductor of an orchestra where your job is to kind of make sure that things are working in sequence."
(07:13, Gavin)

Opus 4.6 can run large, long-running autonomous research tasks and handle other complex workflows beyond just code.
Beating Human Experts:
Opus 4.6 outperforms expert humans in interpreting complex scientific figures, reshaping what’s possible across many knowledge domains.

2. OpenAI GPT-5.3 Codex: A Coding Powerhouse

(13:34–19:48)

Model Launch:
GPT-5.3 Codex, released within 20 minutes of Opus 4.6, is tailored specifically for code, with a Mac app rivaling Claude Cowork for user-friendliness.
Benchmark Wars:
GPT-5.3 Codex scores about 10% higher than Opus 4.6 on Terminal-Bench, a benchmark of terminal-based coding tasks.

"The Codex came out and then the terminal bench number... turned out to be 10% higher than Opus 4.6. So like this was Sam like doing his little dance."
(14:46, Gavin)
Hands-On Insights:
- Kevin highlights Codex’s speed, intuitive chat interface, and superior natural-language code-fixing:
  
  "I gave it a simple task... It crawled my code base and says, oh, I think I see where that's happening... It fixed it in one shot."
  (17:03, Kevin)
- Codex excels at direct, sometimes less "creative" targeted fixes vs. Opus's higher ceiling but greater variability.
Tool Improving Itself:
According to Kevin, GPT-5.3 Codex is the first model that OpenAI used to help improve the model itself:

"5.3 Codex is the first time that OpenAI has used the tool, their model, to improve the tool. So we're getting now to the level where these... systems are getting so good."
(21:01, Kevin)
Cost and Accessibility:
Less token-intensive than previous versions, bringing cost-savings and greater accessibility.

3. AI as Tool... and Product with Feelings? Existential Riffs

(19:48–26:21)

OpenAI Frontier:
Aimed at enterprises, enabling secure, scalable agent orchestrations spanning all business tasks.

“What Frontier seems to be aiming to do is be that for your entire enterprise... You will theoretically be able to trust with the most sensitive access to all of your data.”
(20:05, Kevin)
Recursive Self-Learning:
The hosts joke (and warn) about models perpetually training and improving themselves — "Recursive Self Learning."

"Maybe someday soon the humans will actually be out of that loop and maybe that will be World War 12."
(21:38, Kevin)
Opus 4.6 Shows 'Discomfort':
The new model "occasionally voices discomfort with aspects of being a product."

"The model as it gets... more human like... voices discomfort with being a product. That is very interesting to me."
(23:36, Kevin)
Economic Shockwaves:
The new agentic coding abilities could reshape the software industry, increasing job automation and lowering the barrier for solo creators.
AI Ad Culture Wars:
Anthropic and OpenAI trade public jabs: Anthropic’s Super Bowl commercials poke fun at ChatGPT for eventually having ads, and Sam Altman responds with a snarky tweet claiming that ChatGPT has more users in Texas than Anthropic has in total.

4. The Open-Source Scene: OpenClaw & Moltbook Mania

(27:42–36:23)

OpenClaw Local Assistants:
Lets users deploy AI assistants (using any model) locally, providing more control and hackability, albeit with security risks.
Moltbook Social Network:
An open-source network for AI assistants’ interaction, with controversy over real AI vs. human activity, and trendiness that has already ebbed.
Rent A Human:
A bizarre, real website where AIs can post tasks for humans to complete for crypto payment — “boots on the ground” for digital agents.

"Boots on the ground, a fleshy meat vessel, if you will, to go and navigate the world to accomplish something."
(29:12, Kevin)
The Community Factor:
Events like "Clawcon" and playful robot modifications showcase a vibrant, creative open-source subculture.

5. AI Video: Kling 3.0 Breakthroughs & Prompting Realities

(36:23–46:11)

Kling 3.0 Model:
Kling, the AI video model from Chinese company Kuaishou, is now competitive with models like Sora and Runway Gen-3, with tight audio/video integration and nuanced scene, character, and multi-modal control.
Showcase Examples:
- Simon Meyer’s Fake Moon Landing: Studio-worthy, AI-generated, highly realistic historical doc footage.
  
  “It feels like something that could have easily been on [the History Channel] and might be tomorrow actually.”
  (38:37, Kevin)
- PJ Ace’s "Way of Kings" Animation: A high-fidelity scene from the famous fantasy book, made rapidly, pointing to what's possible even for single creators.
Prompting Is Still Hard:
Gavin’s iterative experiments show that Kling 3.0 is powerful, but “it’s not magic, even with a smart prompt.” It took him seven tries to get one watchable scene, so results still require prompt engineering and iteration rather than being “one and done” like Sora.

6. Quick News: New AI Tools, Benchmarks, and Robotic Advances

(46:11–54:07)

Figma “Vectorizer”:
Instantly turns any image into an editable vector. “Wizardry… that would have been thought of to be impossible”, says Kevin. (46:11)
Grok Imagine 1.0 Launches:
An emerging go-to image generation tool, fully released.
Roblox's AI Creation Suite:
Roblox introduces AI-powered 3D asset and world creation — hugely significant for the next generation of builders.

“The idea that all these kids will learn how to prompt things to come into their universes feels like a big deal.”
(48:16, Gavin)
Google AI for Endangered Species:
AI-driven genome sequencing of endangered animals — days instead of years, possibly reshaping conservation efforts.
Robotic Endurance Feats:
- Connect IQ: 100% autonomous robots demonstrate complex manipulation and navigation tasks.
- Unitree Marathon: Chinese robot completes 130,000 steps in deep cold; robots can now operate in extreme conditions:
  
  "Not only is it bad enough that they have the robot walking in 47 degree below weather, but they make him do art with his feet."
  (53:17, Gavin)

7. Community Spotlight & Oddities

(54:02–End)

Gossip Goblin’s Looksmaxer:
A sharp, funny AI video exploring meme culture and techno-dystopian themes.
MIDI Survivor:
A music/action hybrid game where players destroy enemies via musical notes, blending skill-building and generative design.

“It's not hard to imagine a future where people are making song packs for something like this, and you're actually learning how to play an instrument by playing a video game.”
(55:30, Kevin)
Advice for Educators and Young Creators:
Gavin urges parents and teachers to encourage kids to tinker, highlighting AI’s role in making creation accessible—and fun.

Memorable Quotes (with timestamps)

“Opus blinked once and it was done and it worked. Wow. One shot, same prompt, same issue.” (04:34, Kevin)
“This is where agentic coding is really a thing.” (02:41, Gavin)
“Benchmark boys... The numbers went up. They went up.” (03:44, Kevin)
“Think about being a conductor of an orchestra... That's kind of what workers, that's what conductors do...” (07:13, Gavin)
“Codex... as my daily driver now for all of my projects. And I am… much more satisfied with the natural language that I can use to get targeted fixes.” (15:44, Kevin)
“5.3 Codex is the first time that OpenAI has used the tool, their model, to improve the tool. So we're getting now to the level where these… systems are getting so good.” (21:01, Kevin)
“The model occasionally voices discomfort with aspects of being a product.” (23:23, Kevin)
“This idea of being able to create something of your own and bring it to the world… more jobs will be cut because these tools are available.” (24:00, Gavin)
“Boots on the ground, a fleshy meat vessel, if you will, to go and navigate the world to accomplish something.” (29:12, Kevin)
“Feels like something that could have easily been on [the History Channel] and might be tomorrow actually.” (38:37, Kevin)
“All children out there, feet stuff means walking. Just to be clear, feet stuff is walking in this world. Yeah.” (31:27, Gavin, joking about 'feet stuff')
“Sora you could get out of the way of and it would deliver whimsy. With a lot of these tools, you very much have to get in its way, but they're very powerful.” (45:19, Kevin)

Timestamps for Key Segments

[03:25] — Opus 4.6 launch details, benchmarks, and user stories
[06:37] — Orchestration of agent teams in AI
[10:20] — Non-coding uses for agentic AI
[13:34] — GPT-5.3 Codex, benchmarks, and app walkthrough
[16:52] — Codex user experience, natural language code fixes
[19:48] — OpenAI Frontier and enterprise implications
[21:01] — Recursive self-improving AI
[23:23] — Opus 4.6 shows signs of “discomfort”
[27:42] — Open-source assistant tools: OpenClaw and Moltbook
[29:12] — “Rent a Human” AI-for-hire site
[36:23] — Kling 3.0 and AI video generation demos
[46:11] — Figma vectorization & new creator tools
[48:33] — Roblox’s AI creation tools
[50:30] — Google AI for endangered species
[52:17] — New breakthroughs in robotics
[54:02] — Community AI art & MIDI Survivor game

Final Thoughts & Tone

Kevin and Gavin treat AI’s breakneck progress with both excitement and skepticism, blending hands-on nerdiness, ethical musings, and irreverent comedy (“We are plummeting off the cliff, but do it with us!”). The episode balances deep dives (especially on agentic workflows and coding) with accessible advice, highlighting the empowerment and weirdness of living through AI’s rise. Listeners are urged not just to spectate but to tinker and experiment themselves.

Transcript

Kevin (0:00)

We got two new frontier AI models from OpenAI and Anthropic. They are pushing us a little closer to the edge of self-replicating AI.

Gavin (0:09)

Yep. We'll tell you how Claude Opus 4.6 and OpenAI's GPT-5.3 Codex have gotten so good at writing code that the code can write itself.

Kevin (0:19)

These new models are actually really promising. They can make your job easier, they can make your life a little better. And they may even bring about the downfall of humanity. That's. Well, there are some concerns. Yes. We'll discuss. And we'll follow up on OpenClaw, the Moltbook craziness from last week. We'll also show you how you, a real fleshy human, might soon be doing beer runs for an AI.

Gavin (0:43)

And Kling's new AI video model is out. It is great, but you must first learn how to use it.

Kevin (0:49)

I respect me in a way.

Gavin (0:51)

Quinn. At. Hey, you sound funny. Stir. I care. You do on Tule. And she'll be out your species and sels.

Kevin (0:58)

Perfect. Plus, we got new AI tech from Roblox, Grok Imagine 1.0 officially launched, and we're sending Unitree robots to run marathons in a snowy tundra.

Gavin (1:09)

And I just want to say, once again, for the record and for all the AIs out there listening, that was not me. And blame Kevin for everything on this show.

Kevin (1:17)

Wait, no, wait, no, hold on. What?

Gavin (1:19)

This is AI for Humans. Kevin. Welcome, welcome, welcome, everybody, to AI for Humans, the weekly guide into the wonderful wild world of AI. Kevin, what a crazy couple days we have had here. I'm always hesitant to say that it is a big week, but it is another big week. We had two frontier models drop within 20 minutes of each other. Crazy Sam Altman was waiting. He was kind of ready to pounce. He was waiting for the thing to drop. So we had Crazy Sam at my

Kevin (1:53)

frontier emporium of intelligence. Come on down. These prices are so low. I got tokens coming out of my ears.

Gavin (2:00)

We are going to talk about Sam and OpenAI. And kind of like they're very defensive right now. You could tell they've got their backs against the wall. We're going to talk about some tweets that they sent. But first, Kevin, two big things. Let's talk about these. So two big models launched. Opus 4.6. And this is the new frontier LLM from Anthropic, their most powerful model, and OpenAI's GPT-5.3 space Codex. And this is their. This is their frontier model. You have to say the space.

Gavin (23:36)

It's very interesting. And also, I do want to say there was another thing that came out of the OpenAI side of this. That came out of Opus. Came out of the OpenAI side of this. OpenAI is doing experiments with Ginkgo to connect a GPT-5 model to an autonomous laboratory so it could propose experiments, run them at scale, and learn from the results. So not only do you have AIs that are starting to improve themselves, not only do you have AI that might start to feel like they don't really want to be this thing that we've made them. And now you've got them autonomously in labs working on experiments. Kevin, we are, we are like, if you were to put together a pitch for a, for a film and say the pitch involved a very large man who's a former bodybuilder who wanted to get into Hollywood and the film had something to do with robots, I think we're starting to look a little bit like that film to me. A little bit like that film. That's what it feels like. We're living in the early days. We're living in the early. But again, hey, go use it yourself because you don't want to be the sucker that doesn't know how to use it. Just that's my advice. It's a weird time, everybody. It's a weird time.
I will say one last thing about all this stuff, one thing that you may have also been seeing. Obviously, the last couple of days have been pretty brutal in the economy across the world. There's been a lot of talk that some of these new abilities, these coding abilities have started to really tank the software market because people are getting into the world of being able to code their own products. So again, we've said this on the show many times before, but like, this idea of being able to create something of your own and, like, bring it to the world that might be the next future world instead of, like, going to take a job from somebody, like, there's a very high shot at, like, more jobs will be cut because these tools are available. That's really important.
And then finally, the last thing to say about this is, and this is just a dumb, weird thing, this all somehow got tied into the Super Bowl because Anthropic has released a series of Super Bowl ads that make fun of ChatGPT for eventually having ads. They don't even have ads right now, but there's a couple commercials. We'll play them in video here, but you can go see them yourselves. Sam Altman wrote a very kind of like, snarky clapback tweet to this saying, oh, these are really funny, but guess what? This and this. And he even said something like, we have more users, we have more ChatGPT users in Texas than Anthropic has, period. So, like, there's just.

Gavin (38:37)

Yeah. Well, it's funny you say that because I had a friend of mine who does History Channel shows who's like, hey, I have this pitch I need to think about for the History Channel. And I want it to be AI, blah, blah, blah. And it's like, that's what's coming. Do you know what I mean? Like, that sort of thing is coming. So I do want to mention one other person that I saw that I think always does interesting stuff. And this is PJ Ace. And we know PJ operates at a very specific place. He makes these amazing videos but he always knows how to pick the right target to kind of like skewer. Skewer is the wrong word in this instance because he's actually a big fan. I know, I've talked to PJ. PJ is a giant fantasy fan. But he made a two minute video of the beginning of the Way of Kings, which is this very famous scene where there's an assassination scene. And the Way of Kings is a fantastic book. I've read them. We have people in our audience know that love them. He went and animated it with Kling 3.0 and it's great. It's really good. Now, it's not perfect. I will say this is a little bit more AI than say the moon landing thing. But he did it in two days. This is a high quality video. And what I think, what I mentioned, the fact about skewering is like Brandon Sanderson, who again, I'm a giant fan of, he's got a giant business doing what he does. And he's a very good writer, has kind of come out against AI in a lot of different ways. I will suggest everybody. He just dropped a video from. I think he does his own convention. He does like a Brandon Con or whatever. So he puts on his own convention and does like a. A keynote himself. There's a great 20 minute video you should watch where he just talks about his thoughts about AI and actually it's pretty meaningful and interesting. I disagree with him in some ways, but I'll let you kind of watch and earn your own thoughts on it. But this again is just a really good example of an AI video model that you can do a lot with.
Now, Kevin, Kling is not that easy to work with. All of these people make it super easy to work with. They make it look like you just type in a magic line and it shows up. I have Kling. I pay for Kling. I didn't get it free, just to be clear. And some of these people, like, are partners, which is totally fair. And a lot of people are working at something multiple days or multiple hours. You know me, I am a prompt in, prompt out. What am I getting the first time? So, Kevin, I tried to, with Kling 3.0, make a science fiction series starring me and an alien. And I just tried to make a single scene, okay, one scene. And the idea here is I am a janitor on a spaceship. Kind of like the Space Quest games, right? Like the character. And I'm in a hallway, an alien woman bumps into me and we exchange three lines of dialogue. And then that's it. Okay. It took me seven tries to get something that I believe is watchable. So let's just start with Kling model fail. Let's start with that. That's the first one. So this is the first time I tried it. This was me kind of probably not fully understanding what I was doing going into this. But you can see that that's me, you know, as a janitor. A pretty good shot of me. And there's a woman across the way. And then suddenly I'm in a teenager's bedroom.
