Transcript
Alexander Enbirakos (0:00)
It kind of sucks to like go and write this prompt and then like wait 10 minutes. What you really want when you hire someone is to kind of tell them what the job is, give them the credentials to all the tools and just have them pick up work automatically. The goal is to get to an agent that is basically a teammate and is seeing what's going on on your team and picking stuff up for you. This form factor of an agent working on its own computer in the cloud is the future and is incredibly powerful and worth figuring out how to get right.
Podcast Host (0:25)
What happens when AI stops helping you autocomplete code in and starts acting like a real teammate? Today we're exploring Codex. OpenAI's coding agent, Anjaney Mitha, is joined in studio by Alexander Enbirakos, who leads product for Codex at OpenAI. They discuss the origin story, why reasoning models plus tools unlock agents, how developers are actually using Codex in the wild, and what all this means for the future of software engineering, from debugging and prototyping to how CS students should think about their careers. Let's get into it.
Anjaney Mitha (1:01)
Hey, Alex.
Alexander Enbirakos (1:02)
Hey, how's it going?
Anjaney Mitha (1:03)
Good. Thanks for coming.
Alexander Enbirakos (1:04)
Yeah, good to see you again.
Anjaney Mitha (1:05)
You are one of the folks working on product for Codex, which is probably one of the most exciting launches to come out of the OpenAI team for me, at least in a while. So for a lot of people though, it was confusing for sure because it was the fifth Codex release from OpenAI. But of course it's completely new in different from the previous Codexes. So let's just start with the origin story. What is the backstory on how the current version of Codex came to be?
Alexander Enbirakos (1:35)
Yeah, and man, our naming is so fun at OpenAI. I'm excited for the naming to make more sense over time with Codex as we bring this all together. But yeah, let's go back, way back to the beginning. The first Codex product was actually released, I think it was in 2021. I might get the Eurong, but actually it was like a code completion model that powered GitHub copilot and so recently we were basically talking about a whole bunch of coding stuff we want to do like models, but models in product. We were thinking about what to call it and we just felt like the Codex name was really cool and so we wanted to go back to it. So how did this Codex product come about? Basically, we've been thinking a lot about agents, as everyone has, and before that we've been thinking about reasoning models and basically in our minds, one way you could Think about an agent is you take a reasoning model and then you give that reasoning model access to the tools that some agent would want to use or some human in a given function would want to use. And an environment that tool works with take side effects and then from there you come up with what kind of tasks would this person do. So basically you have this model, you give it tools, and then you make sure that the model is really good at doing the specific tasks that some function would do. And the task bit is actually super important because if you think of like, there's a difference between like writing and journalism. And similarly there's a difference between like coding and like software engineering. So we've been doing a lot of this tinkering with reasoning models internally, getting them to write code. And so the first tool we'd given them was like terminals. And we've been like poking at this for a while and just starting. It was like actually the. One of the first, like real like feel the AGI moments for me was when someone showed me a website editing itself by being prompted to itself. Because we had this like reasoning model, like basically very hackily trait connected to a terminal. And then, you know, it was editing this terminal.
