
Loading summary
A
Hello, I'm Andrew Main and welcome to the OpenAI podcast. There have been a lot of exciting releases from OpenAI recently, including GPT 5.1, Sora and one of my favorite new applications, ChatGPT Atlas. Today we're going to be talking to the team behind it, Ben Goodyear and Darren Fisher, and explore some of the reasons for why OpenAI decided to make a browser, what the future of agentic capabilities meant and where everything's headed next.
B
The time is right because it's actually how people should be starting their journey.
C
We're moving to a world where you can just tell the computer what you want.
B
So I think it's kind of powerful, this idea that the agent has its own workspace.
C
My view for this has always been that this is like a long term investment.
A
Let's begin with what is Atlas and why?
C
So Atlas is a new kind of browser for an era of the web where people are interacting with new technology in natural language. And so it's the kind of browser where you can just tell it what you want, whether it's to find the next outfit that you're going to buy or to help you solve a really hard problem, and then it can help you harness the web to get a bunch of stuff done. And so central to this idea is that if we take ChatGPT and make it the heart of your browser, not just an add on, it's something that can actually help you make sense of the content that you're seeing on the web. It's something that can help you take action on the web. It's something that can learn from your browsing to personalize your experience and help you with tasks that aren't just done in a few minutes, but might take days or weeks or months, or just generally help you become a more curious, more effective person.
B
And it can help you come back to a task that maybe you've not had a chance to work on in a while, because it will bring remembered what you were doing for you and help you get right back into where you were.
A
Why now?
C
I think the progression of technology with these AI models has been really stunning to watch over the past couple of years. And it feels like we're at this sort of sweet spot where the capabilities of not just the LLMs that have powered ChatGPT, but also sort of this new area of computer use and some of the other surrounding technology is at a point where we can build some really compelling experiences for people. So we wanted to give it a shot.
B
Like Ben said, the models have gotten so much better, and they continue to get better. And you see the slope of innovation there and the pace of improvement. If you look back at the beginning of the year, when Operator first came out, for example, and it sort of hints at some of the potential. And now you fast forward to where Atlas is with Agent and how much faster it is, how much more capable it is. Just look at that slope and you start to project what's it going to look like next year, five years out, et cetera. And to get the sort of get that foundation in place. That's what we were excited about. And it felt like the right time for me, personally, I felt like I had made that transition to seeing how ChatGPT makes so much sense in my life and how much I was using it, and feeling like putting ChatGPT at the core of a browser, not just another tab that you have to go to, but to have it be at the core. And part of that flow time is right, because it's actually how people should be starting their journey. And so we want to make that just so natural and easy. And so I'm really excited that we've been able to bring Atlas up and I'm excited to bring it to more platforms.
A
You two have quite a bit of history working in browsers. Netscape, you've worked on Firefox and Chrome. And I'd like to kind of understand where you think we are right now with browsers. We've got like over a 30 year history of these, and now it seems like they seem like they'd been changing a little bit incrementally, and then all of a sudden now we're adding AI to it, et cetera. And how do you look at the browser landscape?
C
Well, I think we have entered this really exciting time on the web where we've added this very human form of interaction in the form of these large language models that you can just speak to the software and have it do the right thing for you. I think that's really going to transform the way that people get stuff done online. And we've gone from a world where you've had to remember website addresses, you've had to go and search for them, and now you're just going to be able to ask for the task that you want to get done and you're going to see it get done.
A
I've noticed that a lot of people were thinking like, well, we still have browsers. That was a question, are we going to have browsers? And it seems like browsers are going to be Here to stay for a while. Is that something you both feel?
B
It's a tool that people reach for a lot. You get a lot of things done on the web. And using a browser, it's hard to imagine that not being a big part of how people use their computers. I think that we've been through many phases of the Internet and many phases of browser development. There was a time when, here comes mobile. Why would anybody use their laptop anymore? Why would anybody use a desktop computer? And yet people continue to use desktop computers. They reach for it for very different things, maybe, and now that they can also reach for their phone for certain things. But the web browser continues to be such an important tool on your computer for how you get work done, how you do research tasks, how you look for information and do that kind of work. Substantive work happens within the browser, happens on the Web. Don't really see that changing. If anything, I see that growing because it's this conduit to all the world's information. It's such an easy platform for people to bring experiences and make it available to everybody. The browser just makes that so easy.
C
There's something sort of interesting about it where like every of all of the technological advancements that we've had over the past 25, 30 years with the Web, there's something very durable about the browser, where even if you look at this most recent wave of generative AI with ChatGPT launching, it launched on the Web, it's, I think, very powerful reflection of the capabilities of this platform.
B
And the platform itself is amazing. I don't really need to recount all of the ways. It's amazing. But the fact that it's this very inherently open platform content is published to an. An open platform, an open Internet, where anybody can stand up a browser to consume that content. There are really no gatekeepers when it comes to the web, which is a really remarkable aspect to it. So people can freely publish information, and people can freely go and find that information. And so it allows it to just blossom and grow and evolve in different ways. And then it's very natural that you might want to take something like an LLM and point it at it, because now it can, on your behalf, try to understand it and help you navigate it. So the idea that it can do all that means it just makes it so much easier when you're trying to take advantage of all that information that's out there as a user. Here it is. It can go and find it for you or understand it for you, or explain it to you. I just want to say I think it's been really interesting to see the evolution of the web. We got started in an era where it was coming off of the dot com boom. Right. The needs of the browser back then were different. The kinds of things people were doing on the Internet were different. They were totally exploring. And then over the course of time, as we've worked on browsers, the kinds of things people tried to do in the browser was just so much more. Go back to, like, early 2000s, you had the evolution of, like, these more advanced web applications. Like, you know, I remember marveling at Google Maps when it first launched, the fact that you could just scroll and pan through a map so effortlessly. And then it goes on from there. You know, all the different kinds of web apps that people take for granted. Things like YouTube or, you know, I'm going to list a lot of Google apps because I worked at Google, but, you know, Gmail, Google Docs, all these things. The kinds of things that. I mean, I can do all of that in my browser. It's kind of amazing.
C
This becomes sort of like this operating system for your life on your laptop kind of deal.
B
Yeah. And it pushes what the browser needs to be able to do. Right. And so this era that we developed Chrome in was an era where people were already pushing the boundaries of what you could do inside of a browser. But they had good reason to do it because the web being this platform where it's so easy to put new experiences onto the Internet, meant there was a lot of motivation to do that. And you want to. Developers are being very creative in how they could push the bounds of what the browser could do. But fast forward to today and it's like you have all that. That's this foundation. And now just think about how the world's gotten more complicated. I think there's a lot of opportunity for it to feel overwhelming, the amount of complexity for people.
C
Actually, I think even going back a few years, I remember when I was a kid and my school friends and I would trade shareware with each other on floppy disks, as you did back at that point. And my mind is just not meant to retain certain types of information. And so all of the things like the command line for how to run, like, your unzip tool or however that worked, this was just something that felt entirely bizarre to me. And so in that sense, the web was kind of a really refreshing take on that. It was something where I could just go and click on things and explore without having to understand the underlying nature.
B
Of the Machine and view source.
C
That's true. But what I find with where the future of technology is going is that when we have these AI assistants that are attached to your computer, I think we'll find that we make that computing capability much more accessible to more people who aren't necessarily experts. Even, maybe not just even in terms of how an operating system works or how a browser works, but even how individual websites work, can express yourself more naturally as to what your intent is, what your goal is, and then the system can kind of figure out how best to accomplish that for you.
A
It seems like there was a lot of ideas about what the web could be, and part of it is that a lot of standards were things that were sort of decided after the fact, or we had to sort of go here and do that. And then there was things that would have been nice to have, like the Semantic Web made sure that everything was sort of annotating. You did that, but just in the real world, the corporate world, it's often hard to do that. And now did you think you'd see a world where, hey, we could have LLM sort of understand this and make that.
B
Well, I think it's amazing to see. This is like kind of the magic of, of these modern AI models is that they are really able to interact with things the way we interact with them. So systems. And that's, that's, that's, of course the world is designed for people with eyeballs and online people with mice to click on things or fingers to tap on things. And so you talk about the Semantic Web, of course it would be really nice if people would publish websites that were more inherently understandable by machines. But their motivation is to go where the users are. And as much as we make a push for websites that are more accessible, screen readers and whatnot, the reality, of course, is that that's just not where developers spend their energy first and foremost. Right? And it's usually an afterthought to make sure that you make everything extremely accessible. So kind of the beauty of these AI models is that they kind of meet the technology where users are. You know, technology is designed for people to consume. So you take a look at the way that it interacts with language and understands language. It can interact with language the way we interact with language. Self driving cars, they can interact with the roads and the systems of transit, the way that we interact with them. And I think that's kind of the beauty of these AI models is that they can be developed for this world that was designed for humans. And so that extends naturally to the browser.
C
I think it's not necessarily that we predicted exactly 100% how all of this would play out, but there's some very powerful ideas in that original Internet where there was this idea that it was structured data that would be read and interpreted by a machine and then presented to the user in some way. And so with the original web, there was this idea of a user's agent, a user agent, and that's the browser. And then that takes that machine readable content and then applies some presentation preferences to it. Maybe you like your font to be a little bit bigger so it's easier to read. Maybe you like it a certain style or weight or other stuff like that. And it doesn't. That was back in the original web was that idea. And so I think that that carries forward in today's era actually very, very well. And you can view where we are today as just sort of the natural endpoint or a continuation of that, that.
B
Journey evolution of the user agent. Right. So back in the day, even in Netscape browser, you could write what was called a custom style sheet or a user agent style sheet. You could override the colors of any website and people maybe were who were more sophisticated would know how to do that. Later on, browser extensions kind of made that a more universally available kind of thing. People could write an extension, share it with other people more easily. But it just makes a lot of sense now to empower an LLM to be able to go and on your behalf, as Ben said, to really supercharge that user agent to be able to do more things on your behalf.
A
Yeah, there were a lot of interesting ideas, I think, going back kind of at the dawn of all of this. And you look even the names of some of the tools like Gopher and Watson and Sherlock and whatnot, and how it was kind of more proactive. The idea that as you mentioned, these are sort of tools that don't just aren't document viewers. And I think that we kind of take the browser for kind of granted. And the idea that it just sort of like it just shows me the website and I can see where it's helpful for you two probably having spent most of your careers working in the space of browsers and trying to understand that. And it seems like there's like an inordinate amount of complexity there that's invisible. Could you give me an example of like, you know, the kinds of things you have to deal with when you're trying to figure out how to make something work?
B
Browsers are Maybe surprisingly complex. I don't even know where to begin. I'm amazed at how much work there is and how much technology goes into building a browser.
C
It's basically like an app platform or a mini operating system that's running on your desktop. So everything, every discipline of computing, it feels like there's like you can nerd out on some aspect of the browser.
B
I was just having a conversation over lunch with one of the engineers on our team explaining how our OWL works, which is our embedding of Chromium that runs out of process. I was explaining the rendering model for this and it kind of led to a conversation about how Chrome first worked when we first built it, and then how the advent of GPU accelerated rendering evolved. And now fast forward to the way it all works today and what we're trying to do with Owl and just sort of the depth of complexities there. And I could go on and on.
A
Actually, it is interesting because I think that people kind of overlook like, kind of the browser wars led to technologies like running Node on servers to actually do stuff, which is, I think nobody even thought would happen, but because just trying to make these things much more efficient and faster. And again, I kind of marvel that anything works at all. And looking at some of the technical stuff you have released on ChatGPT Atlas and Understanding it's not just a plugin, it's not just a thing that adds a ChatGPT sidebar could explain a little bit more about the architecture.
C
So from a design perspective, I would say that we wanted to imagine the entire experience with ChatGPT at the heart of this thing. Not just, I mean, we have a ChatGPT extension as well that you can install in Chrome, for example, but there's some limits to what that can do. And so when we approach this from a design perspective, we wanted to just be really empowered to look across the entire browsing surface. And I think maybe 2 to 5% of what we want to do is there today. But we think that this being able to own the whole browser experience in this way gives us the opportunity to weave that ChatGPT magic in throughout. And that includes things, for example, like on any text field, you can invoke ChatGPT and have it help you write. And then as it is helping you write, that is your personalized chatgpt. It can sign your email as you, because it knows you from your other use of ChatGPT, ChatGPT. And so being able to build it in a way that enables these very richly integrated use cases felt Very important to have a browser as opposed to just an add on for an existing app.
B
And it gets to some of the foundational stuff. When ChatGPT is at the core of this thing and you enable things like the memories feature, it means that you can ask this thing, hey, what was that thing I was looking at again? And it's going to know and it's going to help you. Who hasn't had this sort of experience of oh yeah, I remember I saw some video or I saw something, what was that again? How do I get back to it? I want to share it with somebody else. Traditionally, you might go through your browser history or your YouTube history and scroll through there trying to find it. Or you're like, what was that tweet again that I saw? Or I was looking at a recipe with my son on the weekend. What was the third buffalo wings recipe that we found that I wanted to make? How do I find it again? And just to be able to easily recall it, because this thing is able to do that so effortlessly, it's a side effect of it just being there.
C
Probably what I would say is the biggest advantage for doing this, and especially in the way that we've done it, is how we thought about integrating this agent capability. And this is really where having our own concept of browser, including what it means to have a collection of tabs. If you think of your regular browser window that's a collection of tabs, then you can also imagine that your agent has a collection of tabs. Maybe each instance of the agent that you've chosen that you've asked to go off and do something for you, you might have like five of them, each running on different problems, and each one of those has its own collection of tabs. And of course they're not showing up on your top tabs because you didn't open them, but it nonetheless has them and it's working through them and it's getting information from them and it's processing it and taking action on your behalf and clicking on things and all that sort of stuff. That is the sort of functionality that you can have when you go and you design a system like this, sort of end to end, you can invent all these abstractions.
B
And you had this in the very, very first version. I remember when I joined Ben to work on this and he had already had sort of this idea, this idea that we would be able to segment tabs between the tabs that are the users tabs and that tabs that the agent is working on for your behalf and that kind of shows up in the product today. When you start an agent task, it goes off and it's going to work on whatever you asked it to work on. And it might need to open some additional tabs. And instead of those tabs just appearing in your tab strip and perhaps feeling a little discombobulating, because what are all these tabs? It has just accumulated some work in the background, and when it's done, then it presents it to you in a tabular form. You can go and click through and see what it actually did, or you're just happy with the outcome and you really didn't need to see all the intermediate steps. And so I think it's kind of powerful, this idea that the agent has its own workspace.
A
I would say for many people using this on the surface, it seems pretty easy to understand. I've got a Browser, I've got ChatGPT, but we also have agent mode. What is an agent task? What is not? What would be a ChatGPT task? Could you explain that?
C
Yeah. So agent mode is Basically, you're inviting ChatGPT to take action on the web on your behalf. So maybe you are looking at a website and you would like to do something on that site, but you're not quite sure how to do it. What you can do is you can just ask ChatGPT directly. Let's say you're on a spreadsheet and you want to synthesize a pie chart, and you don't know how to do that. You can just say, hey, make a pie chart with this data and it will go off and it will figure out how to use that software. Now, if you could think about sometimes some of the software that you use, it can be pretty complicated. So just being able to ask in natural language, in your own words, what you want to see, and then you could just sort of sit back and watch it take over and it starts moving the mouse around and doing stuff like that for you. It's pretty amazing.
B
And you could see how it's going to do it. So actually you can learn how to now make that pie chart, because it's going to show you, which is pretty cool.
A
I found it pretty useful. I like to study memory methods and stuff. And I have to do a thing where you have to have a system for decks of cards and stuff. And I didn't want to have to paste in a bunch of card emojis. And I'm like, can you just go do this for me? And it's like magic. It's like witchcraft.
C
One of the fun things I've seen people do around the office is they'll have written a doc and then they'll ask it to take a review, pass on their doc, and add some comments. It'll actually go into your doc and it can use the integrated commenting tool of whatever document editing system you're using. It will just add comments as if it was a collaborator.
B
Yeah, it's really amazing. Of course, being software engineers, we experience the model critiquing our code over the past year. Just amazing to see how much better it's gotten at that. Like, in the beginning, it was not always the case that it would find things that were useful. But it's these days I'm like, this thing is sharing important, amazing nuggets that are saving me from shipping bugs. And I can just. I can see the analogy to reviewing any document that I might have and just asking it to go and give me some style feedback or maybe some grammar suggestions or tone suggestions. And it's very exciting to think about how any tool you're using on the Internet, you could ask this, invoke this agent and ask it to interact with that in the same way that I would or somebody else would, and just see what it would do and maybe you'll learn from it or whatnot.
A
One of the things that's come up a lot during my conversations with the teams inside OpenAI is how much they're using the tools. They're using GPT5, GPT5 codecs to do that and how has that affected you? And do you think that's going to create an accelerated product cycle?
C
Absolutely. A couple of anecdotes on this one. One of the top Codex users at the company is on our team, and they're just sort of like raw productivity in terms of PR output is off the charts as a result of using this tool. It's really exciting to see what good PR output. Yes. So it's really exciting to see what experienced engineers can do with these tools, because they can. You can both sort of explore an area, help you explore an area, decide if something is worth doing, and then you apply your judgment, you kind of tell it what you want it to do, and it goes off and does it. And then for folks like me who maybe spend less time coding than I used to, I can also have it go off and prove some stuff out. So, like, everyone on our team is able to contribute. Our product managers are producing PRs, our designers are producing PRs because of these tools. And so I'm Like a true believer.
B
Yeah, I did a. For before ga, I did this week long refactoring to try to unlock a certain feature that we were trying to ship. And then I had one more to do and this time I asked Codex to do it and it was done within. The whole project was done within an hour and it was of similar scale and I was able to tell it, hey, just do this other one. Kind of like I did that one. So I had shown it the way and I just asked it to do this other task that was very similar. And it was, it was almost one shot.
A
One of the promises of really capable code tools that are able to write multiple languages, porting code from one language to another. We saw with Sora and how, you know, they said, hey, Android, bringing it to Android.
B
Yep.
A
Yeah. And people are like, oh great, when will that. Oh no, it shipped.
B
No, no. I mean, actually ChatGPT has been amazing at doing cross language translation for quite a while. I mean, coding languages. So in, you know, we're building, bringing this product to Windows now and we're actually going to be using Swift, which, because we are a bunch of. The team is full of Swift experts and we are excited to have a shared common code base.
A
And you use Swift in the Windows?
B
Yes, Swift on Windows. And so I'm not, you know, I think it's very. In my past experience, I was marveling even years ago at just how good ChatGPT was at being able to essentially generate code for me in Swift that did not exist on the Internet. So it was taking code that maybe was written for. Net and could translate it to Swift for me. And I was marveling at its capability or to generate a bunch of obscure winrt code that's normally a very tedious C code with lots of guids and all kinds of things that are very detailed, but it was all just spitting out this code and saving us just an enormous amount of time.
A
Yeah, I had Codex CLI spin up a Swift app without ever having to go into Xcode to paste anything in. And it was worked just right out of the box. It really impressed me.
B
It's kind of like one of the strengths of these models. So the fact that if you can ask the right question, get the right prompt and if it's on the right path and like how to build something, it can do it in any language. That's really not a problem.
A
So I've been using this and I've been switching it to agent mode to do stuff. And I know sometimes I can just leave a tab and go off and do something else. And sometimes it's like, hey, if you leave this tab, I'm going to pause what's going on there.
C
Yeah, so this is. Sometimes you'll have asked the agent to do something that's very sensitive. An example is it's looking at your email and we would like you to keep your eyes on the road, so to speak. I have a car that has sort of an auto drive feature and it wants me to pay attention to the road. It's helping me drive, but it's not going to let me check my phone or take a nap or something like that. And so it has a little camera that's watching my eyes, it's making sure that I'm paying attention. So you can kind of view the sensitive mode and in agent mode as kind of like that. It wants me to pay attention to this tab while it completes, just so that I feel like I have a good level of control over it. And in fact, if you look at the bottom of the tab, you'll see that there's a little bar that has a big red stop button in it. And if you've ever been in a machine shop, you'll know that the machines there have these big red buttons on them. If suddenly it starts to do something that you don't want it to do, you just whack that button and it stops. And so that's the idea. Just keep an eye on it, watch it go, and you can always take over if you want to do it yourself.
B
Yeah, we put a lot of thought into making sure that these features help you feel in control of the experience and take away some of that maybe uncertainty when you want to go use them. For example, on top of what Ben mentioned, there's also the signed out method of using Agent. So if you want to start without it being in an authenticated session, meaning it doesn't have the cookies required to even access your email, you could do that. And I think that can be a great way to kind of try some things and just do it where you're learning, like how is this, this thing actually work? And you might then hit a point where in order to take the next step with Agent. Well, actually it would be helpful if it were authenticated. And so then you might try doing a task where it does benefit from having your cookies.
C
You might not remember probably the first time you actually run Agent, it shows the screen, it explains how all of this stuff works for you. So you can, like, if you read through that, you'll See sort of the choices that you have and you'll learn about how to use them.
A
I have about a half a million unread email. I'm ready to go full auto. It can't do a worse job than I have so far.
C
You might find it just hit select all archive.
A
Yeah, it wouldn't be the worst.
B
Declare bankruptcy.
A
Yeah, I could have somebody to blame but it's been super helpful for me because there have been times when I'm trying to surface an email and the keywords just don't work or the results are too many and just being able to go in and say find this thing about that, it's saved me numerous times.
C
And so one of the other things that we did with Agent as well, because there are times when you'll want to keep an eye on it, is that we, and I'm pretty proud of what the team was able to come up with here is to make it very visually compelling. And so there's all of the little sparkles and pixie dust and so on that appear around it as it's working. It's pretty cool. So yeah, love to see more people try it out.
A
Yeah, it's fun to watch that. And also with ChatGPT too, in agent mode, I've said that I could probably watch a live stream of just watching these systems solve problems because it's like how it's made. But watching computers do it, it's exciting to see where this headed. I also sort of wonder what's it going to be like when there's going to be a lot of different AI powered browsers out there. And also we have to think about what is the ecosystem like when most of my tabs are opened by my agent and not me and somebody's trying to capture my attention.
C
The way Agent works right now is that it's only running in response to your request. So if the agent is doing something, it's because at some level you asked for it. Pages that it opens actually have some limitations. So you might be used to browsing around the web and you'll see some page show a pop up window saying give me notifications permission so that I can sort of spam you with updates.
A
Never ever clicked on that.
C
Anybody?
A
Nobody's ever intentionally clicked on that.
C
Yeah, the Agent tabs can't do that. They're actually blocked from doing so There's a bunch of stuff like the design of the system to avoid you accidentally ending up in that state. Now of course you're free when you browse to a website and you're asked for a notification permission. If you want to receive updates from that site, it might be your calendar, right? You're free as a user to go and say, yes, I want this, but the agent will never do that on your behalf.
A
It's an interesting world where we think about part of the beauty the early web was the serendipity. Oh, I found this other thing, I found these other links. But I think then that kind of got sort of weaponized against the user where basically you try to do a thing and it's hard to do a thing. And I think we got to.
B
It's really interesting. I don't know if this is where you're going with that, but like a lot of websites want to just keep you on their website. Maybe they'll run ads which would take you off, but otherwise they kind of keep you in that lane. One of the amazing things about the side chat or the model being present there and the agent even is just that you can ask it questions about that site where the answer might be something on a different site. And so it makes the web bigger for you. It helps you not just be stuck down that rabbit hole that you were on, but to help bridge you to something more useful to you maybe, or more helpful. I know you have some good stories on this.
C
There's something just wonderful about the, I call it beautiful chaos of the web where you kind of don't want to always be stuck in the same place. You want to be able to embrace the diversity of the web and all of the content that's on it. So yeah, I love that, I love that you can do it as well without having to leave the site. You can do it right there on the side and then you can choose to go somewhere else. But there's this aspect of, I know with Wikipedia you can go on these multi hour journeys through content that's really only. That's like a feature of Wikipedia. Whereas I feel like the Ask ChatGPT sidebar gives you that ability for the web at scale and so it gives you the ability to ask questions about random sites and then go off in different directions and that sort of thing.
B
I mean this is extremely useful if you're looking for certain kinds of products and you find yourself onto one product page for one company or one vendor. But now you can be like, well, what else is out there? And the model can say, well, here's some other sites to go check out that are related to this. And off you go. Now your world has gotten bigger, right?
A
I've had some wonderful discoveries with both videos and books that I couldn't find through the YouTube search engine or the Amazon search engine. I found places where, because ChatGPT understood a bit more about what I was looking for, what I was really trying to find, and that was like, it gave me more utility out of those sites.
C
That's actually another, like I said, For ChatGPT as a whole, like the personalization features, the fact that it sort of learns more about you the more you use it has been like a super popular feature of ChatGPT with Atlas, you know, this extends to your browsing activity, like your sort of web history. This allows the browser to create these browser memories, which Darren pointed out before, is something you can use to help you get back to a site if you can't remember it later. But it also helps in situations like with the agent. I am a United Mileage plus member and so I tend to like to look for flights on that site. It would be very tedious if every time I ask the agent to go and do something like that, if I had to tell it and always use United Airlines. But it kind of knows from my browser memories that I'm a frequent user of United. And so it will just go there.
B
It helps you in the forward queries, right? Because it's like now this search experience has so much more context about what matters to you. So it ends up being a lot more efficient. It saves you a bunch of time because you don't have to tell it as much again and again.
C
So I feel like that's something that helps me a lot. I think some people probably have different preferences around these things as well. So there are controls where people can go and see and control what memories are used. Personalization, you can turn it off entirely.
A
And you also have to when you use it. And I don't go into these tabs as much. I'm starting to do that more because I realize they're there as I can go to images, I can go to news, and I have kind of like a search engine. And that's the thing I'm trying to sort of understand is it's a browser, but is also OpenAI heading towards its own search engine.
B
Well, part of that comes from the fact that when you're building a browser, people come to that browser with existing intents, like navigational intents or the idea that they do want to look for images, right? Or they want to see certain kind of subset of information. And so we brought those. Those controls into the. The landing page of chatgpt.com so that it would be both familiar to people, but also useful in the way that they're. They're used to. Right. We want to make sure people didn't feel like they're so out of, you know, that we didn't want people to have to learn so many new things in order to be successful using this product. We wanted them to have a lot, a good dose of familiar tools and familiar sorts of things. Anyways, these are just useful for many people. Search and browser are just very connected, almost one of the same. And it's very important to internalize that as we're building this experience. I think it's very powerful that, and I was touching on this before that as people search and use the browser in maybe a very normal way, they're also learning about that there's a model there that's going to respond to them. So you get a set of chips across the top, which is like quick links to go to where you were trying to go, perhaps these different tabs where you can click on to see familiar, different kinds of subsets of information. But also this model response coming in. Using a product in a normal way, you start to learn that there's another way or that there's a superpower that this thing's providing. Some of it is just the normal ChatGPT experience that people are used to, but not everybody's using chatgpt to the fullness. When it's core and central to the experience. We have an opportunity to present that to people as part of their normal journey. And I think that's really cool. Similar to sidechat, of course, you have to activate it, but it's right there. Ask ChatGPT and you might be curious. And now you kind of unlock the superpower, but it's right there, the context is there.
A
An experience for me was the very first day when I started using it. And I look at this, I'm trying to understand, okay, this is basically, it's an app that has a browser and ChatGPT. It's not like we just sort of of glued those things together. It's sort of like they're both there and there's a deep connection to the ChatGPT. And I asked it to could you add a bookmark for Amazon? And then a moment later the bookmark appeared. And that was a really kind of special moment to sort of understand what happens when the LLM deeply understands the system and is able to make those kinds of changes.
C
We're very excited about this I think from a, just like a conceptual transformation point of view, we're moving to a world where you can just tell the computer what you want in whatever way you want to tell it, simplest way.
B
Possible to tell it.
C
And so what this means for making computing more accessible to more people is just really profound. And that's like the company's mission is to make AGI beneficial to all of humanity. We take that really seriously. And I think being able to transform computing in ways like this, that might seem very small on face of it, but they add up to something far more profound. And so, yeah, we're excited about that kind of thing.
B
Some of my first experiences with ChatGPT as a user was really this idea Ben's talking about. I was comfortable, happy just sitting there doing my Google searches, but sometimes I didn't quite know what Google query to type in. And when I realized I could ask a really poorly formed question to ChatGPT and it would come back with make some sense of what I asked, what I said, and it would give me something that maybe now I could query Google for. That's how I first started using ChatGPT. And then I started to realize over time that, like, oh, why am I not just asking it in the first place? You know, And. And it's sort of, I think, for people, they all have like a. There's like a bit of a journey with new technology, right? You're. We're all creatures of habit. We're used to the things, the way we work and, and it works well for us, the things we're used to. We're used to it, so it's not a problem. But as you maybe explore something new, you start to see, oh, there actually is a better way. And for everybody, that journey is a little different. And so for me, one of the things I was most excited about with Atlas was this idea that when you're typing into the address bar, that the default is ChatGPT. Because for me, that actually makes sense for most of what I'm going to do.
C
This is one of these things where I feel like now when I don't have access to that. Correct. I feel like there's this little bit of friction. Like, it takes longer now because then I've got to go find my chatgpt tab in another browser and like, figure out how to get at that and do that. Whereas with Atlas, you can just like open a new tab and start typing.
B
The old way was a much more manual way. This way is a much more. I don't have to be as like, clever about what I ask. I can just give some.
C
I have a problem.
B
Yeah, I have a problem. I can say it in a much more simple way.
A
I know that I'm still having trouble kind of context switching and understanding that it's not just a URL search bar or just an empty keyword search, whatever, that literally I can ask for things and not just have what is the capital of Nepal and not just have that pop into a Google search box. And that's the thing. Now I'm like, oh, yeah, when I go type in the thing, if I type in the URL, I get the URL, but I can also type in kind of my query and do that. But that's still taking me time to adjust.
C
Well, just as a general rule, I find sometimes modes can be a reflection of some of the limitations of the system. Underneath that, at the end of the day, humans don't understand. And so I think the North Star for us with so much of this stuff is can we just help you arrive at the right place, regardless of you needing to know, I should put it in this mode or I should put it in that mode. That is sort of the struggle. It comes down to how do you want to use this tool? And so we want to make this thing something that if you just go in, as Darren was saying before, you can just kind of tell it what you want, maybe this half formed thought, and it will give you something good, it will help you figure out the problem. Now, of course, there are ways that if you are a user that understands some of the underlying capabilities of the system, we want to give you the option to invoke those too, to bring them down and help you. And that's sort of an efficiency gain that you can get. But certainly the system shouldn't require that, you know, all of those sort of incantations, it should be able to just take what you say to it and give you something good.
B
Yeah, I think it reminds me back to like the early days of browsers, in the era when people would install like a toolbar for their search engine. And, you know, that meant they had yet another box on their browser. Right. And Firefox had a dedicated search box for doing your web search. But back then, as much as people were very used to that and very comfortable with it, you know, you have one box to type URLs and one box to type search queries. When we were working on Chrome, we were like, why have two boxes? Why do people have to stop and think about which box to type it into. Just give them one box. Now, if you look at Chrome, that's what its URL bar looks like, right? Just one box. And that's become the industry standard. But even on Chrome's new tab page, there's actually two boxes. There's one for the address bar at the top, and then there's this box in the middle. That's the maybe comfortable, familiar Google box.
C
Right?
B
What we wanted to do and we kind of pushed ourselves with is like this whole topic of like, hey, you might have a conversation you want us to with the model. You might be interested in navigational query, but really you might not make up your mind about what your intent is until you start typing. And just one box is a lot simpler. And so when you open up Atlas, you just have one box on the new tab page. And that was something from a design perspective that we really tried to achieve and I think we were able to. And it keeps the whole system a little bit simpler for people. It might be a little unusual and not what people are used to, but. But I think over time they'll get to like it.
A
What was some of your favorite features, some of the things you're glad you're able to implement?
B
Oh, man. You know, it's interesting, whenever you get a chance to build a new browser, having worked on quite a few, you get a chance to sort of start over and reset on certain things, not everything, because I think one of the core tensions is that people are used to their browser the way it is, but you do have a chance to rethink some things. So one of the features I worked on was the scrolling tabs feature. It kind of came from an insight of that tab life could be a little better if maybe new tabs all were inserted on the left or just on one side. If you're a user who pins tabs in your browser, which is a pretty advanced use case, maybe a lot of people don't know that you can pin tabs to the tab strip in Chrome or Safari or other browsers, but it is a common thing. And if you pin a tab, it'll be pinned on the left side. And whenever. Suppose that was like a Gmail tab. When you click Links, those new web pages would open just adjacent to that pinned tab. If you press the plus button, the new tabs would appear off to the right. And what ends up happening is you're working throughout your day, you're going to Gmail, opening a URL. From there, you're hitting the plus button, opening tabs on the right, and you're sort of accumulating old tabs in the middle. And so it becomes a little bit painful to close all of the tabs to the right from the middle to clean up those tabs, and you just end up with a lot of clutter. So scrolling tabs was one of the innovations that we worked on to try to make tab management better. And it's not an AI feature, but it's like when you have this opportunity to rethink browsers, it's an opportunity to rethink some of these primitives and try some different things.
C
Mini is a major productivity tool. So finding these wins can be really.
B
Yeah. And we think about. One of the things that I came to realize and appreciate only later is that if you have a browser that more naturally scales to having tons and tons of tabs, it means that certain kinds of things get unlocked for you. So everybody's or many people are probably familiar with the ability to search for a specific tab that you might have open. There's a command shift A or a button for that in many browsers. What with our system with scrolling tabs, the fact that it can allow for a lot of tabs to be accumulated without them all being in your face, you can still search across them and find these old tabs. So in a way, it's like this history of things you've done in your browser is there for you to search in a very familiar way because it's your command shift A. It's right there. And you can have that capability without it being cluttered.
C
Darren's talking about tab search. Search over your tabs. You can just type and that will find the tab that you. But I think the most interesting thing about this feature is the fact that you don't need to close tabs. And so you can end up having. I think my browser, I've got like, well over a thousand tabs open. And I just wouldn't think for that to be possible, or you might think.
B
That that would be a problem. Right. But it's not. No.
C
Because the system manages the memory for you.
B
Yeah. Now, this is the scrolling tabs feature that is not on by default. And part of the reason why it's not on by default, as much as we think it's magical. And I'm a huge fan of this thing we built. It is also a little different than what people are used to. And we wanted people to not have to learn so many new things all at once when they're approaching this browser that is bringing all these AI capabilities. But one of the amazing things when you allow for thousands of tabs to be open, means not only do you get to access it again with tab search, but the model can see them. The model can see these tabs. It means your working set can be very large, larger than what you might keep in your head, naturally. But you know that there was something there. So you're going to ask the model for it and it can go and interact with those tabs again. And I think that's actually pretty amazing.
C
I would be remiss if we didn't. If I didn't mention for this question as well, just the basic feature of Atlas, which is the Ask ChatGPT sidebar. This is something I get some value out of every single day. As I use the browser, I pull that thing open, I ask it to summarize a page if it's too long or I want to figure out, like, if I'm reading an article, like how it really matters to me in particular, or if I have a question about something that's going on in the world, it can go off and do some research for me and come back with stats and facts and figures. I've used it when I'm online shopping to make sure I'm looking at what really is the best deal on something that I'm looking at. I've used it to help spin up agent tasks to go off and automate some of my productivity workflows. I've had it build Google forms for me to help me quiz my coworkers on the best way to design new features for the browser.
B
I really like that example because if I remember correctly, you said, you also, you asked sidechat to help come up with the outline of the. Of the survey. And then you said, hey, can you just put it into a Google form for me? And it did it. And I looked really cool.
C
Yeah, so it's just, you know, we talked before about bringing the power of ChatGPT with you everywhere you go on the web. And I think that sidebar, really, it's like having ChatGPT sitting on your shoulder just right there to help, you know, give you some advice wherever you might need it.
B
And sometimes even just simple things. Like I was in Slack and. And there was somebody shared some text that was in another language. So I just selected it and I right click and asked sidechat about it and it translated it for me. It was so much easier than having to. You don't have to sit there and copy paste, right?
A
My favorite use of that so far in the agent mode. And I won't name the cloud provider, but it's a very big company that often you find out you run a lot of services and you forget what those services are. And at the end of the month, you get a bill. Bill. And it's a very confusing bill because you're trying to figure out, I thought I shut this down.
B
Isn't this all of them?
A
Yeah, well, some of them are a little bit, or maybe been around longer and trying to parse through. That's like reading the Soviet tractor manual. And I went in and I said, hey, I got this bill. I think I showed everything young. I don't know what's going on. Can you help me with it? And I watched it navigate through the website, go to the page, find the different things I was doing, explain to me what the service was doing. I'm like, can I shut this down? I'm like, yeah, shut it down. And that was like $100 a month bill that was just saved through.
C
Wow, that's really. That's awesome. Another one I had as well. I was actually. I had some. Some medical tests done recently. And sometimes it can take a while for the doctor to come back and explain to you, like, what they mean. And then in the meantime, you have the patient portal there and you can access the sort of doctor language stuff. Stuff there. And I can't read that. It's not written in English, normal English. And you can have the thing you can ask and that will tell you kind of like what. What that means for you. And I found that to be really helpful. Yeah. So saving you money, helping you get some answers. This thing feels very, very like. I'm pretty convinced that this is the way, increasingly the way that people will.
B
Interact with information, or you're using some very popular yet complicated HR tool or something like this, and you're like, where's that thing again? And it, of course, has studied the manual for you and can go and show you the way. It's kind of remarkable.
A
I feel like it took me a long time to realize that once I had an iPhone, that I always had a camera and a flashlight in my pocket. And there are many situations where I'm like, what was the name of this thing I saw on the store? It's like you could have taken a photo or, man, it's dark. It's like you got a flashlight. And I feel with these tools, there's a lot of capability there that we even saw that to a certain, you know, some of Us were power users of search and other people was a complete mystery. And do you think we'll see a faster acceleration here that people are going to start sharing and understand how to use this?
C
Yeah, I think the stage that we're in with Atlas right now is we think this is a really powerful tool, but we don't know all the ways in which people will use it. And it's kind of like the Internet in that sense. And so one of the reasons why we wanted to get this out when we did is we just want to see how people use it and hear from people where it works well, where it sucks and needs to get improved. But I think over the course of time we'll get a better feel for that. I also think we'll need to help explain in more cases when the right time to use it could be because I think there's a part of building something that feels like magic instead of making that magic real for more people in more situations. And we don't want to have to rely on people to always think I should ask this question at this time.
B
Yeah, it's really easy to just we're creatures of habits. We use the browser the way we use the browser, we use our computers the way we use them. We don't always realize when there's a better way to do do something or a more efficient thing that we could be doing. I feel this way about when the process of learning how to use ChatGPT in the first place, it's like realization, oh, I should just ask the model for that. It'll save me time but it takes a little while and there's a bit of a tipping point for people where they start at some point in their journey they're going to learn how to use these tools. And there might be some people who are early adopters and they can show the way and figure things out out and share those ideas. But also it's kind of like I think a lot of people haven't yet found their way to how to use these tools in the best way.
A
I find myself still trying to shut down tabs because I'm still. I started using browsers in the era of getting the pop up message, you have too many browser tabs open and now the compute and the capability and the management internally of these things is way advanced. And so I think about like I'm not optimized in many ways to.
B
Yeah, also another like say non AI feature of our browser was that we kind of took a page out of the playbook of Mobile browsers, recognizing that, you know, your laptop's computing resources are not really limited. You have a battery you care about, you know. So we put a limit on how many tabs would actually be backed by a live web page instead of trying to might be the more traditional approach that desktop browsers would take, which is to just try to mitigate the cost of those background web pages that you haven't used in forever. We will just close them down and if you go back to that tab, it'll get reopened and we keep a reasonable limit there. And we apply somewhat of a clever ish caching algorithm to try to be smart about making sure that the tabs you care about are kept in memory so as to sort of lessen the burden on your computer. You might notice also with Atlas that it restarts super fast when you restart Atlas because of the way it's structured. The OWL process is separate from the Atlas process, Owl being our embedding of Chromium. So the two can start up in parallel. We can restart Atlas very quickly with all your tabs and the data associated with them, but the web pages aren't loaded yet. And when you click on them, it'll bring those web pages back. But this way the whole system can stay fast, lightweight. And as we were able to build Atlas as a separate application from Owl, Atlas is controlling Owl. Owl is projecting data into the rendering of web pages into Atlas. But Atlas itself can stay a relatively thin, swift application.
A
Why Chromium?
B
That's a fantastic question. I answered this question on the site formerly known as Twitter by saying that web compatibility, it turns out, unfortunately, or for better or worse, a lot of websites are only really designed to work with Chromium. There are features of major websites which I won't get into naming names, but that are just not present if you're not using Chromium based browser. And the other reason is Chromium extensions. Extensions built on top of Chromium are very popular. And when you build your browser on top of Chromium, it means those extensions will just work. And so we want to make sure that we are building a browser that first off works for people, that all the websites they care about will be supported and all the features of those websites will be supported. And we want to make sure that they could install any of the extensions that they care about and that they're used to using in their browser.
A
And it seems like there's also we kind of non technical. They hear Chrome and they hear Chromium but not understand there's Like a really deep lineage that even goes further back. You also have WebKit and KDE and whatnot.
C
So what I'd say is I think there's a lot of excitement from among the community to see new rendering engines come about. And that's certainly been part of the DNA of the web too over the years. At the same time, just like Darren said, when you build a new browser and you don't have that many people using your product just yet, you kind of just want the web to work as people know it today. And actually back when we were starting working on Chrome, we had the same concern. Like Chromium today has Blink, which is sort of its own rendering engine that sort of diverged from that lineage. But at that point there wasn't much appetite for taking risks like that. And so that the Chromium rendering engine is based on WebKit, which is the Safari rendering engine, which is open source, and that itself was based on an earlier rendering engine from the Linux world called khtml. And so yeah, it's really interesting to go back in time. You can see how these sort of open source projects fork and branch and so on.
B
There's code in Chromium that comes from the Mozilla project too.
C
Going back to the 1990s, you can find this, this is true, this lineage.
B
So browsers are this layer cake of technology that's been built up over time. And really where we are wanting to innovate is that next layer where the AI model comes in and how it's articulating and interacting with the foundational layers. And so for us, building on top of Chromium gives us this well known foundation we built on top of Chromium in a very different way than normally browsers would. Most browsers are just taking Chromium and forking the UI or layering another UI on top of Chromium, but running right in the same process as Chromium. What that means is that if Chromium's doing work, your application's not doing work. And so in the structure that we set up with Owl, it means that Atlas is able to work in parallel with the rest of all the activities related to rendering the web and producing web pages, which is pretty cool. Cool.
A
So if the browser part crashes.
B
Yeah, if the OWL part crashes, if something goes wrong with that piece, then Atlas can restart Owl.
C
So that's actually a really interesting thing because I remember back when we were originally doing the design of Chrome, there was this thought of like, well, the webpage might crash and therefore your Browser should be around. And then because Chromium has sort of become this very sophisticated platform for web pages, it itself has become super complicated. And now like Atlas is this very lightweight frame around the outside that really is about that core productivity use case of using a browser with ChatGPT kind of as this tool that you can bring down onto any page. And that's really where its focus is. Whereas the chromium, I.e. owl piece, is able to focus on being that platform and then both parts are not really. They're resilient to each other's, you know, difficulties in that sense. Yeah, Just actually like another fun fact about, like, OpenAI and the benefit of the system that we built. Every engineer that starts at OpenAI merges code on their first day. And if you think about, you know, how massive Chromium is as a platform, you know, it's really super powerful, but it's a lot of code. It takes a while to get all that code onto your device.
B
Just a little bit complicated.
C
Yeah. And it takes a while to build it up all. And so we tend to get our new hires in the afternoon as they've gone through all of their onboarding training, and then they have to merge a change. So to be able to check out all of that code and build it and then make a change to the code in your first afternoon can be pretty tough if you have to do all of that. But because we have structured this in a way that they don't have to, you can go and make a change to the Atlas side, get that checked out and built very, very quickly. Our engineers are able to be productive right away, merge code on their first day and like, ship features in your.
B
First related to this. So always when you're starting a new project, you get to make new technology choices. You get to. When we were starting Chrome, we got to say, what is the latest and greatest way to build code? Right. Fast forward to starting Atlas. We're like, what's the latest and greatest way to build a native app on Mac os? So of course we're going to be using Swift, we're going to be using SwiftUI where it makes sense, and we're going to be using all these. The app is built in XCODE and just done in a very familiar way. So people who are used to doing Swift development, maybe because they're building iOS apps, they can come in and just be instantly productive because this is not a foreign code base, not a foreign system, and yet it's harnessing the power of Chromium at The same time, which.
A
Is super complicated when you look at this from the outside. A lot of people draw comparisons to go, well, there was operator, now they're doing ChatGPT Atlas. Is this going to be a real thing for them or is this another experiment?
C
So a browser, I think it's a super core tool for productivity and it's something that you need to be able to count on. And so my view for this has always been that this is like a long term investment. And so that's the way that we're approaching it. There's a set of functionality that we've launched which is sort of like the first phase. If I come back to browser history, I sort of say it's the Netscape one or two, if you like, of this new era of web browsing. So there's going to be a lot of future improvements to come. Features that we're building because people have told us about it from the initial set of feedback, things that we come up with through our partnership with research, a whole host of new functionality that will come out over the course of time. The other thing that we hear from folks is they want to see this across different platforms. Yeah, basically this browser isn't available for my Windows device or on my phone or that sort of things. So these are things that we're thinking about and working on. So there's like a long roadmap of enhancements. And so we want people to both feel confident that this will get better over the course of time. It actually gets better every week when we push an update and they will see it increasingly on more of the surfaces where they are. So definitely a long term investment for us.
B
Yeah, we got this app to the point where internally the users at OpenAI, where we were seeing them enjoy this product and we were at a point where the kind of feedback we were getting was.
C
Why haven't you shipped it yet?
B
Why haven't you shipped it yet? Is exactly what I was thinking. Because it's like we weren't getting new feedback and we realized we're ready to ship this. We want to share this with the world, we want to hear how other people are experiencing it. And I think that this feedback's been all kinds of amazing. Obviously there's been people who have their pet feature that's missing or like when Ben said, how come I can't? Where's Windows? When? But the other day my 14 year old son came back and he's like, oh man. My friends at school, they love this browser kind of a thing and I was asking like really what do they love about it? And then he just was like talking about all the different AI features that they were checking out. Out. I think it's really interesting to see that sort of spark of fascination from people, whether it's kids or people we work with or friends as they sort of share their experiences with this thing. And I don't know, I just love to see how people. My wife was so giddy when she first got to try Atlas when we did the friends and family testing and just to be able to go and explore researching some tasks and asking the side chat about what she was looking at. And just she had so much fun with it.
A
My wife loves it. This is not a lie, an exaggeration. The night that it came out, we're sitting in bed, I'm reading and I look over, how are you doing? She says, I swear to you, I swear. He goes, I can't stop thinking about ChatGPT Atlas because for her it was her first use of an agentic system like this and to be able to go do these things, it was a huge unlock. Was her favorite tools ChatGPT now for Connected to the browser.
B
Yep, exactly. Because just the tediousness that it takes away when you. I'm looking at this website or I'm doing some research and I can just ask it now about the thing I was talking about it to it before and now we can take this as context and we can kind of keep going and research and whatever the topic is. It just becomes a lot easier when the AI when the model is right there with you.
C
I think a lot of folks like struggle with, you know, how to do, you know, sometimes what seem like very complex tasks on certain websites. One of those websites I think is if I'm on a spreadsheet on a web based. Pick your favorite web based spreadsheet program. How do I visualize this data in a certain way? If you just have a tool there that you can ask, drop in and help in a very consistent way, I think that's really interesting. The other piece that I think is interesting, related to your wife's reaction is this is a lot of net new capability. I think for a lot of folks in the world. What I'm really excited about with this tool is that our model capabilities are always evolving. So at any given point in time it's not the ultimate state of it, but we get to show people how this stuff works. And I think with that build some more understanding, some more trust about how this technology is working on your behalf. And I think even if we came up with the world's best, best model tomorrow, that solved every problem perfectly in the way that you personally would have wanted that problem solved, you probably would still want to be able to come along for the ride and understand how that was actually happening, just for your own education. I think over time, I think as your trust level builds, eventually you'll feel comfortable fully delegating very complex tasks to this technology. But we're not at that stage yet. And so I think one of the things I'm really happy and excited about is that people are able to come and observe the next step of this technology and watch it, see how it does, tell us, yell at us when it doesn't work, that kind of thing. But I think you can kind of get a feel for how it works. And as a result, you will know what it's going to be capable of and you'll kind of know what the controls are like, where you want it to stop doing something or you want to do something differently. You can just tell it and so on.
B
I think you can kind of dial up and down how much you use the model and the ways you use it in this product. But it's kind of like right there, there, it's easy to try again. And I think sometimes the magic I've felt with ChatGPT is when it really works ever so well that I'm like, oh, I'm going to go back for that, right? You have those magic moments where it's like, oh my gosh, I'm going to change my habits because of how this works, right? And I think in this case, it's easy for people to use familiar patterns. I open the new tab page, I do a search. It lands me into an experience that includes some links I can click on, but there's also the model response, response. And so you start to learn that maybe instead of that website I was looking for, that was going to answer my question. It's just right there. And I can go and explore that and I can ask it another question. You start to learn the power of this model just by virtue of using the product in a familiar, normal way. For me, that's kind of exciting to see how people. It might open the door to people starting to realize just what the capabilities of this model are.
A
Looking to the future. First, let's start short term. I can understand how a Windows version, you're going to pretty much want parity of what you have right now on the Mac version. But when you talk about Mobile. And it's one thing when I have a lot of desktop space and I can put a sidebar and have the chat thing there. But when you're talking on mobile, browsing, as you guys know especially is a very tricky thing. And moving a search tab from the top to the bottom or whatever seems like a revolutionary change. How are you thinking about that, the mobile experience? And also is it going to maybe are we going to be thinking more agentically in how we use these things?
B
So maybe. Fun fact. Some of our initial explorations were actually on mobile and part of the way we were thinking about it was really what does it mean to bring the model to the web? And that can take many different forms. And of course ChatGPT exists as an app on your phone. As you can imagine ways in which you might share to that the kinds of memories that have been generated by using Atlas. Right. So there's a lot of different flavors and forms in which mobile could come, how this could manifest on mobile. But at the basic level it's like what are some interesting. We're going to be looking at how can we bring the web to the model and what does that mean, what makes sense on mobile?
C
The UX may look a little bit different obviously, as you point out, there's sort of a different form factor there. We've got very talented designers though. I'm sure that they'll come up with a good way to solve some of this stuff. Definitely. I think on mobile, or at least I noticed through my own use, my use case is far more information retrieval. Like I have a question, actually I use the ChatGPT app a lot. If I have a question about something, I'm around or I point the camera at the thing and what is this, that type of thing. So I think that there are a whole host of situations like that where there is web content that is part of that journey and we want to make sure that user flow that you have with Atlas, where you can view some web content, where you can ask follow up questions and then go back and look at the content again, that, that feels very good. That's something that we're sort of in the midst of figuring out right now. And so not too much more thought on it than that. Other than that it's something that we're going to want to make sure. Feels very good.
B
Yeah. I think we hear from people that the importance of mobile because they are doing work on their laptop and they want to continue in some fashion on their phone. So you can start to imagine the Kinds of space of possibilities there.
A
Where are we going to be in five years with how we're using the web and how we use tools like this?
C
So I would love to be in a place where. Where people think less about the particulars of the tools they're using and are more just expressing what they want to the system. And then the system is smart enough to understand how to respond to that in a good way. And so in that sense, we can be, as humans, we can be focused on the highest. The question is the highest order piece, which is like, what is the most interesting for me to do? You know, the model maybe can take over. An agent can drive. Maybe the less appetizing part of the work, the more I use the word toil to describe some of that sort of grunt work of pulling information from a bunch of different sources, maybe it can do a bunch of things that just seem very difficult to you because you've not done them before. It sort of knows how to do those things. And then you can be focused on the things that you want to do. So I imagine a world where actually there may be a lot of Internet traffic in the future that is agentic. That may even be most of the Internet traffic. I still see people as generating a bunch of traffic as well. But that should be, you know, it should be efficient. People should be focused on doing the things that they want to do and delegating more of this work to more of this toil, really, to agents that can take on a lot of that other stuff. And then if you have to make a decision on a project, maybe your agent comes back to you and gives you some choices. If you're going to take a vacation, do you want to stay at this hotel or that hotel? You can pick between them. You get to make a choice. But all of the sort of grungy clicking around and scouring the Internet for these things, maybe it took the first pass and presented you some choices.
A
We've talked a bit about how as users and developers of this tool, you look at it. If I was somebody who had an e commerce site, if I was looking at, I'm going to be putting information on the web. And I know that one. There'd been conversations for a while like, what happens when LLMs go search the web? And now we get into the world of agents. When agents are using this and plus ll empowered browsers, what advice, what direction would you be telling people to think?
B
You mean the publishers? Yeah, yeah. I mean, I think that it's really interesting. Like in some Ways, I think about the, maybe a little bit related to what Ben was talking about. You see the more recent models, they've learned how to, based on the query, decide, hey, I should actually look at the Internet to answer your question. Right. And so I think it's really interesting how these models can help you connect to publishers in web apps and whatever content that's out there, either, you know, giving you a snippet and like a citation to it so you can go deeper or even just connecting you to it, because that's actually what you, you know, if your intent was to navigate to a site, then it can help you get there. I think one of the things we've been exploring with Atlas is how to, for example, better handle and better serve navigational intents. Sometimes people come to their browser with absolutely the intent of, I want to buy this product on the site, site, you know, and that's our job, just to get you there as fast as possible. And so it's been actually an, an element of building Atlas was making sure we're serving those kinds of queries. Well, other times you just want to probe the, the knowledge of the model and have it go and research something for you. And sometimes that involves it needing to invoke tools to do that on your behalf. And so again, sort of to Ben's, what Ben was talking about, it's I, you know, I imagine a world in the future where when you don't have to be so prescriptive about what tool you want the model to use, but rather it has this incredible palette of tools that it can draw upon and some of them can be actuating your browser, things like this.
A
We're going to be using web pages in 10 years, I think.
B
So. It's kind of this fabric of this world where people are publishing through. It's the core primitive how people are putting content out there. Right. So it's kind of the, you know, the Internet superhighway, all that kind of those analogies. But it is like this, this open fabric for which people can publish. And I don't see that changing.
C
It's the world's largest, as you mentioned before, most open platform. And I think some of that power of openness is always going to make it attractive for people to put content on.
B
The way I look at it is these tools, yes, they are able to understand that fabric, understand that Internet, understand the content that's out there, there, but they're also able to bring that content to people and connect people to that content. And that can be very powerful. And again, it's all in service of what is that user's intent. I think it's fairly interesting to think about how we can do a better and better job of that, really serving the needs of those users. And ultimately, as people are putting content out there that's intended for people, if you're putting out content that's a gallery for somebody to go shopping or something like this, we want to help people find that, help people get connected to that, help people with the journey that they're on on and whatever that may be.
A
This is exciting. Thank you for sharing this. Any last suggestions? Any power user tips?
C
Yeah, definitely. Definitely. The scrolling tabs feature is like a favorite for both of us. Yeah, I would just say, like, challenge yourself, like I said at this point, very early stage, but challenge yourself with your curiosity on any given page. Ask a question on more pages that you as a it and you might be surprised with what you come up with.
A
Awesome.
B
And we'd love to hear from everybody about, you know, how they're, how they're experiencing the product. So please keep the feedback coming.
C
Definitely.
A
Ben, Darren, thank you very much.
B
Thank you.
C
Thank you.
B
It.
Date: November 13, 2025
Host: Andrew Mayne
Guests: Ben Goodyear and Darren Fisher, OpenAI
In this episode, Andrew Mayne discusses OpenAI’s new browser, ChatGPT Atlas, with Ben Goodyear and Darren Fisher—veterans of various browser teams (Netscape, Firefox, Chrome). The conversation explores why OpenAI built a browser, how AI is changing the experience of using the web, the technical and philosophical underpinnings of Atlas, and what this might mean for the future of browsing, search, and digital agents. The discussion covers Atlas’s design, unique features, integration of agentic capabilities, engineering choices, user experiences, and a glimpse into what's next.
ChatGPT Atlas represents a major shift in how browser technology can augment, automate, and personalize the web experience, leveraging LLMs not as an add-on, but as the browser's core. Its design enables both immediate productivity gains and serves as a proving ground for a future where digital agents interact with the web just like, or even for, us. The episode provides technical insight, user scenarios, and vision for a web where telling the computer what we want—in natural language—is the new normal.