
Loading summary
A
Foreigners. Welcome back to no Priors. Today I'm here with Simon last co founder at Notion. We talk about their new vision for Notion in the AI age as a platform for humans and agents to collaborate, how the engineering and product org at Notion is changing and these new tools for thought. Welcome, Simon. Hey Simon, thanks for doing this.
B
Yeah, of course. Yeah, it's really fun to be here.
A
Notion's at scale, amazing platform, lots of users. You did start quite a while ago. I think of Notion as one of the companies that has really like braced AI quite aggressively. I was told you first got your hands on GPT4 at a company offsite in Mexico. Is that true? What is the origin story of like starting to work on this stuff?
B
Yeah, I think, yeah, that year, that was 2022. I've been watching what's going on in general. I've just been super curious about the technology and fascinated to try everything and think about how we can apply it. It wasn't until I played with GPT4 that it became really, really real. So when we got access to it, it was sort of like a proto chatgpt like interface. And my co founder, Ivan and I both got access and it was just immediately clear. I would say two big things. One, one is that it was just pretty smart. It could follow reasonably complicated instructions, it could write things for you, it could edit things. And the second big thing was the scope of its knowledge was extremely interesting. Super, super deep and broad world knowledge. When we played with it, it became just instantly clear to both of us, like, okay, the time is now to start thinking about how to apply this. It's only going to get better.
A
We were Talking about Mexico, GPT4, you guys saw it was clearly the time. Did you start with a particular vision of what you should obviously be able to do with AI and Notion, or just start pulling people from different teams or recruiting people and say, let's experiment. How did you begin?
B
I think we immediately had a long term and a short term vision. I would say I'll start with the short term one. The thing that was immediately obvious was, oh, it could be like a writing assistant. So it could be in your document, you can select some text, have it rewrite it, you can have a write text for you, maybe look something up and then give you sources or more information. So that was the thing that we immediately got to work on and we sort of started a tiger team around it and then we were able to launch it in two or three months after that. And then the long term vision that we Immediately had was like, oh, the thing that looks like it may be possible is more of like a general assistant. So what if you could just give it all the tools inside notion that a human would have be able to create its own databases, query, manipulate them, create documents, edit them and sort of weave all of these things together to do a longer range task. And so we sort of immediately started on both. The short term one we were able to shoot very quickly and then the long term one didn't really work yet and so that took much longer to get working.
A
Are there specific first launch of the AI specific notion features and products was when last year?
B
No, it was February 2023 is when it launched.
A
Yeah, my timelines are wrong. Are there a few specific learnings or breakthrough moments you think since beginning to release that are interesting?
B
Yeah, I mean it's been a slog over many years or multiple years at this point with many, many learnings. I would say. Yeah. I mean just to give you a timeline of the arc of what we shipped is. So the first thing was our rating system. We called it AI Writer. That's the first thing we launched. It was the easiest to get working because it's like single step task rewriting, editing text. There's no retrieval aspect. It was just raw access to the model to write the text. The next big thing that we immediately started working on was Q and A doing a semantic index of the entire workspace and then letting you ask a question and I'll give you an answer that's grounded in the sources. That was also immediately obvious to us that that'd be super useful. And so we started work one we launched in, I think it was October 2023. So we started a beta before then, but then our GA was in October. That was a much bigger effort to get working. Obviously we weren't just plugging in the LLM, it was actually doing this real time updating index. We had to get much more serious about the evals and the quality there as well. The Q and A has been a multi year journey. Basically what we did is as soon as we got the notion index working, it was obvious that okay, we should index everything else as well. And so we indexed like Slack and Google Drive and we were launching new ones on a regular cadence. And now we have a, I would say fairly complete.
A
One could argue that those are like very difficult problems that those products natively have not solved perfectly yet. So how did you think about taking that on? I don't know if that's like an offensive thing to other product teams. But like it's not working yet.
B
Yeah, it's kind of true. Yeah. This has been something we talk about a lot because it's like, you know, it's like almost like what right do we even have to do this? But it turns out that most of the companies are pretty bad at making their indexes somehow. It's honestly kind of baffled us a little bit.
A
Right.
B
But I think my take after dealing with all of this and you know, working with the teams to try to get it working is there's a little bit of just AI pilled savviness that's pretty important. And then I think most of it is honestly just like a bit of craft and attention to detail. I think in particular with this indexing retrieval stuff, in order to really get it working, you have to be quite empirical and iterative and actually be trying queries. Each data source is a little bit special. You can't just apply a one size fits all to querying Slack versus querying Google Drive. Let's say they're completely different kinds of information and we found that there's just a little bit of craft and love has to go into it in terms of actually trying a bunch of different queries, actually using it every day and constantly iterating and rethinking and tuning how the retrieval works.
A
How did you think about the diversity of how people organize their workspaces? And just, I mean even notion is not use of it is not homogenous. Right. Like I'm probably part of 15 workspaces as an investor and so I look at them and I'm like, well, mine's a mess and these people are really organized and the workflow's reflected in how their notion works.
B
Yeah, totally. I would say. I mean the interesting thing is that with embeddings it almost doesn't matter as much anymore. The AI doesn't really care what the tree structure is, for example. All the AI cares about is that there's a snippet of text that has the context you need and then it can retrieve it. And so actually we kind of advise people now don't worry as much about organization, just find a way to get it all piped in and thrown in there.
A
You still make decisions that could change performance quite a bit, like chunking strategy or whatever.
B
That's super important. But that's sort of not. That's sort of transparent to the user and sort of independent of their particular method of organizing things.
A
It just seems like still a difficult technical challenge given how different the content bases are.
B
Yeah, yeah, yeah. I think. Yeah, that took a lot of iteration. Yeah. The chunk sizing, how retrieval works, the different steps in the pipeline of retrieval. Yeah, there's a lot of iteration on that.
A
Ivan said I should ask you how many times you've rebuilt Notion and rebuilt your harnesses.
B
Yeah, yeah. It's kind of a running joke almost. I mean, we rewrite our AI harness probably every six months or so. And the time to rewrite has kind of been decreasing just because I think progress has been accelerating. I think this is honestly a really key thing and something that a lot of companies get wrong is just like, doing one thing and then just sticking with it. You really do have to keenly aware of what the current state of the model is and the technology is, and then designing the harness, the system and the product deeply around that. And it basically means you have to rewrite it every six months. And I find it pretty fun. It's part of the process. You get to restart and rethink it. We're working on. We're about to release a new version of our harness in the next week or two, and then we're already thinking about the one after that as well.
A
I think that leads to a set of questions I had for you on just how does Notion as an engineering and product and research organization work now that you have the power of coding agents as well? Because I imagine your willingness to rewrite the harness goes up dramatically. Agents are going to help me do it.
B
Yeah, that's extremely true. Yeah. I mean, yeah, it's been really fun to use the coding agents. I think the ambition of what I even consider building has gone up a lot.
A
What do you think has most dramatically changed in how you think about how engineering and product should work at Notion over the last two, three years?
B
Yeah, I mean, it's definitely changed multiple times. I mean, in terms of the coding agents, we kind of went through multiple eras. There was kind of like the TAB autocomplete era, and then we got into sort of inserting, rewriting some code. But it wasn't really until the agents started working. I would say, like early last year, we started to adopt the agents. Like, I started using cloud code, I think around April last year. That was a huge unlock. I would say the big shift there is that you can really push on getting these agents to end to end implement and verify and maintain stuff. But it requires pretty significant thought in terms of how you architect things and what is the verification loop. But the upshot is, I think if you do it. Well, you can be much more ambitious about what you're building and also make it much more robust than you could have done with humans writing it. And then the flip side is if you do it badly, it's all slop.
A
Does that change your lens of what teams should look like at notion like size, seniority, anything like that?
B
Yeah, I would say the fundamental effect is that everyone's individual impact in terms of their output can be much higher. And your output increasingly depends on your ability and willingness to use the tools. I think that's the fundamental thing that's happening. And then how does that play out? I don't think we've seen that much impact on the team size, really. I think we like to work in smallish tiger teams for the most part. I think if you can make teams small, it's almost always better. That was true before and I think it's still true. Maybe increasingly a little bit, but not that much, I think. Yeah, the main thing is to just really harness the tools.
A
Do you think something different happens to the median engineer in an organization versus the 10x engineer or the engineer 10x more willing to use the tools?
B
Yeah, I think the gap is bigger. You can be like 100 or 1000x engineer if you're using the tools right now. I think the gap is much bigger. The minimum bar has not changed, but the maximum bar has extremely increased. One impact it's had internally, I would say, is broadly, things feel a little bit more messy and chaotic, I would say. But I kind of love that. I mean, there's way more prototypes. People are like, for example, our design team made an entire git repo. They call it the design playground. And it's essentially like a simplified notion with a bunch of UI primitives in it. And they've made it really sophisticated. It has an agent in there and it's pretty cool because it allows them all the designers can spin up super high fidelity prototypes really quickly. And so it's no longer pointing at a mock and being like, how will this look like? They'll give you a URL to a prototype that's been deployed. And that sort of thing is true all the way up and down the stack for all of engineering. Just a little bit more chaotic, more stuff happening. All the PRs are more ambitious.
A
Do you draw a line somewhere about stuff that is more dangerous to touch or sensitive? Like there could be risk of data loss over here and not. Or is it kind of. You look at it all as it's fair game.
B
We still do reviews on all the pull requests, and I would say, and all the pull requests are now written by agents, they're often larger and more complex. That's the worst part. But the better part is that they're often much better tested and we can demand a much better testing for the things that merit it. I never produce a PR that hasn't been fully end to end tested anymore. And so you can get to a pretty high degree of confidence that it works. But it requires you're not just vibe coding by saying the thing you want. You're sort of thinking carefully about what is the change I'm trying to make and, and how can it be verified and how can it be deployed safely and then enlisting the agent to help you with that process.
A
When you think about where you said the general assistant doesn't quite exist yet, what do you imagine Notions agents being able to do over the next year or two that are still unblocked, they're still blocked by either capability or your harness work?
B
We struggled for a few years to build an agent and it always sort of worked, but then wasn't that useful largely just it was too early. So we tried to build an agent, I would say actually three or four times, and then we finally launched it last fall. So like last August, September. So if you use Notion AI now, it's like the full agent that has access to everything in Notion pretty much. So that totally works. I would say the, a lot of the original vision that we had totally works now. And it's fully shipped. Last August or September we shipped our personal agent. So it's pretty much every user in Notion has an agent and basically it has access to all the things that the user has access to. It can create a database for you, it can update things, create documents, it can search the web, do research. Then the second big thing that we just launched last week actually was custom agents. So you can basically, you can create a new custom agent, give it a name, and unlike the personal agent, by default it doesn't have access to anything. So you have to grant it access. But then once you do, it can actually run autonomously in the background. So for example, you can give it access to its own database to file tasks, let's say, and then you can attach it to a Slack channel and then it will start responding to people in Slack and filing tasks. That's one use case. Another one is maybe you could, you can give it access to a database of like weekly reports and then let it search the web or search your workspace. So it's sort of a custom agent, sort of represents some work or job, some knowledge work tasks that you want to be done autonomously. One thing I'm really excited about this going forward is we want it to be extremely good at sort of bootstrapping its own capabilities basically from an initial kernel, allowing it to basically bootstrap itself to do anything. Right. So even for example, maybe building an integration that we don't support yet, deploying that and then using it.
A
So you imagine that notion agents are actually the broader definition of agent, where writing code is a tool. It's close to.
B
Yeah, I think it's pretty key. Yeah. I think of coding agents as like the kernel of AGI. AGI will be a coding agent and code is just a really, really useful primitive for representing deterministic logic. The thing that's really exciting about it, replying it to a knowledge work agent, is that it can bootstrap a capability. So yeah, like I said, if integration doesn't exist, it can build it. If it needs to connect itself to a new data source, it can do that.
A
Given you have a notion, is at scale, but is operating in a landscape of productivity and platform players that are at even more scale. Right. Many of these will end up with their own agents. Lots of people from the labs, the Microsoft world are trying to integrate other data sources. This cross attempt to integrate and index. How do you think that plays out? What do you imagine that notion agents are best at or what they have the right to go do?
B
If you look at the landscape, I would say there's the labs and then there's maybe the, the software platforms and then there's maybe like infrastructure. In terms of the labs, we see ourselves as kind of like the Switzerland for models, we think. And our customers, they don't want to be locked into a certain labs model. They're always releasing new versions any given month. One is better than the other. So we want to be a place where basically you can easily get access to all the best models at any time and you can easily switch around.
A
Do you think open source plays into that as well?
B
Yeah, yeah, absolutely. I think the open source models are actually getting really good. There's like four different Chinese models now that are quite good. We actually just released one of them in our agent last week and we're going to do all four for sure. They're actually quite good and they're way cheaper than the frontier models. So I think there's a lot of use cases where you'd want that. We want to give that as an option in terms of the other. So we think of our role as sort of taking all the best models that we can, creating really high quality, state of the art agent implementations where people can easily and conveniently get access to them, and then making sort of a collaborative workspace that is really good for humans and for the agents to coordinate on. I think it's something that's very needed in the world and we're just trying to do it in a really tasteful, well executed way.
A
You were describing you need the index to make the agents good. You give the agents access to the tools that we humans have in notion. How do you think about the structure of notion and where it's useful or even not useful or relevant? For agents like blocks and databases and
B
such, it's all still pretty useful, extremely useful. There's been a challenge to sort of we want to make it really convenient for the agent. I think that's a new thing that didn't exist in the past. It was convenient for humans. And then we also made APIs convenient for humans writing code to use our API. So we essentially have a new customer which is the agent. At first that was definitely a problem. So for example, our API uses this crazy JSON format for blocks that by default is like crazy verbose and horrible for the agent. But we basically took on that challenge and designed just really convenient APIs for the agent. We created sort of a markdown dialect that looks like the default normal markdown, but it's sort of enhanced with all the notion blocks. And the models are really good at it. It works really well. So that's how it reads and writes the pages. And then for databases we, we use SQLite. So basically it gets to speak in SQLite, which also works really well. So the default thing did not work really well. But then we just took that on as an engineering challenge. And I would say now we have extremely convenient APIs that the agents are really naturally good at.
A
How did you understand or figure out what would make the API better for agents?
B
That's a good question. Yeah, I would say it's a combination of just trying things, it's very empirical. So we're just playing around and noticing, oh, it's not very good at that. Oh, that's way too many tokens. How can we make this smaller? And then a little bit of just first principles thinking of what is it the models are being trained on and what's in their prior, what do they know and what do we think it would naturally be good at? And how does the Agent loop work and what would be the convenient efficient pattern for accessing these things? And then just a lot of playing around.
A
I hear user research where the user is actually agent and then ongoing eval.
B
Yeah, you just chat with it. The user's always there, it's ready to talk to you.
A
Yeah, actually that is wonderful. Where you have infinite access to it.
B
You have infinite access to it. Yeah. And you can script and scale the access as well.
A
I assume you have. Actually, I know you do because you walked in, you're like, hey, I need to get access to WI fi. I need power. We can't block the agents while we're doing this. What do you have running right now? Tell me about your setup.
B
I'm working on a new prototype and so I have a couple agents working on that. And then. Yeah, my setup these days is just either claw code or codecs. I like the CLI tools. They're super simple and work pretty well. I'm pretty comfortable in the cli. And then.
A
Yeah, you don't need my generated game to teach you cli.
B
It's a very cool idea, I would say. Yeah. My whole goal these days is essentially to just have as many running as possible and to run them all the time. And you know, so for example, like every night before I go to bed, I'm like, okay, let's go guys. Yeah. Basically what I have to do is make sure that I've given it enough stuff that by the time I wake up in the morning it will still not be done. And so I've maximum.
A
That's victory.
B
Yeah, that's victory. So yeah, I've done that, I would say last five nights pretty well. My personal record is that I've had a coding agent running for, I think it was 13 days straight without stopping and just basically working through tasks.
A
Well, prompted. Yes. I admit to having woken up in the middle of the night at least multiple times this week and just being like, are you still going?
B
Yeah, I know, yeah, it's kind of nerve wracking. I always like, there's always like, I'll check it one last time before bed and just really make sure that it's still spinning.
A
What about on the notion agent side? Like, do you have a workflow there that is core to daily work?
B
Yeah, I mean, I mean I use our personal agent all the time, so it has all the context about our company and everything that's going on, you know. So like for example, last night I was asking it about how the custom agent launch was going and what the signals we're getting from it. We're super useful for that. And then I have many custom agents that are running. My personal favorite is I have an email triage agent. So it has access to all of my work and personal emails and it just wakes up every day and just archives all the stuff I don't need to see. I trained it over time to learn my preferences.
A
Do you actually label data for it?
B
It's pretty easy to do this, actually. So all you have to do is you make the agent and then you give it extra email and then you can make a blank page. It's like it's memory and you let it edit that page and then you just say, okay, now go look at my emails and then interview me, ask me which things you know. So sort of it will like propose things that it thinks it should archive and then you can kind of correct it. And then it will use that to essentially generate like a list of rules about, like, what it thinks are correct or not. And so for the first couple days I was sort of like correcting it on things. After a couple weeks or so, I dropped the approval entirely and it just automatically archives all the things I don't need to see now.
A
Wow, that's trust.
B
It completely solved my email problems because for me, I don't use email that much for work stuff. It's Mostly in Slack. 95% of the personal emails and work emails that I get I don't need to see at all. And so it's just a waste of time. And so it completely solved that. So now when I open my inbox, it's like, only stuff I need to see. I've got lots of custom agents running. There's another one that I built that triages customer all internal feedback and bugs. So we have a Slack channel where basically people just post random product feedback and bugs. In the past, it would sort of sometimes get answered, but then sometimes haphazardly get ignored just because there's so many teams, we're in things. So its entire job is just to route it to the right place. And it uses a similar sort of memory pattern where it sort of learns on the fly where it's supposed to file bugs. And then over time it's built up hundreds of rules that it just sort of learned over time. So for example, there's a bug about the mobile app. It knows to route to the mobile team and then file a task in their database.
A
Do you look at that? Like they generated an updated memory because it's legible to you to Say, did that make sense to me?
B
I think I did it. I did at first, but then sort of once you trust it's kind of working, you just, you kind of ignore it. And then if, if it ever breaks, I'll, I'll go fix it. It'll break every now and then and then.
A
But the benefit of not reading your email is here.
B
Yeah, just not read it. So, yeah, yeah, I, I mean, generally, I would say, yeah, the, the general pattern I follow is sort of I, I, I build it as a prototype, I have it in sort of like an approval mode where I'm sort of, you know, watching it closely, but then after it runs a bunch of times, you kind of trust that it's working.
A
Is there anything you do internally at Notion to make sure non technical teams have the intuition for how to build agents or how to express that productivity too?
B
Yeah, that's a great question. I mean, we do sort of workshops and hackathons pretty frequently. So for example, a month ago I did a hackathon with the People team and sort of got them. The People team has been amazing. They're actually one of the highest adopters of custom agents. They do all these kind of workflows in Slack and Notion, kind of manual work like that. And yeah, I would say, yeah, people are super excited to try it and sort of maybe just need a little bit of a push in terms of intuition and getting them started. But then honestly, I've been super impressed. I think the concept is kind of intuitive, sort of like once you get past sort of a little bit of the technical barrier of like what is a prompt and what is the agent and how does it get triggered and woken up and how does that even work? But then once you sort of get past that, I think it's actually a very human like interface.
A
Yeah. Maybe the biggest barrier is actually just getting people to try and assuming it's going to work at all. Right, yeah. You and Ivan originally met on the Internet, Tools for Thought Community. It feels like the tools we have for thinking are very different now as your core conception of Notion changed over the last few years because of all the AI stuff. What thinking does the tool do for you? Should agents do for you? What do you get to do?
B
Yeah, I would say changed quite a lot, broadly speaking. Before AI, our goal was to create the best tool for humans to directly perform their work. And then now the goal is to create the best tool for humans to manage agents, to do the work for them.
A
That's a big shift.
B
That's a pretty Big shift. It's pretty fundamental, but it turns out that you need most of the same primitives. All the primitives that we built are actually still extremely useful. It's more that we just needed some, some new primitives like representing what is an agent and how does it interact with your pages and databases. But you still need the same primitives. You still need a document. It's an unstructured way to write stuff. Agents love to write markdown documents. So it's still very relevant. And you still need a database, you still need structured data. If you're working with your swarm of 100 background coding agents, you don't want to have 100 chat threads, you want a Kanban board. It's the same as before.
A
Makes sense. You still need the coordination structure. What is one thing that just because you're ahead on this stuff and then trying to figure out how to bring notion and then users along with you. What is something that's really changed about how you personally build even in the last six months?
B
I mean it's completely changed. I haven't written code since last summer. I don't type code anymore. Yeah, it's completely shifted. I mean we went from humans type all the code to we're still typing but we tab complete to sort of we talk to the agent and it does little tasks for us, but we are still in the outer loop. And then now it's more like I design a end to end task that involves making some change and end to end verifying it. And then I'm just the outer verifier sort of double checking at the very end that it's correct and if it's going off the rails, kind of monitoring it. So it's a complete shift. I'm now the agent manager instead of the coder.
A
Amazing. Well, thanks, Simon. This has been a super great discussion about how we're all going to become agent managers and hopefully in notion.
B
Cool. Yeah.
A
Find us on Twitter nopriorspod. Subscribe to our YouTube channel. If you want to see our faces, follow the show on Apple Podcasts, Spotify or wherever you listen. That way you get a new episode every week and sign up for emails or find transcripts for every episode@no-priors.com.
Episode: From Coder to Manager: Navigating the Shift to Agentic Engineering with Notion Co-Founder Simon Last
Date: March 12, 2026
Hosts: Elad Gil, Sarah Guo
Guest: Simon Last (Co-Founder, Notion)
In this episode, Sarah Guo sits down with Simon Last, co-founder of Notion, to discuss Notion's strategic and technical evolution in the AI era. The conversation focuses on Notion’s integration of AI agents, the transformation of their engineering and product organizations, and emerging paradigms in tools for thought. Simon shares the journey from early experiments with GPT-4 to Notion’s latest agentic features, providing behind-the-scenes insights and candid reflections on the fast-moving landscape of agentic engineering.
GPT-4 as the Catalyst:
Simon recounts how experimenting with GPT-4 at a company offsite in 2022 “made it really, really real” for Notion's leadership, revealing its capabilities to follow instructions and deep world knowledge.
Short & Long-Term Vision:
AI Writer & Q&A Launches:
Craft, Iteration, and Attention to Detail:
Rebuilding for Progress:
Coding Agents as a Force Multiplier:
Team Structure and Dynamics:
Testing and Risks:
General & Custom Agents:
Bootstrapping & Coding Abilities:
Notion as Model-Agnostic “Switzerland”:
Workspace, Blocks, and Agent-Centric APIs:
Simon's Personal Agent Workflows:
Internal Adoption and Training:
This episode delivers a vivid snapshot of the future of software engineering, knowledge work, and product design in the agentic era. Simon Last’s experience highlights a paradigm shift from building tools for users, to building collaborative environments for users and AI agents alike — and positions Notion at the forefront of this transformation.