How to Build an Agent-native Product | Mike Krieger - AI & I | Wave AI Podcast Notes

How to Build an Agent-native Product | Mike Krieger

AI & I

Wed Mar 25 2026

Summary

Podcast Summary: AI & I – "How to Build an Agent-native Product"

Host: Dan Shipper
Guest: Mike Krieger (Co-founder of Instagram, at Anthropic Labs)
Date: March 25, 2026

Episode Overview

In this episode, Dan Shipper sits down with Mike Krieger to discuss the evolving landscape of software development in the age of agent-native AI products. Drawing on experience from Instagram’s early days to current work at Anthropic, Krieger explores what has fundamentally changed—and what hasn't—in product building, especially as AI tools accelerate prototyping, iteration, and scale. They dive into product design philosophy, team structure, feature curation, and the future of agent-powered interfaces, all while remaining candid about the challenges and trade-offs of rapid progress.

Key Discussion Points & Insights

The Changing Nature of Product Development

Prototyping & Acceleration:
- Today’s AI models (like Claude) greatly accelerate how quickly products can move from concept to completion.
- "You can get it to go…not zero to one, but zero to end pretty quickly over the matter of hours." (Mike, 02:31)
- Prototyping is less bottlenecked by engineering; the challenge is what to build, not how quickly.
Pitfalls of Overbuilding
- Acceleration can skip the intuition that incremental product development normally builds.
- "What would normally be this sort of incremental thing…You can actually kind of grow an entire tree indoors. And then you have this whole thing that…doesn't have the same level of intuition and exposure to experience at each step." (Dan, 04:47)
- Modern tools make it easy to add features, but not to curate—or cut—them.
Product Intuition Still Matters
- Intuitions and hard-earned lessons from real-world use are irreplaceable.
- "There haven't been a lot of breakout consumer products…even in the age of accelerated AI building…It just still takes time to hone your view about what sort of intervention you want to make on the world and then build from there." (Mike, 02:31)

Product Simplicity and the Art of Subtraction

Simplicity as Competitive Advantage:
- Story of Dan rewriting an overbuilt agent-native app from scratch, inspired by a much simpler, focused app (Monologue).
- "I just basically threw out the product and started over with…just a shareable markdown link…and…now we launched it and it just blew up." (Dan, 07:09)
Lean Startup & YAGNI Principles:
- Overbuilding in v1 is tempting with AI, but creates a brittle, hard-to-test matrix of features.
- "We way overbuilt for V1...just because you can doesn't necessarily mean that it should be in at least the first version." (Mike, 05:51)

Rethinking the Development Cycle

Embracing Rewrites & Iteration:
- Not taboo to significantly rewrite products, thanks to AI acceleration.
- "It’s no longer…a year long rewrite that might have killed a company like…Netscape…We’ve actually had several initiatives…built the full blown thing, realized we've overcomplicated…tore it down, done a V2…" (Mike, 08:59)
- Prototyping and iteration cycles are measured in days or weeks, not months or years.
Learning from Real User Contact:
- The value of shipping minimally useful versions early to allow real-world learning and validation.

Agent-native Products: Philosophy and Implementation

Definition and Principles:
- "Agent-native" means that anything a user can do, an AI agent can also do seamlessly.
- Importance of designing software primitives so agents can modify and operate on every function natively.
Evolving the Paradigm:
- Comparing Claude Code’s architecture (agent-native) to previous iterations.
- "It should have knowledge about itself and that unlocks so much capability…how do you imbue the software that Claude builds to be more Claude aware…and even just Claude agent native…because it still won't [by default]…" (Mike, 12:48)
Agent Native as a Test Case:
- Hard to anticipate or test for all emergent agent-native behaviors.
- Need for robust architecture to handle unexpected agent interactions.
- "It's much harder to sort of write an end-to-end functional test around an agent native product because part of it is that unpredictability." (Mike, 16:49)
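
The parity principle above (anything a user can do, an agent can also do, and the software knows about its own primitives) can be sketched in a few lines. This is only an illustration under stated assumptions: the `App` class, the action names, and the notes example are hypothetical and are not taken from Claude Code or any product discussed in the episode.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Action:
    """One primitive of the app, callable by a UI handler or by an agent."""
    name: str
    description: str
    handler: Callable[..., object]


@dataclass
class App:
    """Hypothetical notes app in which every capability is a registered action."""
    notes: List[str] = field(default_factory=list)
    actions: Dict[str, Action] = field(default_factory=dict)

    def register(self, name: str, description: str, handler: Callable[..., object]) -> None:
        self.actions[name] = Action(name, description, handler)

    def describe(self) -> Dict[str, str]:
        # Self-knowledge: the app can list its own primitives on request.
        return {a.name: a.description for a in self.actions.values()}

    def invoke(self, name: str, **kwargs) -> object:
        # Single entry point shared by the UI and the agent, so neither
        # can do something the other cannot.
        return self.actions[name].handler(**kwargs)


def build_app() -> App:
    app = App()
    app.register("add_note", "Append a note.", lambda text: app.notes.append(text))
    app.register("list_notes", "Return all notes.", lambda: list(app.notes))
    app.register("list_actions", "Describe the available actions.", app.describe)
    return app


app = build_app()
app.invoke("add_note", text="ship v1")       # e.g. triggered by a button click
app.invoke("add_note", text="cut a feature")  # e.g. triggered by an agent tool call
```

Because both paths go through `invoke`, a request such as "add this to my project knowledge" becomes something the agent can perform itself rather than explain step by step.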

Proof of Use and Robustness

Proof of Use as Value:
- Preference for proof of actual use/demo (e.g., Loom videos) over just passing tests for feature validation.
- "You want to have a playground within a safe environment. That's the only way you can have a playground is if it's safe around the edges." (Dan, 19:07)
- "It's not just proof of work, but it's like proof of thoughtfulness. Like, did you think this through?" (Mike, 20:13)
Robustness Over Time:
- "Does it feel like it's built on sand or does it feel robust? And the agent native part adds something totally even beyond that." (Mike, 22:47)
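
One way to picture the "safe playground" idea is a harness that drives the app through many unplanned action sequences and checks that a few invariants always hold, instead of asserting one scripted path. The sketch below is an assumption-laden illustration: the `Journal` class and its invariant are hypothetical, and a seeded random chooser stands in for the agent.

```python
import random
from typing import List


class Journal:
    """Hypothetical work-journal app state."""

    def __init__(self) -> None:
        self.entries: List[str] = []
        self.archived: List[str] = []

    def add(self, text: str) -> None:
        self.entries.append(text)

    def archive_oldest(self) -> None:
        # Guard: archiving from an empty journal is a no-op, not a crash.
        if self.entries:
            self.archived.append(self.entries.pop(0))

    def clear_archive(self) -> None:
        self.archived.clear()


def invariant(journal: Journal, added: int) -> bool:
    # The app must never hold more entries than were ever added.
    return len(journal.entries) + len(journal.archived) <= added


def run_harness(seed: int, steps: int = 200) -> int:
    """Apply a random action sequence and check the invariant after each step."""
    rng = random.Random(seed)  # seeded so a failing run can be replayed
    journal = Journal()
    added = 0
    for step in range(steps):
        choice = rng.choice(["add", "archive_oldest", "clear_archive"])
        if choice == "add":
            journal.add(f"entry {added}")
            added += 1
        elif choice == "archive_oldest":
            journal.archive_oldest()
        else:
            journal.clear_archive()
        assert invariant(journal, added), f"invariant broken at step {step} after {choice}"
    return added
```

Calling `run_harness(seed)` over a range of seeds exercises orderings nobody scripted. With a real agent in place of the random chooser, the invariants play the role of the safe edges of the playground.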

Team Structure, Roles, and Changing Skills

Smaller, Conviction-led Teams:
- Early phases benefit from solo or paired founders who hold the context and drive conviction.
- "Scaling the teams too quickly actually is a net negative…you [just] end up in this sort of meta coordination game." (Mike, 33:23)
Shifting Skillsets:
- With AI, hiring can now focus on great product sense and the ability to use AI tools, not just deep engineering.
- "We just hired a new GM who's…lightly technical, but he spikes super high on product and writing sense…and now we can hire someone like that where a year ago we wouldn't have been able to." (Dan, 24:21)
- Designers are moving into more hybrid builder-roles, writing as much code as engineers in some experiments.
Agency-Like Resource Model:
- Both Anthropic and Every use lightweight core teams, supplemented with shared designers, marketers, etc., as projects require.

Managing Features & Enterprise Adoption

The Difficulty of Unshipping:
- Need to be ruthless about removing features, but hard when serving enterprise customers.
- "Deleting features as a sort of imperative…if this is not working, let's go unship that." (Mike, 35:33)
- Legacy features may become load-bearing for certain customers; plugins/skills may allow keeping complexity out of the core.
Enterprise vs. Startups:
- Enterprise buyers may lag behind state-of-the-art; startups risk becoming outmoded if catering only to current needs.
- "You have to be willing to put out the V3 or the V4…a big rethink…Claude can help…host both for a little while…but then also be willing to cut over…" (Mike, 38:50)

Personal Agents, Ownership, and Trust

Emergence of Personal Agents:
- Open-source tools like OpenClaw demonstrate how quickly users form personal/identity-based relationships with agents.
- "My girlfriend's claw is called Shelly…there’s this thing that happens where it feels like it's mine…mirrors me in this way…" (Dan, 43:26)
Social Dynamics of Agents:
- Agents can reflect both the skills and trustworthiness of their owners, creating "shadow org charts" within organizations.
- "...everyone has a claw, their claw becomes known for and used for the thing that they're specialized at that per their owner is specialized at…" (Dan, 46:39)

Notable Quotes & Memorable Moments

"You can get it to go zero, not zero to one, but zero to end pretty quickly over the matter of hours."
(Mike, 02:31)
"If you grow a tree indoors, without it being exposed to wind, it doesn't get as strong…we've accelerated the pace of development so drastically, [but] you can actually kind of grow an entire tree indoors…doesn't have the same level of intuition and exposure."
(Dan, 04:47)
"Because vibe coding is so fun and so addictive, I just found myself being like, yeah, like I'll do this and I'll do this…and it just created this monstrosity that wasn't that good."
(Dan, 07:09)
"Instead [of adding the right feature], it just made for…something that felt really complicated."
(Mike, 08:59)
"Agent native means…the agent can do anything on your computer that you can do, and it's customizable and flexible and extensible."
(Dan, 11:38)
"It should have knowledge about itself and that unlocks so much capability in there as well."
(Mike, 12:48)
"It's not just proof of work, but it's like proof of thoughtfulness. Like, did you think this through?"
(Mike, 20:13)
"Scaling the teams too quickly actually is a net negative because they end up spending all this time on coordination…alignment conversations."
(Mike, 33:23)
"Being willing to delete code…they have deleting features as a sort of imperative."
(Mike, 35:33)
"It's like you have to be willing to put out the V3 or the V4 that is a big rethink of how the existing piece worked and then maybe have a transition period."
(Mike, 38:50)
"My girlfriend's claw is called Shelly. There's this thing that happens where it feels like it's mine, like it's really mine…has a personality that sort of like, mirrors me in this way."
(Dan, 43:26)

Important Timestamps

[02:31] – Challenges of AI-accelerated product building
[05:51] – The pitfalls of overbuilding and lessons from Lean Startup
[08:59] – Accepting rewrites and embracing iteration
[11:38] – Defining agent-native products
[12:48] – Claude Code and engineering for agent-native functionality
[16:49] – Why agent-native behavior is unpredictable and hard to test, and the need for robust architecture
[19:07] – Proof of use, not just proof of work
[24:21] – How AI is changing who and how you hire
[33:23] – Team size, scale, and coordination challenges
[35:33] – Feature deletion as core to modern product thinking
[38:50] – Startups, enterprise, and relentless product iteration
[43:26] – The psychology of personal AI agents

Summary

This episode provides a candid deep dive into the craft of building agent-native products in the AI era. The discussion balances optimism about rapid innovation with hard-won lessons on simplicity, team dynamics, and the critical importance of product intuition. If you want a playbook for harnessing AI to shape the future of software, this episode delivers, while also acknowledging where human experience and taste are irreplaceable.

Transcript

Mikey K (0:00)

The models today are good at adding features. They're not necessarily good at figuring out what to cut out of the product. You can get it to go zero, not zero to one, but zero to end pretty quickly over the matter of hours. It's made a lot of decisions along the way, and some of the sort of intuitions you build about what are the right things to put in there, I think you build over time. I feel like that is the art and science of software design in 2026.

Dan Shipper (0:36)

Work moves fast. And in the age of AI, the pressure isn't just to move faster. It's to make sure that what you send actually sounds like you. From emails to proposals to stakeholder updates, generic and rushed just doesn't cut it. If you've ever stared at a blank page, knowing exactly what you want to say but not how to start, Grammarly fixes that. Grammarly gives you one place to think, write, and finish your work, right where you already write. Most AI tools either take over or stay out of the way. Grammarly does neither. It helps you break the blank page, adjust your tone so a message lands right for the specific person reading it, and works seamlessly across more than 500,000 apps and sites that you're already using. It's loaded with agents built for every step of your process. And 90% of professionals say it saved them time; 93% say it helps them get more done. This is AI that works with you, not over you. In a world of generic AI, don't sound like everyone else. With Grammarly you never will. Download Grammarly for free at Grammarly.com. That's Grammarly.com.

Mike, welcome to the show.

Mikey K (1:39)

Great to be here. Thanks for having me on.

Dan Shipper (1:41)

Great to have you. I'm super excited. For people who don't know, you are the co-founder of Instagram, and now you are at Anthropic and Anthropic Labs. I've admired your work from afar, both at Anthropic and at Instagram, for a really long time. And you're obviously at the forefront of building products and AI. So thank you for coming on.

Mikey K (2:03)

Absolutely.

Dan Shipper (2:04)

Where should we start? What we were talking about just now in the pre-production is what has gotten easier and what has gotten harder, or maybe stayed the same, in product building as the underlying substrate, or the process by which we build products, has changed completely. So, like, tell me about your experience now versus, you know, earlier at Anthropic versus Instagram, and how you think things are changing.

Mikey K (2:31)

Yeah, I was doing the thought exercise a couple of weeks ago of, you know, we know the Instagram story. We had another product called Burbn. We worked on that for almost a year. It wasn't working. We pivoted. We basically spent three months building what became Instagram, launched it, and then scaled it. And asking the question: what is now trivial, and what was actually inherent in that building process that doesn't get easier? In that year, we probably could have hit some of the dead ends we eventually hit sooner. But there was value in getting there too, right? Like, we overcomplicated the product so that we then had to simplify it. I find even the models today are good at adding features. They're not necessarily good at figuring out what to cut out of the product. And that took a lot of just, you know, hitting actual real-world usage. And there was something about the process of incrementally adding things. Right now, I mean, today, especially with some of the stuff we're building in Labs, you can get it to go zero, not zero to one, but zero to end pretty quickly over the matter of hours. But it's made a lot of decisions along the way. And yeah, you can ask it to follow up with you and do input, but some of the sort of intuitions you build about what are the right things to put in there, I think you build over time. And so I've been reflecting, like, there haven't been a lot of breakout consumer products, even in the age of accelerated AI building. And I think part of it is because it just still takes time to sort of hone your view about what sort of intervention you want to make on the world and then build from there.

Now, the actual building part, once you know what to build, is of course so much easier. I had Claude basically rebuild Burbn. It took about two hours. It was feature complete. It added filters, which Burbn didn't have. We added those for Instagram. But I think it knew the eventual future of the product and decided to build that in. So I think that part feels really different.

But I think there's also, you know, I remember there was a week where Kevin went off and built all the filters for Instagram v1. I went off and built sort of the rest of it. And, you know, sitting there, I would stay up till 4am and then sleep till noon. That's like my natural day-night cycle. And in that process you're making so many decisions, like how should location work, and so on. And, you know, we've got to find a way of accelerating building while still helping people build intuition about those decisions along the way. Because otherwise, I think you either just get very generic products that are unlikely to break out, or ones that just don't reflect some deeper intuition that you come to about your space or your product.

Mikey K (8:59)

Yeah, just as a brief aside on that, I remember with Burbn, our biggest mistake was adding functionality over time rather than deleting it. Right? Because, oh, you know, eight features doesn't make for a good product, maybe the ninth one will. Instead it just made for, you know, something that felt really complicated. I mean, I think a couple of things. Part of how we're dealing with it is actually being more willing to do rewrites. You know, the classic Fred Brooks, The Mythical Man-Month: you shouldn't rewrite software because all the things that were imbued in v1 you're going to mess up. Yeah, yeah, exactly. And that whole second-system syndrome. And there is still a lot of truth to that. But one, you know, the models can help you sort of diff and basically see, did you miss anything that was in that first one? But second, you're no longer talking about a year-long rewrite that might have killed a company, famously, like Netscape. These are like days, probably, especially off a given source. So we've actually had several initiatives, usually pre-launch, rarely post-launch, where we have built the full-blown thing, realized we've overcomplicated it or made some kind of core assumption, and then torn it down, done a V2, and then iterated on it from there. So it doesn't surprise me that that's become sort of part of what you've had to do as well. But it doesn't feel as painful. You're not like, oh, a year of building this thing. It's like, oh, that was last week, and then I get to do it this week, and I get to cut out a lot of what was there as well.

I think functionality-wise, and how we're dealing with it from a product development standpoint, I think we are learning to launch earlier. And it's definitely a balance. You know, we've grown, we have a strong enterprise footprint. People have expectations about what the initial version is. But it means not assuming that we're going to know every connector or everything that we need to add to the product ahead of launch, because people still will absolutely surprise us, right? We have a strong contingent of, we call them antfooders, because we're ants at Anthropic. But that only gets you so far before you need that real-world contact. Like, take Cowork, for example. We'd been noodling on a product of that shape for a long time. And then once we decided, no, let's get this out, let's actually build the V1 that we think solves the problem in the most minimal way possible, getting that out in 10 days was really a good push. Yes, there are a hundred things that V1 should or could have had, but it didn't. And at the same time, it was useful enough to prove something out there. And I'm not sure developing it for another two months, adding, you know, 50 features, would have been more useful. In fact, the indoor tree would have been getting built, and then the second it hit real-world use, it's like, actually nobody wants to do that. They want to do this other piece. So I think the intuitions of the original Lean Startup ideas are still here. It's just that they manifest at a different timescale and in a different way.

Mikey K (12:48)

Yeah, there's so much in here, and I love the agent-native write-up you all did. It's, to me, the canonical exploration of this. So thanks for putting those ideas out in a really clear way. So I think there are a few threads to pull on. One is a conversation I had with somebody recently. They're a non-technical person, and they're like, you are talking about agents and all this stuff, but actually computers just work now. I always wanted computers to work and they didn't work, and now they work. And it's just a funny thing where, if you knew the incantations to properly get on the command line and brew install the thing... like, nobody is going to do that, but now Claude can do it for you, and therefore the computer now feels like a tool that is alongside you. And I think that core insight is more than even just adding power and functionality to new software. It's also just unlocking the functionality that always should have been there or available and just felt extremely hard for people. So that's maybe thought number one.

Thought two is actually comparing our products that do this well versus not. I think Claude Code does it well. I think Claude.ai still needs to evolve a lot. So as an example, I was watching somebody use Claude, and they were in a project and they had built, I think, an artifact or a new document, and they said, great, can you add this to my project knowledge? And Claude's like, yeah, let me tell you the steps to go add it to your project knowledge. Like, no, that should just be a thing that it can do really natively. And so I think even in that you see a product that was a 2024 product that has been iterated on and evolved a lot, but still, I don't think it has had baked in from the very beginning the idea that for every single one of its primitives it should have knowledge about them and the ability to modify them. And I think that's essential in products these days. I think Claude Code is the 2025 vintage of that, and I think there are even further aspects of it when you see some of the harnesses that folks are experimenting with, where it can actually sort of modify the harness itself. That starts getting to maybe the next level of that, where, you know, it's probably esoteric for most people, but even unlocking that functionality means that you don't have to sit there and be like, oh, I wish it did this a little bit differently, you know, I wish Gmail worked in this slightly different way, instead of just asking it to. And I think that feels like the big next thing.

Even within Claude Code, just teaching Claude Code about Claude Code was a really valuable experience. This definitely relates. This is now getting very circular and meta, but bear with me. I loved your write-up on agent native and I was like, I want this as a skill, so whenever I'm prototyping something it thinks in an agent-native way. So I had it package it up as a skill, and that whole process was, you know, hey, Claude, in Claude Code, can you create a skill for this? It's like, sure, I'm looking at my skill skill, I'm going to create a skill about it, I'm going to install it. I'm like, great, is that available now or do I need to reload? It said, right, I think you need to restart it. Let me check. Yep, you do. And everything was... it has knowledge about itself, and that unlocks so much capability in there as well.

Which maybe is the last thread to pull on. I think all of these could be hour-long conversations. One of the things that we're really thinking about in Labs is, how do you imbue the software that Claude builds to be more Claude aware, and even just agent-native-building aware, so that it even thinks to build in that way to start with? Because it still won't, partially because decades of software is not that. Right? So how do you get new software to have that principle baked in?

Mikey K (16:49)

Yeah, I think there's two parts to it. One is the more sort of mundane part, and the second one I think is the one that's more interesting and developing. The first one is, even just having good patterns and paradigms available to the model while it builds has been really valuable. Finding the right balance of templatized to skillified, right, and what that right balance is. One of the things that we have now is a skill about the Claude API, which sounds super obvious, but even just having that is really valuable, because you would sometimes find, you know, we'd launch a new model, it wasn't in the model's sort of innate knowledge, and then you'd get into these really funny arguments like, no, no, you made a typo, it's Sonnet 4 or 5. You're like, no, I know. It's like, no, no, no. So having that capability, having good templatized examples of that and skills, I think helps.

But then the second part is, what's also interesting is that this class of software needs a different type of test. Like, it's much harder to sort of write an end-to-end functional test around an agent-native product, because part of it is that unpredictability. And so another idea we've been kicking around a lot in Labs is, how do you increase the fidelity of the verification? The other day I had an agent-native iOS app that I was working on, and I was having Claude interact with it, and Claude ended up having a conversation with itself in a chat feature in the iOS app. It's very funny watching Claude talk to Claude, because it's like somebody pretending to be what humans are. This particular one was a prototype I was doing about sort of work journal reflections, and the Claude was like, yeah, my boss is really rough on me, I had a hard day. And the other Claude's like, oh, I'm so sorry to hear that. And they're just going back and forth. But you wouldn't have written a unit test for this, and, you know, maybe it would have come up with some other emergent idea as well. So I think you just have to go much more towards setting up harnesses that are actually exercising as much of that agent-native capability as possible, because you don't exactly know where things are going to go. And things are going to end up in a weird place where Claude's going to try to do something that you didn't even think it was going to do, and it might put your app in a new state. So maybe it's circling all the way back to what's still hard. Having the underlying architecture still be robust to that is really important. Right? It's agent native, but it's also able to flex in a way that you might not have anticipated, because you've got the right primitives, right? I feel like that is the art and science of software design in 2026.

Mikey K (20:13)

Yeah, I think there's probably like three layers to that. The first one is like, Claude, prove to me that you've exercised this in some way. You know, I've started doing that in all my prompts. When it's working on a feature, I'm like, and by the end, you know, before you PR, prove to yourself and then to me that it works as intended. Like, find the right way of doing it, which actually means you end up having to change the way you build and scaffold, asking, what is the right way to get Claude able to at least test this change succinctly? Rather than what it likes to do, which is, I read the code, it looks good. I'm like, you wrote the code, I don't trust you. So, you know, you've got to really test this thing. And then the second one is what you described, you know, everything having some sort of proof around, is it working as intended, and as you intended, too? Because Claude, or any of these models, is going to make a lot of decisions for you. And sometimes, you know, I'll have engineers on the team put up a PR and I'm like, oh, why did you choose to do this versus that? And many times the answer is they didn't choose. It was just the choice the model made. And maybe it was a reasonable choice, it was probably a reasonable-ish choice, but was it the optimal choice? Does it fit into the paradigm? It's not just proof of work, but it's like proof of thoughtfulness. Like, did you think this through? And I was talking to an engineer yesterday and they were like, I knew you were going to ask me a lot of questions about this, so I was reviewing what Claude had done so that I wouldn't be like, I'm not sure. You know, I don't push on that for most PRs, but when there's one that's like, oh, I'm refactoring this system and there's going to be these new primitives, like, great, let's make sure those are good and that you've thought through how they interrelate. Because it's very easy to end up otherwise with this sort of tower of assumptions that you're not fully aware of.

Mikey K (25:11)

Yeah, I love that. I think it's actually, you get pulled in two directions, but they're both important. There's the sort of primitives and architectural robustness, which I think still needs a sort of senior technical force. I was laughing with somebody. They're like, I thought, you know, my skills in distributed systems were not going to be useful anywhere. But actually those are maybe some of the most useful skills, reasoning about that and, you know, thinking things through. Like, I had a long debate with Claude last week around whether the system that I was building needed Redis or not, or could get away with just Postgres. And, you know, it was a healthy debate, but only because I was grounded in having used a lot of those technologies before. But then there's the other side of robustness, which is, have you just papered over all the problems with fixes to your system prompt and additional instructions, or have you architected the actual set of tools correctly? And so the latter is as important, and probably where this GM can be really valuable, in that, okay, I'm making changes. But just like you wouldn't patch flakiness in your distributed system by just being like, well, just retry it in five seconds, I'm sure it'll work, you also shouldn't do the same thing with "never ever," you know, all caps, "use markdown," or whatever the thing is that you're trying to patch. They're both actually symptoms of the same thing, which is, is the underlying piece robust or not? And Claude, actually I'd say this about all the models, but I think Claude could be much better at both. It's still a place that needs a lot of human oversight. On the systems part, it's now able to debug production systems, which is really valuable. But architecting them in the first place, I feel like, still benefits from somebody who's really thought these things through or has experience. And on the prompting side, I've seen people get into this dev loop, even internally here: here's the prompt, here's a mistake that the system made, iterate on the prompt. Its natural tendency is to just add more things to the prompt, and then eventually you just get to this thing where, you know, if you onboarded a new employee and you gave them 100 instructions on their first day, like, always answer in markdown, except when... they'll be like, I'm just going to remember the last thing you told me, or I'm going to short-circuit it. So then it's rethinking: okay, are these actually two different tools? Is it actually two agents that each have a smaller amount of context that you can then break apart?

So back to your original question. We're hiring for people with, you know, systems expertise, even within Labs, which you think of as more zero-to-one prototypes. It's still really valuable because, again, that robustness matters, and also just who's going to be helpful in sorting through systems permissions and provisioning and early testing. That stuff is still hard, even for Claude, when it can't edit the permissions itself, which it can't for good reasons. And then on the robustness side, actually, we've had a lot of success pairing our product teams with our applied AI teams. Our applied AI teams are the teams that are in the field every day helping customers iterate on their prompts. And we found that we're customer zero now for those efforts, because we have a lot of products that are very AI powered. So how do we bring that expertise in there? Because that expertise does not sit with our software engineers today.

Mikey K (35:33)

Yeah. And being willing to delete code. I think that's something that the Claude Code team has done really well. They have deleting features as a sort of imperative for people on the team, like, if this is not working, let's go unship that. You know, and it's often when you've created something else that, even if it doesn't entirely supersede it, does enough of what that other aspect does that it actually makes sense to deprecate and then remove that first one. It does get harder as we get more and more enterprise focused, even with these tools, because they come to depend on it. I'll never forget, one of the things I did maybe six months in, when I was still Chief Product Officer, was we did a big sort of redesign of Claude.ai, and we were so proud, and we shipped it, and we had a bunch of kudos, and then we got this really angry email from somebody like, I just recorded 20 hours of enablement content for my company for Claude Enterprise and I have to redo all of it. And we're like, oh, okay, you're playing at a different release cadence. And of course, shipping twice a year at one of our conferences is not an option. So we are going to keep moving quickly. But we've since learned to maybe moderate how we roll it out to the enterprise side a little bit more.

But yeah, I think with the unshipping piece, you end up with people who have built on it. I'll use an example. So there's a feature in Claude.ai called Styles. It's not widely used, but the people who use it use it a lot. And we've talked at different points like, yeah, does Styles still make sense in the product? There are other ways of accomplishing the same thing. There's custom instructions and projects now, there's skills now, there are so many other ways of accomplishing that. And I don't know how long Styles ends up in the product, but I know that the last time we talked about removing it, it ended up being really load-bearing for a few companies. Entire use cases, like, oh, we have our house style that the CEO personally authored and gives to every employee, and that's how they operate. And so finding ways of doing that is also really interesting. I would hope that in the long run what we can actually do is come up with a system of plugins and skills such that they no longer have to live in the core product, because I think it is always hardest to delete something that is in the core thing that you're shipping to everybody if you don't have the story around it. Great, you still like that feature. Awesome. Here's how you can keep using it forever on your own and keep iterating on it and make it your own. But it doesn't have to add complexity for every future person that's signing up for the first time.

Mikey K (38:50)

Yeah, no, this is such a good question, especially because then a wave will come, like being more agent-native, for example, and can you adopt it within your existing paradigm? Does it require you to throw everything out? And are you just stuck in that, like, "Oh, we kind of adopted it, we kind of bolted it on"?

I think a couple of things. For us, what we've started doing is basically treating it like this train's going to keep moving, and we'll provide enterprise toggles along the way, but the core of it will continue to evolve. And that's sort of the bet, the understanding, you're taking working with us. And I think that's been well received, because I think companies have also seen that things are moving so quickly that the only way they even get comfortable with a year-long commitment, for example, is to believe that we'll continue to evolve along the way. But then we'll provide, you know, Cowork is a great example, where from day one there was a way to turn it off for your employees if you didn't want it. And that's, I think, a reasonably good paradigm.

But the other one is, just as we were talking about, you can actually rethink and rewrite a lot of the stack, and I think companies should be way more willing to do that. And everything is getting compressed, right? In previous cycles there was the idea of having to fire some of your customers who might have been really into your product for a different reason than where you're going. But that was on a multi-year kind of time range, where it was like last year's product, versus now it's the product from three months ago. It seems crazy, but I actually think that's the way you have to think about it: you have to be willing to put out the V3 or the V4 that is a big rethink of how the existing piece worked, and then maybe have a transition period. And Claude can help, probably host both for a little while before it cuts over. But then also be willing to cut over and say, yes, this is how we think the future of this piece of knowledge work or this AI-powered manufacturing is going to be. We've got to keep it moving, or else, to your point, you're either going to get replaced by the next company that rethinks it from scratch, or you're replacing it yourself. And again, it's just the same old story, but now compressed to months.
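
The two mechanisms described in this answer, an admin off switch for a new feature and a transition period in which both versions are hosted before a cutover, can be sketched in a few lines. Everything here is an assumption made for illustration: `OrgSettings`, `feature_enabled`, `version_for`, the feature names, and the dates are hypothetical and do not describe how Claude or Cowork is actually configured.

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass
class OrgSettings:
    """Hypothetical per-organization admin settings."""
    disabled_features: set[str] = field(default_factory=set)  # admin off switches
    opted_into_new_version: bool = False  # early opt-in during the transition


def feature_enabled(org: OrgSettings, feature: str) -> bool:
    """A feature is on by default unless the org's admin turned it off."""
    return feature not in org.disabled_features


def version_for(org: OrgSettings, today: date, cutover: date) -> str:
    """Pick which product version an organization sees on a given day."""
    if today >= cutover:
        return "v2"  # after the cutover date, everyone is on the new version
    # Before cutover both versions are hosted; orgs stay on v1 unless they opt in.
    return "v2" if org.opted_into_new_version else "v1"
```

The toggle keeps the core moving for everyone else, while the cutover date makes the transition period explicitly finite.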

Summary

Key Discussion Points & Insights

Product Simplicity and the Art of Subtraction

Simplicity as Competitive Advantage:
- Story of Dan rewriting an overbuilt agent-native app from scratch, inspired by a much simpler, focused app (Monologue).
- "I just basically threw out the product and started over with…just a shareable markdown link…and…now we launched it and it just blew up." (Dan, 07:09)
Lean Startup & YAGNI Principles:
- Overbuilding in v1 is tempting with AI, but creates a brittle, hard-to-test matrix of features.
- "We way overbuilt for V1...just because you can doesn't necessarily mean that it should be in at least the first version." (Mike, 05:51)

Rethinking the Development Cycle

Embracing Rewrites & Iteration:
- Not taboo to significantly rewrite products, thanks to AI acceleration.
- "It’s no longer…a year long rewrite that might have killed a company like…Netscape…We’ve actually had several initiatives…built the full blown thing, realized we've overcomplicated…tore it down, done a V2…" (Mike, 08:59)
- Prototyping and iteration cycles are measured in days or weeks, not months or years.
Learning from Real User Contact:
- The value of shipping minimally useful versions early to allow real-world learning and validation.

Agent-native Products: Philosophy and Implementation

Definition and Principles:
- "Agent-native" means that anything a user can do, an AI agent can also do seamlessly.
- Importance of designing software primitives so agents can modify and operate on every function natively.
Evolving the Paradigm:
- Comparing Claude Code’s architecture (agent-native) to previous iterations.
- "It should have knowledge about itself and that unlocks so much capability…how do you imbue the software that Claude builds to be more Claude-aware…and even just Claude agent-native…because it still won't [by default]…" (Mike, 12:48)
Agent Native as a Test Case:
- Hard to anticipate or test for all emergent agent-native behaviors.
- Need for robust architecture to handle unexpected agent interactions.
- "It's much harder to sort of write an end-to-end functional test around an agent native product because part of it is that unpredictability." (Mike, 16:49)
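
One way to read the points above: if the interface and the agent share a single registry of actions, the agent can do whatever a user can do, and tests can check invariants on the resulting state instead of an exact sequence of steps. The sketch below assumes a toy notes app; `ACTIONS`, `action`, `add_note`, `delete_note`, `agent_tools`, `run`, and `invariants_hold` are hypothetical names, not taken from Claude Code or any product discussed in the episode.

```python
from typing import Callable

# One registry of actions; the UI and the agent both call into it.
ACTIONS: dict[str, Callable[[dict, str], None]] = {}


def action(name: str):
    """Decorator that registers a function as a named action."""
    def register(fn):
        ACTIONS[name] = fn
        return fn
    return register


@action("add_note")
def add_note(state: dict, text: str) -> None:
    # Ignore empty or whitespace-only notes so the invariant below holds.
    if text.strip():
        state.setdefault("notes", []).append(text)


@action("delete_note")
def delete_note(state: dict, text: str) -> None:
    if text in state.get("notes", []):
        state["notes"].remove(text)


def agent_tools() -> list[str]:
    """Tool names exposed to the agent: every registered action, nothing less."""
    return sorted(ACTIONS)


def run(state: dict, calls: list[tuple[str, str]]) -> dict:
    """Apply a sequence of (action, argument) calls from a user or an agent."""
    for name, arg in calls:
        ACTIONS[name](state, arg)
    return state


def invariants_hold(state: dict) -> bool:
    """Property that must hold whatever was called: every note is non-empty text."""
    return all(isinstance(n, str) and n.strip() for n in state.get("notes", []))
```

Because the agent's call order is unpredictable, a test in this style runs an arbitrary call sequence through `run` and asserts `invariants_hold` on the result, rather than asserting one expected path.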

Proof of Use and Robustness

Proof of Use as Value:
- Preference for proof of actual use/demo (e.g., Loom videos) over just passing tests for feature validation.
- "You want to have a playground within a safe environment. That's the only way you can have a playground is if it's safe around the edges." (Dan, 19:07)
- "It's not just proof of work, but it's like proof of thoughtfulness. Like, did you think this through?" (Mike, 20:13)
Robustness Over Time:
- "Does it feel like it's built on sand or does it feel robust? And the agent native part adds something totally even beyond that." (Mike, 22:47)

Team Structure, Roles, and Changing Skills

Smaller, Conviction-led Teams:
- Early phases benefit from solo or paired founders who hold the context and drive conviction.
- "Scaling the teams too quickly actually is a net negative…you [just] end up in this sort of meta coordination game." (Mike, 33:23)
Shifting Skillsets:
- With AI, hiring can now focus on great product sense and the ability to use AI tools, not just deep engineering.
- "We just hired a new GM who's…lightly technical, but he spikes super high on product and writing sense…and now we can hire someone like that where a year ago we wouldn't have been able to." (Dan, 24:21)
- Designers are moving into more hybrid builder-roles, writing as much code as engineers in some experiments.
Agency-Like Resource Model:
- Both Anthropic and Every use lightweight core teams, supplemented with shared designers, marketers, etc., as projects require.

Managing Features & Enterprise Adoption

The Difficulty of Unshipping:
- Need to be ruthless about removing features, but hard when serving enterprise customers.
- "Deleting features as a sort of imperative…if this is not working, let's go unship that." (Mike, 35:33)
- Legacy features may become load-bearing for certain customers; plugins/skills may allow keeping complexity out of the core.
Enterprise vs. Startups:
- Enterprise buyers may lag behind state-of-the-art; startups risk becoming outmoded if catering only to current needs.
- "You have to be willing to put out the V3 or the V4…a big rethink…Claude can help…host both for a little while…but then also be willing to cut over…" (Mike, 38:50)

Personal Agents, Ownership, and Trust

Emergence of Personal Agents:
- Open-source tools like OpenClaw demonstrate how quickly users form personal/identity-based relationships with agents.
- "My girlfriend's claw is called Shelly…there’s this thing that happens where it feels like it's mine…mirrors me in this way…" (Dan, 43:26)
Social Dynamics of Agents:
- Agents can reflect both the skills and trustworthiness of their owners, creating "shadow org charts" within organizations.
- "...everyone has a claw, their claw becomes known for and used for the thing that…their owner is specialized at…" (Dan, 46:39)

Notable Quotes & Memorable Moments

"You can get it to go zero, not zero to one, but zero to end pretty quickly over the matter of hours."
(Mike, 02:31)
"If you grow a tree indoors, without it being exposed to wind, it doesn't get as strong…we've accelerated the pace of development so drastically, [but] you can actually kind of grow an entire tree indoors…doesn't have the same level of intuition and exposure."
(Dan, 04:47)
"Because vibe coding is so fun and so addictive, I just found myself being like, yeah, like I'll do this and I'll do this…and it just created this monstrosity that wasn't that good."
(Dan, 07:09)
"Instead [of adding the right feature], it just made for…something that felt really complicated."
(Mike, 08:59)
"Agent native means…the agent can do anything on your computer that you can do, and it's customizable and flexible and extensible."
(Dan, 11:38)
"It should have knowledge about itself and that unlocks so much capability in there as well."
(Mike, 12:48)
"It's not just proof of work, but it's like proof of thoughtfulness. Like, did you think this through?"
(Mike, 20:13)
"Scaling the teams too quickly actually is a net negative because they end up spending all this time on coordination…alignment conversations."
(Mike, 33:23)
"Being willing to delete code…they have deleting features as a sort of imperative."
(Mike, 35:33)
"It's like you have to be willing to put out the V3 or the V4 that is a big rethink of how the existing piece worked and then maybe have a transition period."
(Mike, 38:50)
"My girlfriend's claw is called Shelly. There's this thing that happens where it feels like it's mine, like it's really mine…has a personality that sort of like, mirrors me in this way."
(Dan, 43:26)

Important Timestamps

[02:31] – Challenges of AI-accelerated product building
[05:51] – The pitfalls of overbuilding and lessons from Lean Startup
[08:59] – Accepting rewrites and embracing iteration
[11:38] – Defining agent-native products
[12:48] – Claude Code and engineering for agent-native functionality
[16:49] – Why unpredictable agent behavior is hard to test, and the need for robust architecture
[19:07] – Proof of use, not just proof of work
[24:21] – How AI is changing who and how you hire
[33:23] – Team size, scale, and coordination challenges
[35:33] – Feature deletion as a core part of modern product thinking
[38:50] – Startups, enterprise, and relentless product iteration
[43:26] – The psychology of personal AI agents