Summary6 min read

The AI Daily Brief: Artificial Intelligence News and Analysis

Episode: How AI Starts Doing the Work in 2026 With Anthropic CPO Mike Krieger
Host: Nathaniel Whittemore
Guest: Mike Krieger, Chief Product Officer of Anthropic
Date: December 24, 2025

Episode Overview

In this end-of-year episode, Nathaniel Whittemore (“NLW”) sits down with Mike Krieger, co-founder of Instagram and Artifact, and current CPO at Anthropic, to explore how AI’s role in coding, work, and enterprise is rapidly evolving. The conversation zeroes in on the emergence of agentic and “vibe coding,” the changing landscape for both enterprises and individual users, new interaction patterns with AI agents, and predictions for how 2026 could mark a tipping point where AI begins to reliably “do the work.”

Key Discussion Points & Insights

1. Anthropic’s Early Focus on Coding & Agentic AI

Clarifying Early Intentions
- Krieger emphasizes that Anthropic always believed coding would be central to AI’s utility, not just for software creation but as a problem-solving tool ([02:05]).
- “...For very powerful AI, you need the ability of the model to reason about things, to plan genetically and work for a long time horizon, but then also to be able to write and run code…” — Mike Krieger ([02:05])
The Claude Model’s Milestones
- The release of Claude 3 marked a turning point when the outside world realized these models could generate substantive code, beyond basic functions ([02:48]).
- Anthropic’s artifact product was among the first to enable interactive, AI-driven coding experiences outside a formal IDE ([03:01]).

2. The 2025 ‘Year of Agents’—Especially Coding Agents

Evolving Use Cases and Community Adoption
- 2025 saw “coding agents” quickly become the most exciting use case, confirming internal bets at Anthropic ([04:39]).
- Internal hackathons revealed a breadth of applications – from bioinformatics to “SRE-in-a-box” – demonstrating agent tech’s applicability far beyond just engineering teams ([08:01]).
Designing for Non-Existent Capabilities
- Krieger discusses Anthropic’s product principle “ride the exponential.”
  - The team builds products for today’s needs but ensures they will naturally improve as models do ([06:42]).
  - Sometimes improving the product meant removing scaffolding as models became more capable ([06:42]).

3. Expanding Accessibility: From Engineers to Tinkerers & Non-Technical Users

Non-traditional Users Adopting AI Tools
- Internal and external hackathons showed non-engineers quickly using these tools in creative ways ([08:01]).
- Anthropic rebranded “Claude Code” to the “Claude Agent SDK” to better reflect the diverse array of tasks agents tackled ([09:20]).
The Emerging Gap: Usability and Trust
- Current tools still most benefit “tinkerers” who bridge technical and non-technical roles ([10:04]).
- The challenge now is making these systems both more understandable and reliable for true non-developers.
  - “If you had a human coworker…sometimes it would sort of make a mistake you would never have expected…it'd be a pretty complicated relationship with that coworker.” — Mike Krieger ([10:04])
Changing Habits and Patterns of Work
- Even power users are only just beginning to truly rewire their workflows for “vibe coding” ([11:26]).
- Krieger shares a personal story: Once he’d built one tool in Replit using Opus, it became second nature to build more (even automating a Secret Santa for his family over breakfast) ([12:17]).
- “I do think…there's this sort of still like habit creation and adaptation of even knowing you can do that that we still need to close.” — Mike Krieger ([12:17])

4. The “Three Buckets” of AI Coding Adoption

Software Engineers:
- Fast adopters, iterative feedback cycles, push the limits of agent capabilities ([18:34]).
Vibe Coding / Builders / Product Managers:
- Need help climbing the complexity curve; often stymied by “magic words” or knowledge gaps ([18:34]).
Enterprise / Business Users:
- Focused on productivity, want high-quality outputs—not just new interfaces or chatbots ([21:25]).
Bridging the Gaps
- AI tools must help non-engineers “move up that complexity ladder in a structured, thoughtful way” ([18:34]).
- Krieger draws an analogy to Instagram’s own early growth pains—a product may initially be simple, but true value comes as it scales and infrastructure matures ([18:34]).

5. Enterprise AI in 2026: Infrastructure, Reliability, Integration

Big Trends for Enterprises
- Growing focus on deploying “horizontal agents”—copilot-style AIs that can handle both creative and repetitive back-office tasks ([22:16]).
- Enterprises move from “AI sprinkles” (chatbots, isolated features) to agent-native workflows that rethink core product or service design ([22:16], [23:55]).
Infrastructure, Retraining, & Data Readiness
- 2026 may become an “infrastructure year” for enterprises, focusing on making systems “AI-friendly” through better data annotation, process mapping, and robust connectors ([25:28]).
- “Figuring out what are the missing connector bits is going to be, I think, a lot of 2026.” — Mike Krieger ([25:28])
- Moving from data retrieval to AI-driven action and decision-making is the new frontier ([25:28]).

6. Roadblocks & Predictions for 2026

Blockers:
- Legacy systems, regulatory constraints, complex permission structures, and integrating with existing infrastructure ([27:00]).
- Anthropic is prioritizing the “distributability” of their agent tools, making them deployable in diverse environments ([27:00]).
From Tool to Colleague
- 2026 may be the year when AI transitions from just a tool to something that can “reliably take work off your plate” ([30:01]).
- Early evidence: GitHub partnership with Claude, where the agent autonomously handles pull requests ([28:42]).
- Krieger is optimistic but realistic—total autonomy remains over the horizon but is beginning to show in specialized workflows ([28:42], [29:50]).
  - “...you might delegate to somebody else—that’s very much around the corner.” — Mike Krieger ([28:42])

Notable Quotes & Memorable Moments

On Anthropic’s Product Philosophy:
- “One of [our principles] is ride the exponential, which is like we're trying to build products that both, you know, meet the moment…Maybe the ones that are a little early, we won't release yet, but that they can naturally improve.” — Mike Krieger ([06:42])
On Broadening AI Coding Beyond Engineers:
- “When we launched it we started seeing things externally too, like people using Claude code as their project manager...as a data scientist...So we started seeing this much more, it’s why we eventually renamed the underlying SDK…because we realized calling it code was doing it a disservice…” — Mike Krieger ([09:20])
On Changing Work Habits:
- “It's easier to build the incremental n+1, but…that first one…requires that sort of uplift if you're not in the habit.” — Mike Krieger ([12:17])
On the Reliability Challenge with AI Coworkers:
- “If you had a human coworker and…sometimes it would make a mistake you’d never have expected...you’d have a pretty complicated relationship with that coworker. I still think [we’re] in that phase…” — Mike Krieger ([10:04])
On Enterprise Blockers:
- “A lot of the work that we are doing for next year is...distributability...If we want to bring our intelligence and even our agentic primitives...into actual enterprise workloads, we need to really embed and meet them where they are.” — Mike Krieger ([27:00])
On the Vision for AI in 2026:
- “Reliably take work off your plate.” — Mike Krieger ([30:01])

Key Timestamps

Origins of Anthropic’s Coding Focus ([02:05] - [03:28])
Rise of the Coding Agent (Claude Code & Hackathons) ([04:39] - [09:20])
Non-technical and Tinkerer Adoption ([09:40] - [11:26])
Rewiring Habits for Vibe Coding ([12:17] - [13:36])
Three Buckets of AI Adoption ([17:23] - [21:25])
Enterprise Trends and Process Redesign ([22:16] - [25:28])
Technical and Organizational Blockers ([27:00] - [28:42])
From Tool to Colleague ([28:42] - [29:50])
2026 Vision Statement ([30:01])

Conclusion

Mike Krieger and NLW paint a picture of an AI landscape on the verge of transformation: 2025 was the year coding agents hit mainstream, enabling both technical and non-technical users to partner with AI in new ways. Yet challenges remain, especially around reliability, integration, and reimagining workflows. Enterprises will spend 2026 focused on infrastructure and truly baking AI into their core products and processes. As AI agents become more capable and trustworthy, the promise for 2026 and beyond is clear and ambitious—AI that can finally “reliably take work off your plate.”

Loading summary

Transcript54 lines

[00:00]
Host / Announcer
Today on the AI Daily Brief, the future of Vibe coding and what's in store with AI 2026 with Mike Krieger, the Chief Product Officer of Anthropic. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. Alright friends, quick announcements before we dive in. First of all, thank you to today's sponsors, Super Intelligent robots and Pencils and Blitzy. To get an ad free version of the show, go to patreon.com aidaily brief or you can subscribe on Apple Podcasts. To learn more about sponsoring the show, send us a Note@ SponsorsAidailyBrief AI and if you are interested in our forthcoming AIDB intelligence service, check it out@aidbintel.com now as we move forward into our end of year episodes, I'm excited to add a couple of conversations into the mix. You might know Mike Krieger as the co founder of Instagram Real Ones will also know him as the co founder of Artifact, an AI powered news app. However, for most of you right now, Mike's most important role is as the Chief Product Officer of Anthropic. In this conversation we talk about the origins of Anthropic's focus on coding, how enterprise AI usage has changed over the course of the year, and some of the trends that Mike is most excited about heading into 2026.
[01:14]
Interviewer / Podcast Host
All right, Mike, welcome to the AI Daily Brief. Great to have you here.
[01:17]
Mike Krieger
It's great, great to be here. Thanks for having me.
[01:19]
Interviewer / Podcast Host
Yeah, so, so this is a super fun. Like I was just saying, some of my favorite episodes of the year are these end of year episodes where we get to kind of think, think big, look forward and you. One of the big themes I think for, for me heading into the new year is sort of everything, Vibe coding, everything agentic. And so I was super excited to have you join the show.
[01:36]
Host / Announcer
What I wanted to do though is.
[01:37]
Interviewer / Podcast Host
Actually kind of go, go way back a little bit. I think, you know, a lot of folks see Anthropic as, as sort of the torchbearer in a lot of ways for AI coding.
[01:46]
Host / Announcer
And I wondered, you know, I was.
[01:47]
Interviewer / Podcast Host
Thinking about when you joined the organization and just how, how early was that sort of focus clear? You know, was that an emergent phenomenon as it became clear that there was something very differentiated in these models and that's how people were using it or is that sort of like intention from very early on that this is a broad sort of set of use cases that matter to you guys?
[02:06]
Mike Krieger
Yeah, the thing I always like to say, whenever there's sort of product folks inside Anthropic, they're thinking about sort of which direction to take things in, is the more you can align with the sort of company's general long term perspective about where powerful AI will come from, the smoother things will go. Because Anthropic is nothing but focused. Right. And I think that that's shown through in sort of the bets that we choose to make versus not. And definitely there's this belief that for very powerful AI, you need the ability of the model to reason about things, to plan genetically and work for a long time horizon, but then also to be able to write and run code, not only to produce software, but because it's a really useful tool for SOL problems. And so that belief was in there and it predates me. I joined in May of last year, but it kind of coincided sort of with the outside world realizing it because Claude 3, which had come out I think a month before that, was the first model. And I remember there was like that moment on Twitter when everybody said, oh, wow, this model can actually write not just like sort of function level, but like entire, you know, sort of files of code. And of course it compared to now, it was not very good at it, but it was already, you know, amazing what it could do then. And then we paired it with our first sort of more coding oriented product, which was artifacts, so you could have, you know, Claude kind of generate, you know, at the time was mostly react sites, you know, alongside the chat. And that was kind of, I think for a lot of people, the first moment they realized, oh, this is an interesting new experience of kind of coding alongside the model and not necessarily doing it in a, in a development environment.
[03:28]
Interviewer / Podcast Host
Yeah, it's interesting. I think you can in a lot of ways almost chart people's sort of the viability of a lot of this to key releases alongside Anthropic. You know, I remember when I first started this show, it was actually April of 2023 and already sort of agent coding was like the thing that people were most excited about, like GPT Engineer, which would later actually become sort of morph into lovable like 18 months later or something like that was. It was like my first viral YouTube episode was about GPT Engineer. And so it's interesting to see kind of like at each stage how more use cases get unlocked and sort of a broader set of people come into the fold. Coming into 2025, you know, I think that the odds on favorite for what the year was going to Be about at least if you had looked back at kind of all the AI content creators, it was going to be the year of agents, right? And I think looking back, it was, but it was the year of coding agents. Did you guys have a sense coming into this year that there's sort of, that this was poised to be kind of, you know, the significant use case or the breakout based on what conversations you were having, based on the capabilities that you were seeing?
[04:40]
Mike Krieger
Yeah, it's a great sort of moment to reflect because going into the sort of last couple weeks of the year, last year we had built something internally we called Claude Cli, which we later released as Claude Code. And that was the emergence of. That came from our Labs team, which is a team that really focuses on trying to do sort of disruptive zero to one ideas. And that was everything from like early computer use explorations and some wacky things and also this cloud clique thing. And between I think September, when the first version got sort of rolled out internally to December, it rapidly overtook every other sort of coding tool we had internally. And it was because it kind of had this bet that the models are going to be able to do more and more, maybe not this model, but the next one and the next one and the next one. So let's let the model cook for longer. Let's let it sort of act for, you know, longer periods of time. And so that, you know, going to the holidays, it was that question of do we release this, you know, like, do we now add a kind of third component to the product portfolio beyond just cloud AI and then and the API? And so that was the active conversation that was happening. But we really felt like if not us, then at least somebody using our models would sort of co discover this, this piece where you don't need to hold the model so closely anymore. You can let it operate over a sort of fuzzier task definition and over a longer time. It still needed a fair amount of handholding then, but you could see, you could see the shape of it. So it was definitely the. Coming into this year we felt like that was going to be a major shift in how people were going. Old software.
[06:04]
Interviewer / Podcast Host
Well, you know, it's a super interesting qu. One of the things, I mean you, you're sort of, you know, you have a deep product experience and one of the challenges I think now for product folks and just for entrepreneurs in general is there's this sense that to be successful you have to, you know, not just give lip service to the idea of skating to where the Puck is going, but actually sort of design and orient what you're building for capabilities that do not yet exist. And that's an extraordinarily hard thing to do. And it sounds like that was part of the genesis of cloud code, was just some sort of attempt or, you know, some like, you know, scratching against that itch in some way.
[06:43]
Mike Krieger
Yeah, we have product principles inside Anthropic and one of them is ride the exponential, which is like we're trying to build products that both, you know, meet the moment so they, they're useful today, or at least they, they poke at something useful today. Maybe the ones that are a little early, we won't release yet, but that they can naturally improve. And it's been interesting even on the cloud code side. We've deleted parts of the harness over time rather than added to it because the model can do more and it's really interesting. Also, we work with a lot of kind of downstream customers that are using the model and sometimes we'll drop a new model, a research model, and they'll say, oh, it doesn't look like it improved very much. And then we'll send some applied AI folks to spend time with them and they realize, all right, now we're actually harness bound and we need to actually let them evolve and let the model do a bit more to loosen that as well. But it's definitely an active conversation that we have with folks building on top of the platform. They have some visibility about where we're going. Maybe they'll be in a research program, early access, but they still have to do a fair amount of this. All right, so if the models are here now and I need to do this much additional scaffolding, what does this look like? If I need to do less scaffolding, is my product still useful in adding value and can the model then do even more for me, or is it now going to squeeze the piece that I thought I was adding value in?
[07:49]
Interviewer / Podcast Host
Have you been surprised at all with the way that people have used cloud code since you released it? Because, you know, it is much broader uptake than just sort of, you know, core audience of software engineers?
[08:02]
Mike Krieger
Yeah, absolutely. We internally, we had this internal project that people were using and then we like buttoned it up and put on a more fancy suit to be able to release it publicly. But then as you can imagine, the internal use cases kind of kept co developing and so we do like every two to three times a year we do a hackathon and it's been notable that every hackathon We've done has been around the time that some technologies like poised for a breakout. So the first one we did was around MCP and every single project was MCP based before really the rest of the world had kind of caught on to mcp. The second one we did was around the time that Claud code had been released. And what was really interesting was how many projects were not coding projects but they were using CLAUDE code as the underlying engine. So there was one that was using CLAUDE for doing bioinformatics which we later kind of channeled into cloud for life sciences. Another one that was using CLAUDE as a sort of SRE in a box and was able to use CLAUDE code as a way of, you know, looking at data sources. There was CLAUDE as a data scientist. There's all these sort of pop up projects that what was nice is that they didn't have to reinvent the tool used kind of bit that could just add value on top of that. And then when we launched it we started seeing things externally too like people using cloud code as their project manager, cloud code as their pm, cloud code as a data sciences externally. So we started seeing this much more. It's why we eventually renamed the underlying SDK to the cloud Agent SDK because we realized calling it code was doing it a disservice relative to what kind of use cases we were actually seeing.
[09:20]
Interviewer / Podcast Host
Yeah, so this is one of the questions that I'm most interested to see in coming year, but even the coming years is what, what it takes to kind of rewire people with these new tool set. It's like this, this whole language, this whole infrastructure that they have access to, especially if they, that they haven't before, especially if they're, they're not developers.
[09:41]
Host / Announcer
Do you think that some of this.
[09:42]
Interviewer / Podcast Host
You know, if on the spectrum from this early kind of usage of CLAUDE code for non coding use cases is tinkerers who are kind of, you know, more technical than they let on on the one end of the spectrum versus actually kind of heralding a different set of interaction patterns that people are going to have kind of. How do you see that evolving?
[10:05]
Mike Krieger
Yeah, I think it's early still. Like even when we look inside companies that have deployed like cloud for enterprise and they have builders within their sales team or their ads team or whatever, different non technical team, you will always find this sort of Persona which is the tinkerer builder like early adopter within that space that usually is not an engineer, doesn't even have an engineering background but has figured out enough and has like Learned the primitives and can then talk to Claude enough about how to fix these issues that they can then kind of build something pretty powerful. Whether it's, you know, automating part of what they were doing, sort of enriching what they were doing, making their teams lives easier, like all these different pieces. But it does still take that person, which I think is probably a natural part of the kind of software life cycle we are. I think there's still this gap and I think that's both a gap in interface in terms of how people think to interact with and how these products reveal their full capabilities and then also the actual capabilities themselves. Where if you had a human coworker and it was very creative at solving problems, you gave it a high level task, was able to do it most of the time, but sometimes it would sort of make a mistake that you would never have expected to make based on it having just done it great last week, you'd be have a pretty complicated relationship with that coworker. I still that phase still of this gap between understandability of these systems, but then also gap of, you know, how reliable and predictable are they when they do start working and can they feel more like a thing that gets just predictably better over time?
[11:27]
Interviewer / Podcast Host
Yeah, I think that's true. And I also think that there's just, you know, I don't have the exact right words for this, but there's some lag in terms of just, you know, unwinding and undoing. However many years or decades of the way that you've been doing a thing before that is, it just, it just takes time. You know, I think about it, you know, we're Now, I guess 10 months into vibe coding as a named phenomenon. Right. It was same February of this year, same month that cloud code came out. And I'm still finding myself as someone who literally, you know, podcasts about this every day and is living inside these tools. I'm only just now starting to find myself actively ask on a regular basis, like could I be building something to do this instead of using a Google sheet or instead of, you know, however I, however I used to do it. And again that's, that's me as someone who's as deep in this as you can get.
[12:18]
Mike Krieger
I think there's something really to the, you know, building with one tool that gets you comfortable with it and familiar with it. It's easier to build the incremental n + 1, but it's the, that first one that requires that sort of uplift if you're not in the habit. So I was working on a project over the weekend. I was using Replit and using Opus under the hood. And then, then I also needed to create a secret Santa for my family. And that, you know, because I had been in the tools already, it was over breakfast. While I was cooking eggs, I, you know, kind of kicked off this asynchronous request and by the time I was done, it actually had built the whole thing. And that was really cool. But I don't, I wouldn't have Reach as my first tool had I just not been, you know, sort of interacting with that same software. So I do think that there's this sort of still like habit creation and adaptation of even knowing you can do that that we still need to close.
[13:00]
Interviewer / Podcast Host
Yep, exactly. Sort of similar example, someone had mentioned building a gift tracker with, with one of these tools. And I love that because we always end up, it's Christmas Eve and I'm like, we're shoving presents in the closet for later because we just bought too many things. And so I copied that, I imitated it and it was probably for whatever reason, the first time in a month that I built something and just huge, basically since this last wave of sort of models had come out and within, I don't know, three or four days, I had like six or seven different applications that had just sort of like, you know, spiraled from there.
[13:36]
Mike Krieger
I think it's also interesting like when we, you've seen this not even just for our model launches, but other model launches, there's been the sense from people that are primarily maybe interacting with it, you know, in chat, and it'll say, oh, it seems a little smart, smarter, maybe it's a little bit warmer. But they're very sort of vibes based assessments. And if you're not sort of checking in and dipping back in and trying to build something with it, that's where I think you see the biggest leaps. And I definitely saw it over the weekend actually getting to build with Opus, you know, for real, over, you know, two days of getting really deep in there, I was like, oh, these are applications that would not have been doable even in Sonnet 4.5. It would have hit some ceiling or it would have gotten stuck in some loop. But then just watching it hit a wall, you know, debug itself, tail the logs, add debug, logging, roll it out again, pull up a browser, check the like all of these capabilities that I think have just either been co created along with the model or now the model is using better as it's improved, but you really got to sort of dip in and push it to really see that difference. But I think where there's some of that jadedness, I think from the outside sometimes about are we hitting a plateau? Whereas, you know, if you, if you checkpoint it, you can definitely see that continued improvement. Foreign.
[14:46]
Host / Announcer
Is brought to you by my company, Superintelligent. Superintelligent is an AI planning platform, and right now, as we head into 2026, the big theme that we're seeing among the enterprises that we work with is a real determination to make 2026 a year of scaled AI deployments, not just more pilots and experiments. However, many of our partners are stuck on some AI plateau. It might be issues of governance, it might be issues of data readiness, it might be issues of process mapping. Whatever the case, we're launching a new type of assessment called Plateau Breaker that, as you probably guessed from that name, is about breaking through AI plateaus. We'll deploy voice agents to collect information and diagnose what the real bottlenecks are that are keeping you on that plateau. From there, we put together a blueprint and an action plan that helps you move right through that plateau into full scale deployment and real roi.
[15:38]
Interviewer / Podcast Host
If you're interested in learning more about.
[15:39]
Host / Announcer
Plateau Breaker, shoot us a note. ContacteeSuper AI with plateau in the subject line Small, nimble teams beat bloated consulting every time. Robots and Pencils partners with organizations on intelligent cloud native systems powered by AI. They cover human needs, design AI solutions, and cut through complexity to deliver meaningful impact without the layers of bureaucracy. As an AWS Certified Partner, Robots and Pencils combines the reach of a large firm with the focus of a trusted partner. With teams across the us, Canada, Europe and Latin America, clients gain local expertise and global scale as AI evolves. They ensure you keep peace with change, and that means faster results, measurable outcomes, and a partnership built to last. The right partner makes progress inevitable. Partner with Robots and pencils@ropotsandpencils.com aidaily Brief this episode is brought to you by Blitzi, the enterprise autonomous software development platform with infinite code context. Blitzi uses thousands of specialized AI agents that think for hours to understand enterprise scale codebases with millions of lines of code. Enterprise engineering leaders start every development sprint with the Blitzi platform, bringing in their development requirements. The Blitzi platform provides a plan, then generates and precompiles code for each task. Blitzi delivers 80% plus of the development work autonomously while providing a guide for the final 20% of human development work Required to complete the Sprint. Public companies are achieving a 5x engineering velocity increase when incorporating Blitzi as their pre IDE development tool, pairing it with.
[17:07]
Interviewer / Podcast Host
Their coding pilot of choice.
[17:08]
Host / Announcer
To bring an AI native SDLC into their org, visit blitzi.com and press get a demo to learn how Blitzy transforms your SDLC from AI assisted to AI native. Well, so this is actually an interesting.
[17:24]
Interviewer / Podcast Host
Point at which to sort of maybe try to fork the AI coding or vibe coding conversation into a few different buckets. You've got sophisticated, the sophisticated sort of software engineering conversation around what AI coding is going to mean and sort of, you know, the autonomy spectrum and sort of where these models are and what they can do. And you almost have, you've gone through kind of a full cycle this year where you know, it was, you know, huge amounts of uptake, but now we're.
[17:50]
Host / Announcer
Kind of sifting through the new challenges.
[17:52]
Interviewer / Podcast Host
Right. You know, new technology doesn't solve all problems. It trades one set of problems for a hopefully better set of problems. But then you still have to solve those.
[18:00]
Host / Announcer
But then on the other end of.
[18:01]
Interviewer / Podcast Host
The spectrum you've got, it's incredibly nascent, the individual, the non technical usage of these tools. I think we've barely started to scratch the surface on those things.
[18:09]
Host / Announcer
And then somewhere in the middle you've.
[18:11]
Interviewer / Podcast Host
Got kind of enterprise usage which includes some of that software engineering reorganization, but also includes, I think, lots of folks who are thinking about how to use these types of tools for other aspects of the business rather than just sort of, you know, building software.
[18:25]
Host / Announcer
How much do you think these are?
[18:28]
Interviewer / Podcast Host
The same conversation versus, you know, again, three. Three kind of. Or two or three different conversations using all the same words.
[18:34]
Mike Krieger
Yeah, no, I think you're right. I think even if they have an underlying model that's the same and even if some of the other building blocks that you might use in there, either from an SDK perspective, they all sort of end up needing different applications or different sort of manifestations. They feel quite different. I think you're right. So on the, you know, software development side, you have in general like a pretty motivated population that has always been interested in tinkering with their tools. Like, hence the emacs versus, you know, vim, like, you know, tabs versus spaces. Like programmers maybe notoriously have the, the desire to sort of optimize their own building environment, which other disciplines might have, but not always to the same degree. And it's not always as easy to swap one thing out for another. So the adoption there, the evolution has become, I think this flywheel where it's really clear for, you know, engineers at Anthropic where the model needs to improve. And that's very helpful for us to coordinate with research and close that flywheel as well as, you know, feedback from external folks. In the middle piece there's still this sort of ceiling of complexity that you can hit now. There's been really impressive sort of 0 to 1 Vive coded, you know, applications that have even been released. But you still, still. I think the gap I was perceiving, even just like watching my wife, who's a sort of product manager or UX designer by training, not a software engineer, use some of these, is that you still sometimes need to know the right sort of magic incantation words to use. We were building a product together like a side project and the way it was using LLMs was just filling the context window and I was like, okay, well you actually probably need to move to some like semantic retrieval piece. But Opus wasn't suggesting that out of the box and she didn't know the magic words. And it took me saying, all right, we actually probably need to move to this embedding solution. It was just like a layer of complexity above. And so I think one thing that our models, maybe all these coding models can do a better job of in that middle category of helping non technical people build things that are, you know, effectively software is helping them move up that complexity ladder in a sort of more structured or thoughtful way. Where yes, the five coded front end only thing is great to show off an idea and then you want to persist data. Okay, that's the next step. Right now you want to persist data. You're thinking about launching this. That's going to take a whole other step of security reviews and thinking about things and like, oh, now you've launched it and the thing is melting under load. Okay, great. Now I got to put on my performance engineering hat and then build from there. In the same way with Instagram we went through that process of first we were just building the ui, then we built the backend, then we launched and it totally fell over because it got attention. And then we kind of rebuilt it over the next weeks and months to sort of manage. You got to speed run that but with model assistance now. So that feels like the big piece on the middle one and then the last one I think on the enterprise software side, you know, I saw you cover the famous MIT report of that like gap of expectations and I think that was such a, it was one of those things that was truthy and even if there was like sort of methodological problems underneath the study, it did point to something that a lot of people had, which is like, I got AI rolled out to me at work, but I'm not sure I'm more productive. And I think the place to close that gap, I think there's a bunch of things. One is just making sure the output quality is actually good enough, where you're saving yourself time where something is half done. I think for most people, they say, well, I probably would have been faster doing this myself rather than ending up with something that's like not quite there. And then I'm struggling to get it to iterate with me to where I need it to be. So a lot of the emphasis that we've been doing is actually less on the agentic side. It's less like take my two sentence description and generate an entire PowerPoint deck out of it. And it's much more, you know, require a little bit more upfront work, but really focus on making sure that that initial, you know, sort of thing that got created was high quality enough where you felt relief and happy that it saved you time rather than, oh man, I've just created more work for myself by using AI.
[22:01]
Interviewer / Podcast Host
As you look into 2026, where, where do you see enterprises starting this year maybe especially as compared to where they were starting in 25, what do you think the big, big goals are that you have sort of thinking about, you know, both model design, but also product design?
[22:16]
Mike Krieger
I think maybe two things that feel markedly different now versus a year ago. One is enterprises getting more interested in rolling out what we've been calling horizontal agents. But basically if you think about the sort of companion agent or copilot agent, where there's a strong human in the loop and you're kind of co creating either a document or an email or whatever that may be. Seeing also a lot more interest now in great. We have this repetitive back office task we're trying to scale up to handle international, know your customer requests, whatever. Those sort of complicated but repeatable processes that have something that is bespoke to that enterprise, but also something that is sort of regulatory. For example, we're seeing a shift there where there's a lot more interest and we've been deploying, you know, applied AI and engineers into these enterprises to help them get those agents running. And it's often about like sort of translating what those requirements are into that, you know, process again, where the model can be creative and flexible, but still repeatable enough to, you know, follow their Operating procedures. That feels markedly. A year ago we weren't really having really any of those conversations as well. And then the second piece, which also feels nascent is I think all of these enterprises, especially any that have this public facing product that they might be shipping, is kind of going beyond V1, which was like, let's kind of sprinkle AI on these different surfaces and hope that improves the product too. Do we need to rethink some fundamental pieces of the product to be more agent native, to use a buzzword. But what I really see it as is have you unlocked the full power of your product to any AI that is sort of running on top or alongside it? And we could talk about that, but I think that's a harder transition to make than. All right now we've got a sidebar that you can chat with your, you know, AI and kind of integrate with the, with the rest of the product.
[23:56]
Interviewer / Podcast Host
Yeah, it's interesting. I think that again, kind of looking back at maybe what, what expectations were versus what played out. Again, if, if we take the idea that 25 was the year of agents, but maybe a little bit differently than we thought. You know, one part of that was it was the year of coding agents, but another part of that was it.
[24:12]
Host / Announcer
Was also the year of agent infrastructure.
[24:14]
Interviewer / Podcast Host
You know, this is a year where MCP became ubiquitous more recently. I just today, before recording this, did a show about OpenAI adding skills support or, you know, starting to experiment with skills support. And it's very clear that everyone is much more interested at this stage at sort of building the necessary infrastructure to be able to move forward faster than in sort of getting waylaid in the sort of standards wars that we've had in the past.
[24:39]
Host / Announcer
And I wonder, it feels to me.
[24:41]
Interviewer / Podcast Host
Like we're poised a little bit for enterprises to almost go through their kind of infrastructure year in 26, where, you know, again, going back to the lesson at the MIT study or the truthiness of it, hold aside the specifics. The fact that it had such resonance suggests, and I agree with this, that there is something, you know, some gap there, I think that a lot of organizations are embracing now that it's just, you're not just going to drop a chatbot in or you're going to do that, but then to really go kind of to the next level, it's going to involve a much more sort of, you know, comprehensive review of how you do things. And it feels to me like, you know, perhaps that some, some amount of that process redesign is, is what organizations at least the Ones who are kind of, you know, ahead are going to be in 4 and 26.
[25:28]
Mike Krieger
Absolutely. And I was talking to somebody who runs technology at a large bank, and he was telling me that they had to rethink not just the data storage piece, which they'd already been doing a lot of work on, but also the sort of data annotation and sort of line piece to be more AI friendly. So that when you asked, you know, Claude, to, hey, help me construct, you know, a dashboard on this or help me understand this data query, even having that additional layer of annotation or understanding of what these different tables are and what they represent been a huge way to actually making that a useful sort of product. And so figuring out what are the missing connector bits is going to be, I think a lot of 20, 26, which is great. We have MCPS. We're seeing more and more enterprises wrap some of their internal services or internal data stores and mcps so they can get access to it inside, for example, Claude, now the next turn is that's maybe on the retrieval side. Can you actually start taking action and making it a useful participant in business processes by enabling it to either make a human assistant decision or queue up a decision that a human can confirm, whatever the right sort of metaphor human in the loop piece is, but moving up that complexity ladder so that again, it can actually start providing value that befits its level of in the discourse.
[26:42]
Interviewer / Podcast Host
I want to talk in our last few minutes about some of your predictions or thoughts about how 26 is going to end up differing from 25 with AI. And maybe just to get us started, we were just kind of talking about expanded enterprise use cases. But what do you think are going to be the biggest blockers for enterprises and how do you think they're going to get through them?
[27:00]
Mike Krieger
I think for a lot of enterprise that we talk to, there's still this gap between sort of the idealized great, if you ran this perfectly on this like one cloud with all, you know, your, you know, permissions all perfectly set, and you were okay with inference happening in this way, then we could unlock use case tomorrow. And the reality, which is there's legacy systems, there's often sort of regulatory reasons why they, you know, for example, will only run in this particular way on AWS in this particular kind of setup. And so a lot of the work that we are doing for next year is the word we've been using is distributability, which I think the spell corrector tells me is not really a word. But what we really mean is if we want to bring our intelligence and even our agentic primitives, you know, whether it's skills, whether it's the agent SDK, whether it's storage, whether it's memory, all of these pieces into actual enterprise workloads, we need to really actually embed and meet them where they are. And so there's a lot more work on, hey, let's actually like componentize this, make it available everywhere you see it now that we're on all three major clouds like that, that the general sort of set of projects is kind of closing those, those gaps because there is interest and especially from the sort of more sort of forward looking CTOs and CIOs, but they also do need to work with sort of the existing constraints and setup they have. And you can kind of get the pilots done in a like pretty rough and ready way just to prove it out. But to really reach that production scale.
[28:20]
Interviewer / Podcast Host
I think that's the biggest blocker tool versus colleague. This is something that, that, that we've been sort of talking about for a while and, and it's, I think this is maybe a false binary in terms of, you know, when we reach maturity of AI, but do you think that we'll start to see more of that kind of treating AI not just as a tool but as a thing that can take on ever bigger workloads? Do you think that that sort of starts to come to reality next year?
[28:43]
Mike Krieger
Yeah, I think that that probably more than anything is what will define the year, is you start seeing this already with coding. So we did this GitHub partnership with their agent HQ piece where now you tag cloud in a pull request and then you go have your coffee and you come back and it's done, whatever you needed to do. And we did the same integration with cloud code that's sort of pointing at the kind of interaction that you might expect now is it already going to be at the place where it can onboard onto the organization, understand the problem space, understand the sort of dynamics of all the relationships and just pick up work? No, I don't think we're going to be there. Maybe near the end of the year we'll have some kind of early glimmers of there. But I do think the sort of more piece of the job function that has a clean sort of great, we need to prepare this kind of report. Here's the work I've done already, here's where you can go get more information, here's what good looks like, report back to me. You know in a way that you might delegate to somebody else that's very much around the corner. And how we're thinking about a lot of our product strategy and going into next year is how do we enable that? What are the interfaces that we need to create that make that possible? And then what do we learn about what's working on the software domain that we can apply to knowledge work?
[29:51]
Interviewer / Podcast Host
This maybe asking you too much to put on a marketing hat, but if you add sort of a phrase for capturing, you know, what you hope AI does in 26, what would it be?
[30:01]
Mike Krieger
Reliably take work off your plate.
[30:04]
Interviewer / Podcast Host
I like it. All right, Mike, well, this is super, super fun conversation. Could go for another half hour hour easy. But appreciate you making the time and really excited to see where you guys cook up.
[30:14]
Mike Krieger
It was great to be here. Thanks for having me.