wavePod

How Devin replaces your junior engineers with infinite AI interns that never sleep | Scott Wu (Cognition CEO) - How I AI | Wave AI Podcast Notes

Back to How I AI

How Devin replaces your junior engineers with infinite AI interns that never sleep | Scott Wu (Cognition CEO)

How I AI

Mon Sep 08 2025

Summary

Podcast Summary: "How Devin Replaces Your Junior Engineers with Infinite AI Interns That Never Sleep"

Podcast: How I AI
Host: Claire Vo
Guest: Scott Wu (CEO of Cognition Labs)
Date: September 8, 2025

Episode Overview

This episode dives deep into how Cognition’s AI agent, Devin, is reinventing the way software engineering teams operate. Host Claire Vo sits with Scott Wu (CEO and founder, Cognition Labs) for a hands-on exploration of Devin—dubbed the ultimate “infinite intern”—and its transformative workflows. They discuss how Devin mimics a junior engineer, enabling asynchronous, multi-threaded productivity, and touch on best practices for task scoping, integrating AI into team culture, and even leveraging voice AI for collaborative meetings.

Key Discussion Points

Introducing Devin: The Infinite Intern

Devin as Async Junior Engineer
- Devin acts like a junior engineer: given a well-scoped task, it works independently and asynchronously.
- “Devin's going to start working and looking through the code... it's just as if you gave your intern a project and your intern is going and working on it.” — Scott Wu [00:00]
Multi-threading Tasks
- Teams can spin up multiple Devin sessions in parallel, substantially increasing throughput without needing to “babysit” each task.
- “You can multi-thread a lot with tools like this and set 2, 3, 4, 5, 10 of these going at once on different projects and not feel like you have to sit there and babysit things.” — Claire Vo [00:32, 13:01]

Scoping Work for AI

Tasks vs. Problems
- Devin excels at executing clear, specific tasks built from well-defined specs—its strength is in “tasks, not problems.”
- “Devin is not going to go and solve some, you know, really hard architectural problem... But where Devin really shines... is kind of like tasks, not problems.” — Scott Wu [04:04]
Using AI-Generated ‘Deep Wiki’
- Cognition's Deep Wiki creates AI-generated documentation for any codebase. It's a foundation for both understanding context and refining tasks for Devin.
- “You can come in and get a full AI generated documentation of the repo.” — Scott Wu [06:48]
Prompt Refinement
- Instead of sending five-word prompts, users are encouraged to gather context, build a richer prompt, and then hand off to Devin for asynchronous execution.
- “Take this prompt and turn it into an effective prompt given the context... It feels like extra friction ... but I think pretty soon is one going to be the job to be done of the tool itself.” — Claire Vo [10:27]

The Async Engineering Workflow

Synchronous Setup, Asynchronous Execution
- The ideal workflow: research with Deep Wiki/search (sync), build the prompt, then let Devin work independently (async).
- “If you had an intern... Often what you would actually do is sit down with them, talk it through for two minutes... then you kind of hand off there.” — Scott Wu [11:24]
Freeing Up Human Engineers
- While Devin runs, humans can focus on meetings or other tasks, then check back to review outcomes (e.g., pull requests, bug fixes, front-end changes) generated by Devin.

Operationalizing Devin in the Team

Devin as First-Line Responder
- Devin is tagged in Slack channels for new issues, crash monitoring, and even mundane engineering toil—becoming the “first person” on the task before escalating to a human.
Collaborative, Public Workflows
- Encourages using Devin in public channels rather than private DMs to increase AI adoption and cross-team learning.
- “Hiding your AI use is kind of the worst thing you can do... So I say do it all in public.” — Claire Vo [24:56]
- “Devin is a naturally multiplayer experience... you’ll often have a few different folks going back and forth... Devin is just one of the players in [the] thread.” — Scott Wu [19:19]

Top Use Cases for Devin

Scott Wu’s Top 5:

Front-End Fixes
- “You tag Devin, you explain 'here's a screenshot, I want to make this button rounder'... and it'll go and do that.” [26:09]
Version Upgrades & Migrations
- Automates tedious dependency updates and finds necessary changes across the repo.
Documentation Generation
- Writes and maintains technical docs, including for DeepWiki itself.
Incident Response
- Acts on PagerDuty alerts, investigates crashes, and generates incident reports—often before a human is notified.
Adding & Running Tests
- Creates unit tests, runs them locally, and iterates until CI passes.

Claire Vo’s Additions:

Rubber Duck Debugging
- Devin is always available for troubleshooting and ideation, even during off-hours.

Making Voice AI Socially “Normal” in Meetings

ChatGPT Voice as an “Extra Attendee”
- Scott uses ChatGPT’s voice interface during meetings to surface info or answer questions instantly—making AI participation a shared, inclusive experience.
- “I almost think of ChatGPT voice as a better Google. You can get an even faster answer ... fully synchronous, you can do it in the conversation.” — Scott Wu [32:36]
- Claire: “If you flip it ... this is just another meeting participant that I'm putting into the room, it actually is more socially inclusive.” [35:15]

Future of AI Engineering Interfaces

Form Factor Evolution
- Scott envisions a future where the “agent” is the atomic interface, erasing the need for IDEs or even code:
  - "Tony Stark doesn't have a laptop. ... At some point, if you have your Jarvis plugged in... you’re just looking at your own product and saying, hey, let’s make this button rounder." [36:14]
For Today: Start Where You Work
- Slack/issue trackers for leads/managers; IDE extensions for IC engineers.

Tips for “Prompting” and Unblocking Devin

When Devin Gets Stuck
- Trace through Devin’s steps to see where it erred, provide missing info, and reinstruct—akin to pair debugging with a junior teammate.
- “With an agent... you can go through and look through all the history... and then you understand, oh, Devin was missing the link to this page.” — Scott Wu [38:51]

Memorable Quotes & Moments

“Devin's my favorite intern on my team and I have infinite of them.” — Claire Vo [00:10, 06:09]
“Devin is a junior engineer. We're working on getting Devin to senior engineer—obviously, we'll get Devin the promotion!” — Scott Wu [04:04]
“I DM Devin all the time. It's because I have no employees, no one to talk to. He's my only buddy.” — Claire Vo [22:30]
“Hiding your AI use is kind of the worst thing you can do... do it all in public.” — Claire Vo [24:56]
“The way that I like to say it is Tony Stark doesn't have a laptop. ...At some point, if you have your Jarvis plugged in... you're just looking at your own product.” — Scott Wu [36:14]

Notable Timestamps

00:00–00:32 — Introduction to Devin and async, multi-threaded workflows
06:09–08:40 — Devin’s strength in executing “tasks, not problems”
10:27–13:01 — Importance of context-rich prompts and async handoff
14:10–15:31 — Real-life async-use; checking in after meetings
18:17–22:30 — How Devin is institutionalized as the first responder
24:56–25:49 — The value of public, multiplayer AI collaboration
26:09–30:41 — Scott and Claire’s top Devin use cases
32:36–35:15 — Using ChatGPT voice in meetings, lowering social frictions
36:14–38:51 — The future interface of AI engineering and handling AI frustrations

Conclusion

Scott Wu demonstrates how Devin can act as an “infinite intern,” powering productivity by tackling well-scoped engineering tasks asynchronously. From automated bug fixes to documentation and incident response, Devin frees up human engineers for higher-order problem-solving. Integrating Devin publicly across teams enhances AI adoption and organizational learning; meanwhile, the embrace of voice AI hints at a future where human-computer collaboration is seamless, social, and increasingly natural.

Where to find them:

Twitter: @Cognition, Devin AI
Podcast and transcripts: How I AI

Loading summary...

Transcript

A (0:00)

Devin is Async. Once you kick off a Devin session, Devin's gonna start working and looking through the code, but you're not expected to be there with it. It's just as if you gave your intern a project and your intern is going and working on it.

B (0:10)

Devin's my favorite intern on my team and I have infinite of them. Why don't you pick a task that you might bite off for your product and show us how you would work through that end to end?

A (0:20)

I'll say please go research the chat PRD MCP server so this will produce a pull request for us. Often you're running a few of these at once, just like a nice way to have multiple tasks going and then check in on each of them.

B (0:32)

One of the benefits of this from a How I AI use case is you can multi thread a lot with tools like this and set 2, 3, 4, 510 of these going at once on different projects and not feel like you have to sit there and babysit things welcome back to How I AI. I'm Claire Vel, Product leader and AI Obsessive, here on a mission to help you build better with these new tools. Today is a very special episode for me because we're talking to Scott Wu, CEO and founder of Cognition Labs and the builder of one of my favorite AI products, Devin. We're going to hear about how Scott uses Deep Wiki and Devin to kick off well scoped tasks to get things done. Uses Devin as his favorite and most tagged employee inside of Slack and how he's making it not weird to bring ChatGPT voice into your meetings. Let's get to it. This podcast is supported by Google.

A (1:29)

Hey everyone, Shrestha here from Google DeepMind.

B (1:32)

The Gemini 2.5 family of models is now generally available. 2.5 Pro, our most advanced model is great for reasoning over complex tasks.

A (1:42)

2.5 Flash finds the sweet spot between performance and and price and 2.5 flashlight is ideal for low latency high volume tasks. Start building in Google AI Studio at.

B (1:54)

AI.dev Scott thanks for joining How IAI AI as Devin's number one reply guy on X. I am really excited about this conversation and for you to show off how your company uses and you use the product, that at least makes me very happy and I'm sure makes lots of software engineering teams out there very happy. So welcome.

A (2:17)

Thank you so much for having me now. I'm honored to be here. Honestly I'm a big fan of you guys and all the work you do.

A (4:04)

The way that we like to describe it is Devin is a junior engineer. And so Devin is not going to go. And, you know, we're working on getting Devin to senior engineer, obviously, you know, we'll get Devin the promotion and everything, but, but, but like, Devin is not going to go and solve some, you know, really hard architectural problem or make some big strategic decision that you, you know, you're going to make and then kind of like execute on for the next month. Like, you probably want to be involved in those as well. Devin can help you with the decision, obviously by, by kind of like referencing the right things or giving a few things as an input. But I Think where Devin really shines is one way that we say also is kind of like tasks, not problems. And so often when you have a very clear like here is exactly what we need to go do and here's the task and here's all the details of what we need. Devin is really great at going and executing that for you and makes that much faster. And so actually I think the next question that comes to mind then is like, how do you figure out the spec or the task exactly. That you want to do? And so a lot of the other tools like wiki and search really are here for you to be able to kind of like ask the right questions that you want about understanding the code base or what needs to be done and then putting a task together. I think in practice, like a lot of the use cases that we see all the time are, you know, probably number one is just crawling through your issue backlog. You know, whenever you have an issue that comes up or we have a lot of slack channels where we talk about issues and then on every single one of them we just tag Devin as the first pass. And so that's a big one. And so like, you know, someone says, oh, you know, we need to go fix this thing in the front end or you know, maybe we need to go support this other, you know, support this other MCP for example, which we'll show in a second, things like that. And then for a lot of the other kind of like I'll say like engineer and toil use cases, it also does really, really well. And so often that's like, you know, going and doing a version upgrade or added documentation throughout, you know, your, your, your repo or adding unit tests for a specific thing that you have up or responding to, you know, a crash report that just came up and trying to diagnose what went wrong.

A (11:24)

Yeah, for sure. And I think it's, you know, it's a great call because you know, as we said, Devin is async. And so from this point onward, the nice thing about this is, you know, Once you kick off the Devin session, Devin's going to start working and looking through the code and reading online about chatbod, for example, right? And it's going to do all this, but you're not expected to be there with it, right? And so, you know, it's going to work on its own. It's just as if you gave the. Your intern a project and your intern is going and working on it, and so they can ping you on Slack and ask you if there's questions or something. Or you can go kind of like, you can go take a quick look and see how your intern is doing, but you don't have to be sitting there with Devin for every step of the way here. And so one way that we kind of describe it is for a lot of tasks, there's often this sync component, like the synchronic component, and then this asynchronous component. And a lot of what search and wiki is for is for doing the synchronous part of the task before you do the asynchronous. Right? And so, like, if you had an intern, for example, would you just send them 5 word slack message and just leave it at that? Maybe sometimes for something that is like, you know, super clear and then, you know. Exactly. Often what you actually would do is you would sit down with them, talk it through for two minutes and be like, okay, yeah, like, you know, you know how we have this MCP marketplace and then we go and look at it together, you know, we read the pushing error line of the code and then you say, okay, yeah, so let's add check PRD to this and you know, take a look at how that MCQ server is implemented and make sure we add it to the list. And then you kind of hand off there, right? So you kind of have the first two minutes of going back and forth with Devin, your intern, and then as soon as you hit go on the Devin prompt, you're kind of expecting it to be more of an asynchronous thing where you don't have to be in the loop.

A (19:19)

For us, a lot of it is just setting the right workflows in our Slack and in our org and so on. And so Devin obviously has knowledge, which means it'll learn your code base over time as you keep working with it, or you can kind of give it more details about how certain things work. And a lot of things, it's almost just like institutionalizing Devin as first line of response is how I would describe it. And so I could show a few examples. The big thing is to really get to the point where for a lot of these different things that we file, you know, like Devin is first person that gets tagged on all of these. Right. Devin won't be able to do every single thing, you know, on one shot on the first try. But often you're working back and forth with Devin and Devin puts up a PR and if there's some slight touch up that you have to do at the end or that you have to build, then you're able to do that. And so we have a ton of channels where we go and talk about issues or various things that we need to build or things like that. We have one for all the crashes that come in. We have one for core infrastructure things that come up. This is the one for our web app, which is hopefully a little bit less sensitive. And you can see here basically every single thing that folks talk about. And remember, we do, you know, we start in Devin Session and so it's like, hey, you know, can you standardize the font size, spacing and style for these three levels? Right? And then, you know, we just go and start the Devin session and Devin will make the pr. It'll go through the pr. This one gets merged because. Because there's some back and forth feedback here. And so, so, so like Devin goes and edits. He imports up. And see, Devin made this br. There were a couple back and forth edits. And then Dave, our engineer went and merged this. This is often how we work. It tells. This is another good example. Hey Devin, can you make it so that when you come in, click on the notification, it takes you to that in a new tab natural feature. Probably one of our users requested it. And you just started Devin Session. And Devin will give you this progress update of. Here's what I'm doing so far. Here are the files that I'm looking at and here's what I see in this case, by the way, it's actually confidence medium. And then Walden says, oh no, no, no, you should take a look at this thing instead. One of the cool things I want to point out too is because of this, Devin is a naturally multiplayer experience. And so we will often have a few different folks going back and forth or if somebody else is looking at this issue, or if somebody else is the expert on this part of the code base, they'll go and give their own kind of input here and Devin will just go back and forth with them as well. And so really it is just a thread where a group of you are communicating and figuring out how to. How to work on this issue. And Devin is just one of the players in thread, right? And so, you know, Ethan comes into Walden's thread here and says, hey, make sure to use a link element from Tanstock Router and then gives that feedback, right? And then Devin goes and makes that change in the pull request. And so you can say, see, Devin had like an initial thing and then had some additional commits and it went and did this link from Tanstack Router instead.

B (22:30)

As an AI founder You're used to sprinting towards product market fit, your next round or that first enterprise contract. But speed isn't enough for AI startups. Buyers expect security, compliance and transparency from day one. That's why serious AI startups use Vanta. With deep integrations and automated workflows built for fast moving AI teams. Vanta gets you audit ready fast and keeps you secure with continuous monitoring as your models infra and customers evolve. AI innovators like LangChain, Rider and Cursor scaled faster and closed bigger deals by getting security right early. With Vanta, listeners can claim a small special offer of $1,000 off vanta@vanta.com howi AI. You know one of the things that I like about this and again, kind of a a shout out and our use case for folks that are trying to drive more AI adoption in their teams is doing this as much as possible in public is really helpful from a learning perspective. So one of the experiences I had running the engineering team at LaunchDarkly was when we started putting Devin and Devin like agents in public channels. We saw a lot more adoption and upskilling of our team on how to actually talk to these agents, how to get the right outcomes. And so, you know, I, we were talking earlier and I was saying I DM Devin all the time. It's because I have no employees, no one to talk to. He's my only buddy. Um, but I dm, I DM Devin all the time. And we have these sort of like side conversations. He's sort of my intern on the side, but in larger organizations I was very much a do it in public channels, do it where people can see it. Because not only does the work get done and it's nice kind of muscle memory to tag in these tools immediately, but also just learning how you use them, what is an effective prompt, what are the kind of things that it's good at and not good at is really useful for just overall engagement with these tools. And so I think hiding your AI use is kind of the worst thing you can do. You can do it in. Org. So I say do it all in public.

A (27:41)

Yep, yep. Fixes new components, changes that you want to make in your front end. It's super, super nice because yeah, as we've seen, you can just kind of do this all inside, basically. And so that's probably number one for me. I think number two that comes to mind is version upgrades, migrations, things like that. And so, you know, like upgrading your Node version or getting onto the latest packages and so on, it's a big time saving. We all have to do it. And then somehow these new packages just come out so quickly. But obviously the devil is in the details of finding this new version. Will say, oh, every instance of this component is, we recommend that you use this structure instead or something. And Devin will be able to go through that and do the semantic search and find each of the components and make the right changes. Number three, I'd say is documentation. Big one as well. And so we have our Devin docs, for example, our own kind of docs page, like the external docs page. And I mean Devin has written the entire thing. DeepWiki itself obviously is kind of an extension of that. But even writing your own docs pages or putting materials together, a lot of what Devin does is go again, processing the code base and understanding this, references that, and here's what this does, and so on. And so it's a funny one in the sense that it's not strictly a writing code use case or it isn't always, but I think it's so closely related to it that a lot of the same capabilities are really valuable there. Number four, that I would say is incident response, actually. And so we have this set up so that whenever there's a crash, the first line defender, you know, on pagerduty basically is dev. And so Devin gets a page and Devin gets started, goes and you know, kind of runs a session. And obviously you probably want a human there too, you know, especially for these big incidents to make sure what's going on. But the nice thing is, you know, it's like 4:00am and you're kind of like half asleep and then you get to your computer and Devin has already written a report of like, hey, I looked at it. I think it was this change from like last week that happened or yesterday that happened. Here's exactly where the trace of the error Goes, we use that a lot. It's a huge lifesaver for us. Then number five, let's see. I would say adding testing is a big one for us. It's a very common thing where this is especially for individual engineers as they're going and working on things. You have your whole pr, you built things out, you built a new feature and always, you know, the last thing that you have to do before you ship it is you have to go and add your own unit tests and make sure your thing works right. And the nice thing again is like Devin will go and do that. It'll make the test and then it'll run the test locally itself and make sure those tests pass. And so we can iterate with you to make sure the LIN pass, make sure the CI passes and so on and just kind of like add those for you.

A (32:36)

Yeah, for sure. So I'm a big fan of voice. I actually think there are a lot of interesting we've played around with. We have Voice and Windsurf now actually as of wave 11 too, and partially because of that. But in short, part of the way I'm describing is like I think Google itself 25 years ago was basically a better encyclopedia. We have all sorts of things that you want to look up and pull together and so on. And basically it got you a faster answer and it got it to you with more up to date information of what was going on. And I almost think of ChatGPT voice as a better Google. You can get an even faster answer. It's fully synchronous, you can do it in the conversation and then obviously you have all of the detail. It can go and research and do these other things too. What I'll often do is if I'm in a meeting and we'll be talking about things, there are always questions that come up. Yesterday I was in a meeting and we were talking about this, which is, you know, there's so many orgs out there with tons of software engineers. And so we were kind of thinking like, yeah, like what are all the companies that have, let's say 10,000 plus software engineers, you know, and how many are there in the world? Right. You know, obviously like, you know, the big banks out there, tens of thousands of software engineers, the big tech companies, you know, those are the first couple, maybe the Accenture Infosys, you know, that category, those are the first ones that come to mind. But like what are all these different companies that have it? And you know, naturally in a meeting it's kind of rude to just go on your phone and just kind of like, you know, be totally unresponsive for like two minutes as you're looking. So instead what, what I'll often do is I'll just pull out ChatGPT and go on voice and it's basically like adding ChatGPT every conversation, you know. And so when I say, hey, like, you know, can you please like tell us like how many companies out there have 10,000 plus software engineers? Right? And then, you know, whether it's voice to voice or whether it's, you know, Voice and then you kind of get the response in text. Like I use both of those modes a lot, but I find it to be like a very natural, a natural stepping stone where I just find that voice lowers the friction even further in a way that actually really matters. Like, like I was going to say it's like, you know, in the encyclopedia area, right? If you were going to look something up, it took like, I don't know, five minutes or something. So you have to go pull the right like letter of the Alphabet or something and find this. And then Google got it for like 10 seconds, you know, and like voice is kind of like getting it from like 10 seconds down to like one or two seconds where you can just get on instantly and just say what you want to say. And that actually matters, I think, for, for, for being able to go back and forth or, or just like having, you know, very off the cusp, like off the cuff questions that you, that you want to ask.

A (36:14)

I really think of this in the future as we call it coding agent. And a lot of what this becomes is actually just the next generation of human computer interface. And the way that I like to say it is Tony Stark doesn't have a laptop. You don't need one. At some point, if you have your Jarvis plugged in and you're going back and forth with your agent and then go and do these things for you, and you can imagine that Builded software is just, you're not looking at your code, you're not looking, you know, you're just looking at your own product, right? And you're looking at your own product and you're saying, hey, let's make this button rounder. Look, let me add a new thing over here. Let's save this and you know, let's ask the user for this and that info, you know, and you're just making the changes in real time in your products and your agent obviously is going and implementing this for you. And so I think it's a, it's, it's certainly very agentic, but, but I think it's almost like we might, whether we call it an IDE or an agent or whatever, it really is basically just like a, a different human computer interface where you are just looking directly at the product rather than having to go through all your code or go through. And so I think that's the future version. Some years out, I think today, I would say, I think a lot of it depends on the cohort. And so I'm, for example, in meetings all the time. Unfortunately not that. But yeah, you know, and because of that, I actually think the slack agent workflow is a super supernatural one, you know, or, or like linear, for example, and tagging, you know, dev and from Linear, I think for an engineering ic, who's, who's, you know, gets to code for, for, you know, eight or ten hours a day, again, must be nice. But then the IDE is kind of the natural place where a lot of this starts, right? Which is, you know, you'll have these things that run in the background and you'll have these asynchronous processes that are going as you're doing your thing. But the natural place to get started for that is the IDE Today, I'd.