
Loading summary
A
Where should PMs actually be using AI tools in their workflow?
B
I think exposing users to context limits for the models or BS on the back end. You know, most of them are handling this better anyway and you can see that even in tools like Claude code, there's a much better way to handle it. I ended up going from main US back to GPT for agent mode and then immediately switching back. It's just infinitely better.
A
Mike Ball is leading product at David's Bridal, which is undergoing a massive digital transformation. So he has been testing out all the AI tools. I have been testing out all the AI tools. We, we have put our heads together in this episode to create the AI native PM operating system. We've been talking a lot about Claude. Claude, Claude.
B
Why not ChatGPT GPT with 4o? I think the last two model releases I felt like got more towards lazy and less useful. So I just don't spend as much time in there. I just feel like Claude's more reliable for me.
A
Wow.
B
If you're stuck behind corporate structure, enterprise tools into organizations, not really supporting, here's how you can kind of think about working around it with your own tools and projects.
A
That's a way better method than most people do. Before we go any further, do me a favor and check that you are subscribed on YouTube and following on Apple and Spotify podcasts. And if you want to get access to amazing AI tools, check out my bundle, where if you become an annual subscriber to my newsletter, you get a full year free of the paid plans of Mobin, Arise, Relay App, Dovetail, Linear Magic Patterns, Deep Sky, Reforge, Build, Descript and Speechify. So be sure to check that out@buildle.akashg.com and now into today's episode. Mike, welcome to the podcast.
B
Thanks for having me. Excited to be here.
A
What makes an AI native pm?
B
I think there was a term floating around for a while that I think was on the right track. I'll say think in prompts is kind of this like fundamental thing that sounded silly at the time, but when you start doing this day to day and you're actually integrating tools, I think it's how you end up working, right? Like, what is the thing I need to get done? What is the best way to do that thing? And if you're thinking about AI as like an extension of yourself, you're thinking like, what are the instructions, what are the steps? And if you're reflective like you probably do that to yourself at some level anyway, you have your inner Dialogue. Right. But I think that's the. The biggest thing is the AI native PMs are actually working from what do I need to do to what are the steps that I need to get done to what are the best tools to get me those things. And they're pushing themselves beyond that mental block that's like, oh, that's too technical. Oh, that sounds too hard. Or I'm not sure how to go about doing that. And they're working their way up to that, even if the thing's maybe outside their comfort level.
A
So can you show us what the best operating systems really look like?
B
Yeah, I mean, it should be a pretty familiar view for most people who are playing with AI right now, for example. So let me bring up my screen. We're going to jump around to some different tools, but the first thing we worked on together is kind of this map. Right. That I think takes this idea. And the more I thought about it, the more was kind of obvious to me that it's not necessarily a set of strict tools. Right. And that's why you mentioned operating system. We're actually operating with a layer of abstraction from UI. Like logging into 20 different tools for a bunch of different tasks throughout the day is a bunch of time wasting. That kind of happens that people aren't going to have a lot of patience for. So what I see more and more people doing, especially a lot of the thought leaders who are kind of teaching and educating others in our space right now you have your kind of centralized tools like cloud desktop or Cursor, and you're connecting those to everything else. You're either bringing things into it like contacts, data, whatever else. Or in some cases you're having it interact with the other tools that you maybe use, like a Jira or a GitHub. So maybe I'm in working on a PRD and I'm not sure if something got completed or if something actually shipped or if that was in the backlog. Instead of opening a tab and checking JIRA or looking through the prs, I can actually say, hey, can you check and see if this issue on this project was completed? I think it was, but I'm not sure off the top of my head or, hey, I know there was a PR waiting for this. Did that ever get pushed? We've been waiting for a bug fix in Prod or something. I don't have to leave the tool that I'm in right now to get an answer for that, which is nice. I can just kind of stay in my flow state where without Changing the UI for anything, which is pretty cool. There's other things that I think happen outside of the os, right. Like maybe some design concepts or brainstorming. Like, I'll use tools for research. And we were talking earlier about not all the research you get right away is something you want to bring into your actual project. So there's maybe an intentional layer of abstraction, but most of the work work I'm doing is happening in this middle area. We're between either Cursor or Cloud desktop.
A
Awesome. Can you demo for us how this all gets put together?
B
Yeah, for sure. So let's start with like a really simple, more technical task. So if you're not familiar with Cursor, there's kind of three areas to this. There's. This is collapse right now, but this is the cursor agent. You can kind of see conversations I've had about how to troubleshoot or different things that were happening in the past. You can collapse that down this middle area you have two things, which is your file that you're editing or looking at, which you don't actually even have to have up. Most of the time I don't. I have Cursor setting, which is telling me I have some kind of MCP configuration errors. And we'll talk about this. And then down here is Terminal, where I use to interact with Claude code. The left side is your file search. So again, I don't really actually use the IDE like most developers would for the files. I'm using Cursor to connect tools to either the agent that I showed on the right side, or I'm strictly using the terminal inside Cursor so that I can navigate between files and interact with cloud code. So, for example, this was a case where some of the content for all the wedding planning tasks and our thing was in a spreadsheet and it was driving me crazy. So I want to start moving it into actual content management system. So I can ask, like whenever I start a new session, for example, I can have something like most recent changes, Insanity Sanity is the cms. I have the MCP connected already.
A
And for people who don't know CMS is like a content management system, it's often like the back end for a blog or something like that.
B
Yep. And MCP is Model Context Protocol, which is letting you connect these apps and tools like Cloud Code or Cursor to the other tools that you're using without having to do most of the coding and integration yourself. So it's going to check. Basically, it has an API key That proves that I can access the information. It has kind of its own map to the API so it can go and find the information that I'm asking for. It's going to think for a second as it does this and try to get me an answer. So, well, this is giving me an answer about Sanity's actual acquisition and changes that they're doing. Let's ask about get history instead. You see this? You are helpful assistant integrated with Sanity through the model context protocol. I guess that's the initiation prompt whenever you mention mcp. Okay, so this is a little bit of like debugging behind the scenes for them. I suppose we can share this with them later and let them know. Okay, Okay. So we can start this back up again. So now that it's awake. So usually what I'll do is come in here and actually start the project. So it's connected to my Sanity Studio or whatever they call it. It's giving me the information, confirm it has access to these things. So it has a data set, production, project ID, yada yada. So it's looking, it found 10 total documents. It's checking within Sanity itself and it's showing me changes to the actual tasks that we have at our, our planning thing. So I could go and ask it, or I could ask it more questions about the task structure or things that we were working on, or I could go ahead and ask it to make changes like, say if I wanted to add something to it, like a task for car rentals or something like that for the honeymoon, I'll remember to undo this later. So instead of me having to go insanity and spinning up, it runs the studio locally. It's pretty technical to get into. And then going in and adding, adding new tasks manually, it's just going to do that for me. So creating wedding planning test titled rent a car for your honeymoon, it's creating that document, which means creating the content and then that goes into the tool. I don't have to go to the tool. So this is like not super exciting task, but if you're editing different things and managing them, you can interact with a lot of these tools that have MCPs or even APIs without having to leave your kind of home base. And that's conceptually how that part works.
A
Very cool. So this is the power of MCP connecting to whatever other tools you have and using the operating system as the home base to connect to things.
B
Yep. And that's, that's a really simple version of it. I've done this with Supabase for Structuring, you know, database schema or even doing migrations on the fly. I've done it with hosting a little app, like if you end up vibe coding something you want to launch live. You know, I was saying Render has a nice MCP. I didn't have to know anything about DevOps to actually get it up and hosted and get it running, which is pretty nice.
A
Very cool.
B
So if we go back here, that was kind of the, you know, let's connect to technical tools via MCP and bring that into Cursor. You can do the same thing in Cloud Desktop if you want, like what I just did in the ide. We could connect the Sanity MCP to Claude desktop. I don't have it on mine right now because it would be duplicative. But you can if you're more comfortable with this interface. Right. You can do the same kind of thing. You can go into Claude Settings should be here. You can go to Connectors, and then you can connect all these different tools they support out of the box. And then you can also go and configure your own if you want to as well, which just is a simple text file.
A
Okay, cool. Powerful. So in this episode we've been talking about how all these tools connect together. And one of the tools, the center of the stack for some of the best AI native PMs. Well, that's today's episode sponsor Linear. You just saw Mike pull from Jira and Confluence. But here's the thing. Linear has basically the best system of record for the top teams. There's a reason OpenAI, perplexity and visceral use it. Even Cursor uses it. Linear is built for speed and those teams need speed. And here's what makes Linear different. Is Linear built for agents? With Linear agents, you can call something like a Cursor agent to code for you or a cloud code agent to code for you simply on a task. This is a really cool workflow for AIPMs to actually make changes to production code. There's a reason already 60% of enterprise linear accounts are using these agents. I spent six months doing a deep dive on Linear. I talked to their CEO, their head of product, their head of design, their coo. And what amazed me is that they are operating like, like teams from the future, over a billion dollar valuation with just two PMs. And that's because Linear uses Linear to build Linear. And if you want to build like linear or like OpenAI or cursor, then you need to check out Linear app Akash. That's L I N E A R app slash A K A S H the coolest thing is that you get a free year of Linear's paid plan, a $600 value completely free for becoming an annual subscriber to my newsletter. So check it out. Either go get a discount link or become an anal subscriber and get a full year free of Linear. And now back into today's episode with Mike. And I noticed you had some custom instructions in your cloud. What did. What are your favorite custom instructions?
B
You know, at the project level, mostly what I do. So we just have a bunch of different initiatives. So I try to frame the context of the project rather than everything in the entire application because it's all related. In this case, like this is a my own account, so it has different projects in here. Like this is the wedding planning app, specifically Media Network is advertising and partnerships that we have across like E Comm and everything else. So having specifically this is the app we're talking about, this is the app we're working on, or this is the ecosystem we're talking about, and this is the specific lens we're working on, or David's Breadl I don't have one in there, but that's more of like overall architecture, tooling, internal things and initiatives. Right. So I like try to set that for the specific ones that need to be differentiated and then hope that it can maintain. But I also use like the memory MCP so that it has those relationships that it picks up over time to the different ideas. Because sometimes I'll have to say, hey, I need you to actually look across project knowledge. Or I'll need you to pull information from these two different projects to do what I need you to do right now too.
A
Oh, cool. Very cool.
B
And they're rolling out memory features and with Claude, which I think helps with that too. But having the memory MCP was the thing that's let me do that for the past six to nine months or however long I've been using it now.
A
And some people might be asking, we've been talking a lot about Claude. Claude, Claude, Why not?
B
Chatgpt, Anthropic ruled. I mean, they created the MCP framework and have supported it best. Claude Desktop had it out of the gate and I think supports it well. They have their own team that rolls out supported MCPs and things like that too. Open OpenAI has MCP support, I think via API and maybe they're rolling it out in the chat app now. I was pretty happy paying for both or at least using some GPT with 4.0, I think the last two model releases I felt like got more towards lazy and less useful. So I just don't spend as much time in there if they're still. I feel like they started rolling out connectors in the consumer app. Like if you go in via ui, you can do it. So you could do all this stuff there if you want to. I just moved away from, from that because I think the type of work that I do is much deeper and requires a little bit more like connecting and tooling and I just feel like Claude's more reliable for me.
A
Yeah, I think Claude is also just a better writer and PMs are doing a lot of writing, so generally you're going to prefer Claude, but If you have ChatGPT at your work, they're starting to roll this out. So you can do a similar sort of stack with ChatGPT at the center.
B
I have a post somewhere about a deep dive on MCP on the my newsletter and there's a handful of like desktop apps and open source projects that you can run that kind of act as a gateway. So they'll let you connect mcps and then they'll connect to the. Whichever models so you can jump between them. I have one called, I think it's Fire, but it starts with a five that I've been playing with that is pretty cool. But it's like. It's like a project to get set up and going. But if you're. If you don't have the tools you want or they don't have the capabilities you want, you can actually kind of like hotwire mcps in a weird way, which is interesting.
A
Cool. What's next?
B
Let's. Let's look at Design Concepts. Because you and I were doing this earlier and I thought this is a simple but pretty interesting use case when we were trying to conceptualize this. So I opened up this, which is similar to what we just were showing on the other screen in Canva. Right. And the bad. The bad thing about this is it's flat, so it's just an image. We can't edit it or tweak it without a lot of prompting and back and forth. So what I did. And I do this with some product design work and development too. So I just dropped it into figma and I'll zoom in on this so we can see it a little bit better. I just did a blank file for this. But what I'm going to do is if I right click this frame, you get this send to option and you can go to Figma make and I'm sure you can just go to Figma make and upload that too. But the cool thing about doing it that way is you can do this with any of the designs that you're working on with your team anyway.
A
Yeah.
B
So it's going to send that and I think it picks up. In this case, there's not very many layers, but if you're sending a design file that has layers, it's also going to pick that up and make it better.
A
Help me make this an interactive visual
B
that I can edit and rearrange. Let's say that's vague enough. I don't know what it'll give me, but let's just see what it does because I don't want it to work for 20 minutes while it's building this out. So I don't really trust Figma make to code anything useful. I haven't seen it functionally finish anything to the level that I'm like, oh yeah, I can move this into GitHub and work from here so far. But I think it has a unique advantage in that if you're working on your product files that are structured in your design system, it's easy to move them in here and you can visualize like maybe a specific flow for a user, a variation or a new feature and see how you might integrate that and then bring that back to the designer because they're probably working on other things anyway and be like, hey, conceptually, like functionally, this is what I was going for. What do you think? And when you bring it back, I'll show you, you just copy it and it's fully layered and everything. So they can take the different pieces that they want from that and just add it into the design, create the new model, create the different screen or whatever and bring it back into the same file. So it's a big time saver for designers.
A
So if you're thinking about which AI prototyping tool is sort of most designer friendly, Figma make is up there. Is that the AI prototyping tool you generally recommend?
B
I don't even use it for prototyping. I think I use it mostly for design variation, like if in my head. And maybe you'll relate to this as a pm. If I'm like, oh, we really need to see this edge case, this error handling this state for this functionally. And I also am asking the designer or that design team to start working on the next thing because we're trying to move pretty quickly. But I want to have it for the developers. I'll actually take the design we have and throw it into here and then make it. Usually it's interactive at some level. Right. So what you can do in here, which is nice, is you can edit these and then you can change the state of it. So maybe I say I want drag and drop. I don't know which will give me for this. This is kind of a weird use case. Right. But it abstracted the visuals and things that we had, but it gave us something. We can at least play with it and then if editable.
A
But it's not nearly as nice as what we had out of.
B
Right. It will probably. There's a little hub icons in here. Maybe it's still loading some of this in. But let's see what it does while it's doing that. Say the nice thing about it is you get different states. So say we have like an edit state right now we have this drag and drop. I don't know what it's doing. It's just letting me move it, which is kind of weird. So I'm not super happy with this, but just for the sake of time, let's say like I'm going to copy this design, I'm going to go back to my Figma file, I'll zoom out a little bit and I can paste this in here. Now I said, the cool thing is now we can actually open this up and we have the entire diagram in Canvas with the containers and all these different elements. Yeah, all the text bubbles and everything. So this, what you were doing in Canva was trying to create all these manually. Right. But if we wanted to go in here and say, like, change any of these or how we organize them or whatever, it's actually easier doing it in here than it would be on. Oh, there I have the edge of it in Figma make or even in Canva from scratch. Right. So you get the different UI elements, you get the states that they're in or whatever. So it's just kind of a nice, like, shortcut to get the, the state, like the visual state that you want without having to sit down and be like, here's 20 screens I need you to work on for this week. That are all edge cases, the functionality we already agreed on kind of thing.
A
Yep. And what do you recommend for AI prototyping if not Figma Make?
B
So personally, I think the last three months I've been blown away by AI Studio and what I mean by this. This is like Google's DeepMind developer focused or AI developer focused team has AI Studio as like a play space for developers, right? And they've done a good job of also not alienating non developers. Like they're really encouraging people to get in here. But this is where you can test the newest models. You can get like API key really quickly and for free to start playing around with some of these things. So if you're like, oh man, they just made a huge announcement, I want to go play with this thing. This is kind of the ideal place to go. On a personal note, I also feel like in terms of how they manage like chat context and memory and the actual experience, this is infinitely better than Gemini the app because I'll go in there and ask for an image and then if I ask for a variation of the image, it'll just give me the same thing again. Like it doesn't know that I'm asking like reference the first thing and give me this edit to it. And there's a lot of little contextual things like that in the Gemini app that I get frustrated with that don't exist here. I think the developer team just built a better experience for developers than the Gemini app did for consumers. So if you're in AI studio and you go to build, you can one shot especially they have a bunch of kind of pre selected functionality or example apps. But we did. My team was asking me to like prompt and create a bunch of like we had some dresses that were on sale and come up with Halloween outfits for example. And instead of me manually doing that and trying to understand all the different like dress subtleties and stuff, which I'm not great at, I was like can we just spin up this app real quick, right? And let them. We didn't. It's not like a consumer facing app in this case, right? It was just like can we create a costume from this? And I think it should run. I haven't used it for like enriched the west. Okay. So I didn't connect this to my API queue something like this, but it does by the update it or something. But we just use this to generate some marketing images that we launched. So it'll take the dress image, it'll take the prompt and it'll create like a costume image kind of mock up for marketing in this case, which I did this in literally 10 minutes because somebody was messaging me in Slack like hey, this is my prompt. Here's an image I'm trying to go for but I keep running into X issue. I'm like, let me just put it in here quick. And it Was it was usable in 10 minutes and then I probably broke it afterwards tweaking it or something.
A
Got it. So Google AI Studio for prototyping, how does that fit into the overall operating system?
B
So what I'll. What I tend to do is go into, build and work on some of these like rough concepts. Here's just like a general notebook type app I was playing around with, right? Of like it's Infinite Canvas, but it looks like a notebook. You can do something. I don't even remember why I did this at the time. What you can do once you get to a point where like, okay, I want to play with this more. I'll either push it to GitHub or I'll deploy it to Cloud Run, or both in this case. So I'll download the zip, push it to GitHub and then I'll switch to Cursor and just open it up and start working inside Cursor to edit. From then on, get the local setup and start running it. And then that way I have more of my normal workflow of like I check my browser and see that app running locally. I can kind of experience it and then I go back in here and give it feedback or start describing the change that I want to make to it or whatever integrations that I want to do. And then it's just that back and forth. More like a true developer workflow at that point. But it has to be at a certain level of quality before I pull it into like my main operating system spaces.
A
Makes sense. What's next in the operating system?
B
Let's see. I think one of my favorites has been Progress and Knowledge. We'll combine the two in this case. So I'm just going to go to a new chat inside one of my projects. I'm just going to ask like, what do my confluence say about the vision board, which is just like a feature that we launched a while ago in the mvp. So this is going to use the Atlassian mcp, which uses that new ROVO AI search and it's going to go through and check to see specifically. I'm hoping it's smart enough to know I'm in the planner. And so it should look in that space and Confluence and find my doc. It's going to check for my requirements documents and then it's going to give me some kind of summary for this. And then what I'll do actually is grab a Figma link at the same side or the same time and I'm going to show you it should use the Figma MCP and I'm going to ask it to compare.
A
This is some seriously connected workflows. I haven't been working in this connected of a way and now I'm really seeing the benefit of the operating system concept.
B
Yeah, I think some of the trade off, right. Like you were watching the screen as it was thinking and finding this information and spilling that in for me, because this is something I've already written, right? Like it already exists. I just didn't want to open Confluence and try to find that page and copy and paste the specifics. But then what I'm going to do on my other screen while that was loading is go back to my MVP designs and I'm going to grab just the link. So the figma MCP is kind of weird. It uses a lot of context and usually maxes out Claude, which is a complaint I have. But if you give it a specific URL to a frame. So if you go into figma and just kind of what I did earlier, but instead of saying send to make you just grab a. A specific share URL, you can go to share and copy URL or copy as they changed it. Now, I don't know why they did that, but copy link to selection. So I'm going to ask, I'm just going to ask it if I'm missing anything or if we missed anything from the MVP design comparison. And this is probably stuff I've already done, right? Like I've already gone through and I'm like, yeah, we're, we're not going to do this. We are going to do this, whatever. But yeah, but it's going to go to that specific frame. And I think the way the figma MCP typically works is it'll grab a screenshot of it instead of trying to pull in all that information and if you saw it, ask me to allow it. So depending on what mcps you set up, you can also set them so that they ask for permission every time they do something, or they don't ask for permission to find content or read, but they do ask for permission before they edit anything or push any changes, which is how I have it set up for the most part.
A
Nice. So you're using the FIGMA mcp, it sounds like very regularly.
B
It depends on. I mean, if you think about product life cycle, right? Like if we're working on something new or we're working on our version 2.0 set, there's kind of different waves where I'll get in FIGMA more. I spend a lot of time In Figma still too, working with design on getting those things done. And then when I'm working on like refining my requirements before we get to the dev team or when we're working on first like pie in the sky designs, I'll definitely be pulling that into Claude Desktop probably more often.
A
Got it.
B
And that's what I think is nice about the operating system approach is your tool stack can be very composable based on what you need. And so the whole pricing strategy Thinging from a SaaS standpoint is tricky. Right. Because it might be a. I'm going to pay for what I use as we if more people move in this direction. That's kind of like a developer API usage based billing for a lot of these services and it might be very fluid over time. Right. Like it might be a specific use case where I need to use. I have like a marketing type app that we're spinning up that uses Supabase and it uses Google Cloud and it uses something else or it might be something internally that has a totally different stack and I might need to be able to fluidly jump between them.
A
Makes sense.
B
So here it's giving me a pretty good gap analysis. The layout, the icons and everything. Wedding style doesn't display, which is true. We actually don't display wedding style from the onboarding process on the vision board at all. The documents say show bridesmaid only, which is true. I think we had wedding party versus we shifted to bridesmaid afterwards. So it's catching pretty specific things that are like little discrepancies in design. Right.
A
So it's actually huge lifesavers. I mean normally you're like pulling up the prd, you're looking at the design
B
or you just miss this altogether and second guessing yourself. And then like in this case I'm still going to second guess myself and this and like double check it. But at least I'll have some context and like idea of where to point my time. And if people are like you can't trust it. I think like I've done both. Right. Like we've been around long enough to have done both manually and like I trust this as much as I trust myself going through a massive list and comparing every single module and things side by side. Right. Like something might get missed or something might not get missed. And it just like it's nice to have a second brain to rely on to gut check yourself.
A
Yes. That's my favorite thing about AI is it's like my second brain. Sometimes I wouldn't have even asked a person for it. But I don't feel as bad asking AI. This episode is brought to you by Linear. You know what broken about product development right now, it's not the coding. We've got AI agents for that. It's everything else. The planning, the feedback, synthesis, the endless context switching between tools. You're drowning in PM busy work while the actual product work suffers. Linear started as that issue tracker that engineers actually want to use, which, let's be honest, is pretty rare. But it's evolved into something way more powerful. Your entire product development hub. Their new AI features are a game changer for PM's product intelligence automatically surfaces insights from customer feedback and support tickets. No more manually combing through hundreds of reports and tickets. And their agents will help draft PRDs, scope projects and handle all those status updates nobody wants to take care of. Product teams from OpenAI to Vercel use Linear to build complex products at speed. There's a reason they call it magical. See for yourself at linear app/akash. That's L, I, N, E, A R A P, P slash, A K A S, H. Yeah.
B
So we talked a little bit about. So we have prototyping, we have kind of knowledge reference. Right. Like pulling from Confluence and then gut checking it against Figma. We had an example where we took like a generated image and created a prototype and pulled it into Figma so you could play around with different states. We haven't talked about research yet, which we probably should actually done first, but we can do that next. Does that sound good?
A
What do you use research for? Is this setting up your project context or what are the right points in the lifecycle to be using this?
B
Yeah. So when I say research, I think ultimately what I tend to mean is context gathering. So if there's. There's a couple different ways I usually go about this and I would say I pair. I used to use Perplexity a lot more because I think the good brain exercises, what questions actually matter to answer when you're working on something new. So I'm not logged into that one right now, but everybody knows Perplexity kind of works as you ask a question, you get some answers, right?
A
Yep.
B
So you might ask something like the use case that I think I had up here right now is this one that I used to test reforge build when it was in beta mode. It was app that maps out paranormal sightings. My kids are into this podcast and stuff like this. But functionally it's interesting. Right? Because it was a community app. So it's like People need to be able to post sightings. There needs to be like a governance or like an workflow, like a queue where you can review or flag if it seems like it's spam or something like that is you need to be able to plot these things out on a map. And then there was a time dimension of like how did they play out over time and being able to filter that. So like functionally from an app standpoint it sounds really silly, but there's a lot of really cool interactions that are there that like if you're testing a vibe coding tool, this kind of is a good way to stress test it.
A
Yep.
B
So in this case, right, like I was like, well, I don't, I know what my kids tell me about this stuff. But I went through and I was like, this is all I gave it. Manus is fantastic at this, by the way. I don't feel like with Agent mode on GPT or any of the other tools, like even researched and Claude, it tends to max out pretty quickly and burn through a lot of your usage. So this is nice where it'll just run independently and let you know when it's done. And it gives you its entire trace of everything it did. Right. So all these different sources and then when I say context gathering or research, what I also love about this is I can ask for specific deliverables and it's also going to show me. So it did like a sample set CSV. I combined CSV with different sites. It gave a data sources report in a markdown file so I know where that data came from. It gave a quick start guide to be able to use it. And then it's giving me like a more human summary. But I can access all these files that it gave me. So even if I really just wanted one of these deliverables, I can access every piece of them independently and use them elsewhere, which I really like about this tool. So maybe it gave me a specific set that I thought was more useful than that. So what I think I actually did in this case when I was doing this is I pulled this information in and then I gave all of it like the raw information from that to Claude and I asked it to give me so I could test these tools, product requirements, user research and Personas for it and then like a technical strategy or approach for it. So I have these three different artifacts now in this kind of like joke project and I can pull these down and that's how I typically would go into like AI Studio like we talked about before, and then Give them, give it those files as the requirements to build something new. I did that for Reforge and it did a great job with the design. Or you can take those files and then work on actual requirements and refine them and then even build them from. If you wanted to go straight to cursor, you could. Right. Or go into Figma and like try to work on some design concepts for it or something from that make screen. It just depends on how you want to go about doing it. This is a specific for me to test like Vibe coding apps. So that's what I would reforge, build,
A
fall on your top three AI prototyping apps.
B
Then in terms of like where it got to from, hey, is this, is this functional? Does it match my requirements and is actually like from a product taste standpoint, does it match like some of the details I put in here about ux? I think it did a good job. I think Studio tends to be still where I go to, to get it to a functional point. Fast, easy, reliably. And then I can take that code and run with it or I can host it there. And then I would say I, I use repl.it for a long time, probably early on before they dealt with a lot of issues. And I just got really frustrated with that agent lying to me about like, oh, I fixed this, or hey, I implemented this new thing and I spent more of my time being like, why did you implement a new thing? We had that working yesterday. And then you go back into the code and you're like, we were using Python and now we're using Python and there's a node implementation that are fighting each other in the things broken or something. So I just, I, I think like, you can use those tools to get a concept up and running, but I feel like they're designed to make you feel like you're, you're progressing when maybe you aren't. And so if you're more technical or more capable, maybe it's better to jump into something that lets you transition from concept to code a little bit better.
A
Yeah. So Manus over chatgpt, Agent mode, Reforge build in Google AI Studio over something like a replet.
B
Yep. And this is like even stuff for date nights, right? Like, you know, here's what we're into, here's the weather, here's where we live. Like, I want something within an hour drive out, hour drive back, something like that. So we're not gone too long. Like you can give it all these things and it'll just run and bring back all the research and different options and things like that too. So the, the big reasons I jump to this are A, it's just good at running independently and being thorough, and B, being able to get multiple assets right. Like, this is a lot of different pieces to prove that it did the thinking right. Like, here's research on the OpenAI SDK that they rolled out to build an app inside ChatGPT. And I said, make sure you review this documentation for the technical approach or whatever.
A
And it.
B
And it does it. Like, this is capable of this. It's not capable of this. It has the details, but you can trace it back, which I really like.
A
Very cool. When do you use Manus vs Claude regular?
B
I. I still feel like my frustration with Claude is chat length limits and then usage limits, even though, like, for work, I'm on the max plan, right. 200 bucks a month or whatever, and I don't really hit those as much anymore. But if you actually turn on research mode, I feel like it just runs and runs and it doesn't do a great job of showing its work or its thinking like, Menace does. So if I have, like, something quick I'm working on in here, I don't even know what that would be. Off the top of my head, I think I've used research, like twice in the last two weeks, and maybe it was more technical or something by default, or it was some kind of weird offshoot, like a rabbit hole kind of thing that came up. No, that was figma. So, yeah, I. I don't really use research in here very much, to be honest with you, because I feel like it just burns through. Yeah. Research. And that way. And the other reason I do that is because if I. If Menace gets something wrong and I don't agree with it, or it finds information that I don't like, I can pick and choose, right? Like, here's prep for this podcast and I gave it kind of the notion content and was like, go through blog posts. Or here's what I'm thinking about this. Like, how should we position it? Or how are we thinking about it? Like, I had some decent ideas in here, right? And this is just like my notes. But I can pick and choose what I actually bring into my core operating system so that it doesn't give it the wrong idea or it doesn't start. Because the bad thing about a lot of the LLMs is they'll pick and choose what's in the memory to really anchor themselves to, and then they become like, common assumptions or common Beliefs of like, everything it responds to is based on a common belief that you gave it. So if you're not careful about what you feed it, I feel like you end up with the equivalent of a conspiracy theorist like LLM Partner who is like running with random ideas that maybe aren't as important as some of the core information you gave it.
A
Cool. What else should we understand from the operating system perspective?
B
I think conceptually, if you hear the word composable in like technical architecture a lot or like Martech, where you can kind of. And it's kind of like modularity with furniture or whatever, right? You can kind of pick and choose how you want it laid out or how it's configured to meet your needs at that time. So like these are all different kind of use cases and things that you deal with throughout your given week or month or whatever. I think there's also things that come up that are, you know, I have a specific project or specific ask that has unique requirements. I was doing something in the domain space tied to this next round of applications for new TLD strings or top level domains, which would be like a link, for example, or.com or whatever. And when you apply for these, there's a huge cost to apply for them and there's a really big list of like evaluation criteria that goes into it. So it was like, is there a patent or a trademark already out on the string that you're applying for? Is there like a cultural sensitivity or a significant geographic region that might have staked to that? First, are there any like language conflicts where in one language it sounds fine, a different one it's offensive. And so there's this long list. And for that it was like, you know, I wouldn't say that the API for the patent database is something I'm going to use again, right? But at the time I can go get that, I can connect that via cloud code, for example, to the application that I'm working on or the process, the agent that I'm trying to build. And that's like a one time thing. Now that wasn't even an MCP in that case, I just had to go get the API key to be able to use it. But it's the same process either way. Like with the mcp you configure it and you put in your API key. With this one you give the API key to your coding agent or cloud code and it handles that part of it for you. So I'd say like as a big takeaway, there's lots of different things you get asked to do in any given day or week. And like the right set of tools is not constrained to the ones that you can log into and use. Right. Like there's lots of different solutions out there for different things that are sometimes free. Right. Like very generous free tier plans on APIs or things that don't cost money at all that can like make you more effective or make the solution that you're working on more valuable. That makes sense.
A
Yeah, 100%. I think this is an incredibly powerful way to work. If we go back to the Nano banana image, I think there was also something you had put in there around communications and email. How do we handle that side of it in the operating system?
B
Yeah, I think so. There's. There's MCPS for Gmail, there's MCPS for Slack, there's MCPS for Calendar connector. I actually think even in Claude, if we go to the browser settings, maybe there's native connectors just for this. Yeah. So they do this via cloud right now. So this is my other account if I want to connect my calendar or something and ask it what I have coming up this week, or if I want to connect my Gmail and ask it to look up, like when did we schedule the podcast interview with Akash? Right. It can do all those things for you. You just have to give it, give it access to be able to do that. We can even do this right now.
A
I've never used this use case and I consider myself a cloud super user. I'm having so many epiphanies in this episode.
B
Well, I have a hard time thinking of a lot of the automation like N8N or even Zapier sometimes. Like what do I do? Often enough that I feel like there's a direct use case. Right. That I'd want to configure something or I'd want to be able to do something. So then let's go back to chats and see. I'm trying to think of what I would even have in drive right now. Oh, this is searching chat.
A
Sorry, I was going to say you can do that too.
B
I didn't know I. You can. You had to do it from an actual chat.
A
I see you've chose Sonnet 4.5. So kind of like the right mix of burning tokens, but effective, not opus.
B
Yeah, I think it. I think it switched. I was happy with four, honestly. I think four. Five is like a slight lift, but whenever it updated it switched to4.5 or I switched to4.5 when it was an option. I just kept it there because I think it consistently does a good job and maybe it's like slightly noticeably better. I don't think it's significantly better. Right. But so for this one, for example. So the connector, it'll let you add manually. Like if I want to look for something like this and you can pull it in. But I think the connector should also search my Google Drive for me, unless that needs to be configured separately as an mcp. And they didn't, they didn't have MCP in the browser version before and they only had it in the desktop. But I feel like, okay, so it's searching drive for me. That's cool.
A
Wow. Drive is so hard to search. Like I. Even though Google's a search company, I don't think it searches that good. So being able to use Claude is a huge win.
B
So this is the most recent one. This is what I talked about with the super guys about like planning that vacation and when the memory MCP is like ended up being really useful to me because it was like, oh, your five year old likes crystals. There's a place you can go mine for crystals. Here is like 30 bucks for a bucket and it's in between your stay in Lake Tahoe so you don't have to drive four hours straight. Like you can break it up and get some extra exercise kind of thing. But yeah, it can get all this context. You can view the full document or pull it in or whatever. So it's pretty easy to use the like they are MCPs. They're calling them connectors because I think MCP is probably intimidating for less technical people. But they finally made the update. At first it was just on desktop and you had to configure it there. Now they're running it in browser and cloud, which is nice.
A
So let's say you weren't on the Google stack. You were stuck in Microsoft. Could you do this there too?
B
See, I feel like they do have an Outlook collector or connector. Oh, I broke it. Leave it to me. I have.
A
That is like a model launch yesterday. Maybe they're a little bit bland.
B
Let's see if desktop is broken too. Yep. Okay. You're supposed to be able to get there from, from the actual app.
A
Yeah, there it is.
B
Yeah. So they've got Notion Figma. Let's see. Microsoft Ms. Learn Docs. No Outlook as far as I can see. But I don't know if that's like a common one for cursor clarity. No, Microsoft is a little bit behind on getting this stuff out the door. You Got Vercel post Hog Netlify all these specific ones you can connect to ClickUp. Chrome Dev Tools is sweet actually, especially now that the cursor browser or the cursor agent has browser use. So it can go in there, it can pull DevTools report and it can actually like test new functionality that you're working on in your local instance. Not that I totally trust it blindly, right, for testing anything, but it's a nice extra step instead of like go to the browser, copy the console, error paste it back in. But yeah, this, I mean this is a long list of all sorts of different tools. Like Fire Crawl is a developer tool, but I've used it to pull data like shape data and shape contacts for different tools and stuff too. Even pulling like our color swatches down. Honestly, it's just easier amplitude. If you have product data in there, you can pull that in and kind of like. Because sometimes even if I'm in a coding space, I'll think in that coding space, right? Like what does the data say? And you just shift cloud code to plan mode instead or whatever the case is.
A
Awesome. So this is the operating system for AI native PMs. One thing that sticks out to me is this is a lot of AI tool licenses. A lot of it requests a lot of money to be spending. Can you just walk through what are the different licenses that teams should be getting access to? Their teams or product leaders should be getting access for their PMs here?
B
Yeah, I think the big ones, right, like are what level of access you give to. So and depending on how technical your PMs are. Right, like maybe Jira is enough or if you're in linear or whatever the case is. But I think like read access for AI tools versus Write if you're conservative. I think at a personal level, some of these are free, right? And some of them are being done separately, like using the Gemini app. I use that through my personal email, which is on workspace and I play around with that. Or AI Studio is free with any Google account. And as long as your team's aware of like, hey, let's not take a bunch of IP or like private information or product specific things that are internal and bring it in there. Like you can still conceptually get a lot done outside, right? Like if you're just trying to come up with ideas and then bring them into your tools, like Figma for example, or set up your own sample app, that's like an example or use case for what you're trying to do before you spin it up internally or have a developer start working on it, or even start building up your own private repo for your own internal apps kind of thing. So I think that's, that's how I tend to navigate. It is like 90% of some of this thought and planning work doesn't actually need a ton of private context or private data or company information. But if you're actually working out of this day to day, I think your core tools. So if you have Jira, you have Confluence or you have Notion and you have Linear or whatever the case is, the like easy first step for PMs to start working this way is just connect it to where the actual knowledge they care about lives. Like what are my requirements? How can I get a pulse on dev progress? You know, like little things like that.
A
Okay, so to summarize the licenses you should be getting for your team, sounds like we can do the 20amonth Manus plan, probably the hundred dollar or 200amonth Claude plan, potentially the free version of cursor. Is that right?
B
Yeah, yeah, I think you can if, especially if you're paying higher end for Claude. I don't think you actually need the cursor agent if you're not doing actual dev work. And even then you can use cloud code for most of it. And I'll say for the, the Claude paid plan, the way that we're handling this internally, even not just in product, is like, why don't you start and show me that you're using it right? Like that you're getting some value out of it on a $20 a month plan or whatever the case is. And then if you, if you show me you're doing it and it's helping you and you're leveraging it in the way that's like moving things forward, happy to bump it up. And then if you start hitting limits on that, you're actually using it like we can bump it up again. And that way you're not just giving Everybody a blanket $200 a month plan that can add up pretty quickly if they're not really using it, but you have them like building habits that are using it productively. And then it's 200 bucks a month is nothing right. Compared to how many hours of work it's saving them at that point.
A
Nice. And then they might need access to the Pro plan of Google AI Studio and Nano Banana Pro if they're doing a lot of generations. So potentially $20 a month. So the minimum viable AI native PM stack, what I'm hearing, $20 a month for Claude, $20 a month for Manus, $20 a month for the Gemini package. That's basically it, right?
B
Yeah. And I would even argue at some level like if you're working with it, right? Like you can everything in AI Studio. And that's part of the reason I like it too. If you're just connected to the cloud account and you can say like, use this API key. So like your IT team could go in there and spin up a project and give an API key that specifically has access for usage based billing and give you permission and then any of the stuff you do in there can just be billed to that and then it is usage based and you're getting like wholesale rates essentially based on usage instead of paying like another software subscription. Right. So like if you want Gemini in the app you can. But I like, at this point I almost prefer like, oh, I have an API key used specifically for testing, right? Like yeah, and nanobanana 2.5 is like 4 cents an image. So I could go crazy someday for like a week or whatever and spend 20 bucks and then I might not use it again for three weeks and then I'm not on the hook for another subscription or three months and I'm not paying a monthly subscription for nothing. Right. I think like that's probably the way a lot of this will move. You just have to have the connector to your core, core tools where your teams are working to be able to move that knowledge in and out.
A
Sweet. So let's say you're a PM and you're stuck where you don't have access to cursor, you don't have access to Claude or Manus. How do you go make that case to product leadership and to IT to get access?
B
Yeah, I mean I'm pretty bullish at this point. I think like everybody in leadership, executive level, I mean I work at a. So David's is a 75 year old wedding dress retailer. Okay. And so we've got a lot of old beliefs and processes and things, we've got a lot of old tools and everything like that. But when I talk to leadership and I say like, hey, this is what my team has been able to get done at this velocity and this is like our team is one quarter of the size of most other teams that are working on this right now. Right, here's how we're doing it. Or if I say, here's what this investment means and ultimately like what the unlock is for us to be able to get done this year, you know, you have These different pain points, these different problems. And if you can show and build the trust of like, like shipping. I don't know anything about shipping and logistics and things like that. Right. But I took some of the data and reporting and things like that. I'm like, here's three different ideas that might be causing us excessive overhead on some of this. And they're like, oh, yeah, we hadn't thought about problem B. Like, okay, that took me 10 minutes. So imagine what a team of me could do if they're just asking smart questions. They tend not to push back on that if they do. Like, this is going to sound a little harsh, but I would be fairly concerned about any organization pushing back on people trying to use AI to be more productive and more impactful.
A
Yeah, sadly, a lot of PMs I talk to are in that world. They're living, you know, whether it's automotive or finance or healthcare, where they seem to be living like five years in the past. But I think what you said there of maybe you create like a personal use case or you show the personal productivity or you show how it can get you more velocity and maybe you chip away like one tool at a time. Like, maybe you don't hit the whole operating system right now. I think in particular, like, Manus might be one that you could potentially wait on, but you really can't wait on Claude. Right. So you start with, okay, I need Claude. And then you can kind of move your way up. And if you're stuck on a particular stack, like a lot of people I talk to, it's just that they have access to ChatGPT. Then I think what you do is you say, okay, let me try to do the best I can with ChatGPT and then let me show them with my personal work how Claude can upgrade that and then bring that to them. Does that sound right?
B
Yeah, yeah. I think it's like, show that you're getting value from the technology first. Right? Like, get buy in on the fact that there's an opportunity there to move faster, increase your impact overall to solve hard problems that maybe you weren't able to solve or you have different constraints around or whatever. I think like organizational buy in on the value of the technology is step one. Like, because a lot of people are stuck in. Yeah, it seems like a cool toy, but how are we going to use it on the business level? We have lots of other things that are more important right now. Like there's always lots of things if you show like, oh, it accelerates us. Oh, it helps Us tackle these hard things or, oh, it's unlocking talent. Right. You have people sitting there who are capable of more, and you're just giving them, like, menial jobs or kind of like a lot of overhead work. Right. And they're able to work around that, that there's immediate value. So from there you can say, like, okay, from this tool, here's what I can do, here's what I've been able to do, and here's where I'm hitting a wall. Here's what I could do if I had X tool instead of Y tool. And I think, like, at least if you have that case built up underneath, right. It's easy to justify. And it's not too different from like a feature roadmap or like a product roadmap or a specific feature you're trying to pitch. Right. Like, did the user see value in the first thing that you ship? Like, what issues are they running into that we need to help them navigate? Like, what are our goals that we're not hitting that we need to move past? Or what is the unlock for this? So, like, that's muscle memory that all PMs have. I think the pitch is ultimately the same. It's just like me getting things done instead of users having this experience or us hitting this metric. Right.
A
Amazing. And then finally, before we go, can you just walk through, like, the life cycle of PM and when they should be using these various tools?
B
Yeah, I mean, I'm in the weird, you know, early extreme, early adopters and testing and playing with this stuff. My default is you should be using them the entire life cycle. I think there's a huge value or there's a ton of value in your upfront research and validation. And that's everything from, like, what we showed earlier about pulling a bunch of data in. Right. Like, what about this consumer segment versus this one? What about this use case? Is it as relevant to our target audience as it might be to this other audience? That's kind of tangential. Getting actual numbers and finding what is or isn't available, what competitors already in the space, who's targeting? Right. Like, based on the competitors that are here and their recent marketing and the recent releases, who's. Who is on a trajectory to go after the same thing that we're targeting. Like, who's going to be neck and neck with us or even, like, okay, I've done my research, I've talked to customers. Here's scripts from interviews. Here's like. Here's like, our whole justification. Tear this thing apart. Like red team this thing. Tell me, tell me what I'm missing. Be a skeptic, tell me what, what you think is wrong about it. Like give me every possible angle so that I'm prepared to defend it and I actually believe it at the end of the day, right? And I think like up front, everything. And then throughout the process it's every interaction, right? Like, am I checking my design to make sure it meets requirements like we talked about? Am I using it to shape the actual tickets that I'm putting into JIRA or Linear or whatever, to make sure they're very clear and very specific? Did I reference the GitHub repo for the code base to actually make sure that the ticket wasn't conflicting with how the thing is currently built? If it's like an add on feature or something, there's a lot of little things you can do that were not expected before, but are very easy to do now that add a lot of clarity and value for the entire team. So I think like lifecycle wise, even if you're branching into like prototyping and things like that, I don't see a place where AI actually hurts unless you don't know what you're doing with it.
A
Amazing. And along this lifecycle, what are the biggest mistakes PMs make with AI tools?
B
Garbage in, garbage out still resonates with me, I think. Over prompting is one that I think backfires often. Really deep, really structured prompts that are meant to be reused and systematic, like a copy paste, without understanding the subtleties of every time, like the things that you're doing differently. Unless it really is just a repetitive task like generating requirements, I don't feel like I need a prompt structure to do that because it depends on what I'm starting with or where I'm at or what stage of thinking I'm in. I think the data you pull, for example, I think laziness with AI, like you put a prompt in Tool A, like Manus, you downloaded the files and you uploaded them without reading them in, say like a Gemini and you prototype something out. Like you're probably going to get something that's maybe along the lines of what you're thinking, but not very intentional, where you're kind of proving that that level of product management isn't valuable anymore. Right? Like, taste is incredibly important. I think intuition is incredibly important. Being able to be like, being able to look at something and say, I did this for x reason is 10x more valuable than like, I put this dog together. So being able to defend it and being really intentional about the end state of it, I think is some people tend to let go of that too early on. Especially more people like that are junior that I've worked with tend to be like, hey, I put it in here like you said. Like, I used AI. I put this doc together and then I read it. Like, but it doesn't make any sense. Like, yeah, you probably, you probably did use it, but like, none of this represents the information that we have or the interviews that we sat in. Like, did you use any of those things or not? Right. And they just don't gut check it or they don't question it. Like, you have to be kind of skeptical with all of it, I think.
A
Amazing. This has been a masterclass. Truly one of the episodes that I've had the most epiphanies on in terms of the gaps I have in how I'm using AI. All of the connectors that you talked about. This concept of an operating system is an incredibly powerful concept. So if you are a product leader, start to think about, am I giving my team the access to the right tools? Have I taught them about this operating system concept? Should I go through a training on it? And then if you are a pm, are you using enough connectors? Are you building an operating system? Do you have Claude skills that are helping you? Do you have quad projects that have the context? And then are you connecting it to all these tools? This is how you actually get the most out of AI. Nowadays, most of you guys are being evaluated in your performance reviews on how you used AI. This episode had just given you a bunch of keys to get a better rating on that. Mike, thank you so, so much for dropping the sauce.
B
No problem. Thanks for having me.
A
All right, bye everyone.
B
Take care.
A
I hope you enjoyed that episode. If you could take a moment to double check that you have followed on Apple and Spotify podcasts, subscribed on YouTube, left a rating or review on Apple or Spotify, and commented on YouTube. All these things will help the algorithm distribute the show to more and more people. As we distribute the show to more people, we can grow the show, improve the quality of the content in the production to get you better insights to stay ahead in your career. Finally, do check out my bundle@bundle.akashgi.com to get access to nine AI products for an entire year for free. This includes Dovetail, Mobin, Linear, Reforge, Build, Descript, and many other amazing tools that will help you as an AI product manager or builder, succeed. I'll see you in the next episode.
Episode: How to Build An AI Native PM Operating System
Host: Aakash Gupta
Guest: Mike Bal, Head of Product at David's Bridal
Date: February 3, 2026
In this episode, Aakash Gupta welcomes Mike Bal, Head of Product at David's Bridal, to dissect how product managers (PMs) can build an “AI Native PM Operating System.” Mike draws from his hands-on experience leading a digital transformation, showcasing live demos and deeply practical insights. Together, they walk through the real-world tool stack, workflows, and philosophies that define high-output, AI-empowered product management.
“AI native PMs are actually working from what do I need to do to what are the steps…to what are the best tools to get me those things.” — Mike Bal (01:48)
Core Tools: Cursor, Claude Desktop, and Model Context Protocol (MCP) for seamless integration (04:47–09:17)
“So creating wedding planning test titled ‘rent a car for your honeymoon,’ it’s creating that document… I don’t have to go to the tool.” — Mike Bal (07:47)
Composability: Pure tool-focused UI is less important; it’s more about connecting knowledge, data, context, and actions across core tools.
“It’s catching pretty specific things…little discrepancies in design.” — Mike Bal (28:37)
Core Licenses:
Advised Approach: Start with lower-tier or free plans, demonstrate value, and only expand as habits form and needs grow.
“If you show me you’re doing it and it’s helping…happy to bump it up…” — Mike Bal (49:16)
Full Lifecycle Integration:
Biggest Mistakes:
“Being able to defend it and being really intentional about the end state of it…is 10x more valuable.” — Mike Bal (57:46)
On the Core AI Native PM Mindset:
“AI native PMs are actually working from what do I need to do to what are the steps that I need to get done to what are the best tools to get me those things.” — Mike Bal (01:48)
On Abstraction & Operating System Thinking:
“We’re actually operating with a layer of abstraction from UI…logging into 20 different tools…is a bunch of time wasting.” — Mike Bal (02:47)
On Custom Instructions & Memory:
“I try to set that for the specific ones that need to be differentiated and then hope that it can maintain. But I also use like the memory MCP…” — Mike Bal (12:20)
On AI as Second Brain:
“It’s nice to have a second brain to rely on to gut check yourself.” — Mike Bal (29:10)
On Upskilling with AI:
“Taste is incredibly important. I think intuition is incredibly important.” — Mike Bal (57:46)
For anyone who wants to level up their AI productivity as a PM, this episode is essential listening—and a practical roadmap.