
Loading summary
A
For years, planning any trip meant flipping between Google flights and a couple of award search tools, putting all those prices in both points and miles into a spreadsheet, and then doing the math to determine what to book and how to get the best deal. Then a few weeks ago, I built a new tool that does all that in just a few minutes. It knows my rules, like never suggesting super early flights or having a layover unless the savings is really worth it. And this is just one of about a dozen tools I've built recently that have collectively replaced all the spreadsheets I use to dial in pretty much every part of my life, whether that's travel, health, work, and a lot more. So today, I'm comparing notes on this setup and the AI tools that power it with my good friend Anish Acharya. He's a tech investor at Andreessen Horowitz and a longtime friend who's been thinking about all this from a different perspective. And we're going to get into why I'm not even using openclaw anymore. We're going to talk about the architecture mistakes that almost everyone is making the case that personal AI is about to give every consumer the kind of optimized life we all dream about. And what happens when the heavy subsidies on all these tools eventually go away? I'm Chris Hutchins. If you enjoy this episode, leave a comment or share it with a friend. And if you want to keep upgrading your money points in life, click follow or subscribe.
B
Anish.
C
Yes.
B
We're doing it.
C
Okay. We're doing it. We're doing it. This is a thrill, man. We've known each other forever.
B
I know. I feel like we've been talking about this nonstop. Yes. And we haven't sat down to actually put our thoughts to paper. And so I guess now we're putting it to audio, but. And video.
C
We should give everyone a little bit of the backstory. We've known each other since Google, since you were responsible for some of your original hacks, your hacks before they were there, trademarked as hacks, which was helping the group of us go from Google to Google Ventures, and then later. I don't embarrass you about being responsible for my marriage.
B
Yeah, well, you know, I've only been responsible for one, and you are it.
C
The most important one. Yes. Well, dude, it's a pleasure to be here. You know, it's funny, the way I've been thinking about the world right now is it feels like there's two religions in Silicon Valley, Claude Code and openclaw. Each of them have their sort of figures. And I feel like you're the person who's been the most at the edge of the OpenClaw, a broadly AI enabled productivity ecosystem. So maybe a good place to start is like, what are you working on?
B
I like the way you framed it because I've been having a, I don't know, internal crisis about this show. The show is not supposed to be the AI show, right? I don't want to have videos where every day I'm publishing stuff about these models. But the way I generally think about the world and the way this show evolved was like, I really like to dial in my life. And historically dialing in my life has been making spreadsheets, doing tons of research, and now I'm just still dialing in the same aspects of all my life. My health, my points, my miles, my travel, my money. It just turns out that there's these amazing tools that make it so much more efficient. So one thing that I do a ton of, as you know, is I book flights with points. And searching for flights with points has gotten so much better in that there are all these tools that you can sign up for, most with a subscription, but some for free, where you could say, oh, I want to go from point A to point B. You can maybe add some variability. The challenge has been that when I want to book a flight, what I want really depends on what the prices look like and what the points look like. So if it's a really great deal on points, awesome. But if it's not, maybe I'll pay cash. If business class is a great deal, maybe I'll do that. So the way I've always searched for flights for a trip is I'll go to some award tool and I'll search June 5th, then June 6th, then June 7th, and I'll write all the prices in points. Then I'll go to Google Flights and I'll search the same set of dates in coach, Then I'll search the same set of dates in business. I'll put it all in a spreadsheet and then I'll go in and like figure out the math to figure out what makes sense. Now I thought, well, I have a computer, I have an agent that runs on the computer. Can it just go do all this, aggregate all that same information and present it exactly how I want. And so in a couple prompts I was like, I want you to use these tools, Google flights, award tool, seats, air, whatever it is. And I want you to collect all the information for these dates with the preferences you know, I have. I don't want to leave before 7am unless I'm going to save more than a hundred dollars. Like I only want non stops if available. Unless I'm going to save like 200. Like I have my rules. And then I get this response, maybe it takes 10 minutes and it's like, oh, you want to go to Cabo for five or six days and you can leave on these three days. Here are your options. We know how you value points. Here's what you need to make a decision. It's amazing.
C
Incredible. And okay, so take me back in time. Was this the beginning of your journey? What product were you using for this?
B
This is actually very recent. It was part of an experiment to detach myself from one tool. So part of the experiment was, can I build this tool with Claude code and then can I use it with OpenAI's Codex?
C
Okay.
B
Because I wanted to do things that were platform agnostic. Because I think, I don't know, I felt like we were going down this path where it's like everyone's saying chatgpt, we're all tied in and I worry that in the future we're not going to be tied in. And I want a platform that allows me to not be tied into one thing because who knows, like people are pulling the plug. People are going to raise their prices, they're going to stop subsidizing and I want my skills to work wherever they are. This is later, right? I went deep on openclaw for a while. We talked about it. I was like singing from the mountaintops about how cool it was.
C
Yes.
B
Now I'm not even using it.
C
That's crazy. That's crazy. But how are you interacting with your cloud code and your Codex then?
B
So at its core, Claude code or Codex, I'm kind of interchangeable. This week I already maxed out my cloud code max plan. So I'm like, I can't use.
C
You're a token glutton.
B
I know. I've been, you know, I feel like I've just been probably hitting it hard. I'm trying to automate a lot. Someone's leaving our team. So I've got a lot of new processes to automate, so I just kind of hop in. I'm almost exclusively using chat or code.
C
Yeah.
B
This kind of middle ground of Cowork that Claude has.
C
Yeah.
B
I feel like it's a great entry point until you realize that like you could do a lot of the things in Cowork just with Chat and then you can also do a lot of the things in code and more.
C
Yes.
B
So I'm mostly doing Claude code or Codex sessions.
C
Yeah.
B
No longer using the command line. Mostly using the native apps for each of those platforms and throwing it in a folder that I've now kind of treated as my agentic operating system, if you will. When I got after openclaw, it was fun. It was great. It did all this cool stuff and I was telling everyone about it, but one of the challenges with openclaw was everything was messy. There was no intentionality, and every two days they were launching a new version and it was upgrading itself, sometimes breaking. I remember the thing that I got me so excited about using Claude code was actually that it could fix openclaw.
C
No more gateway down errors.
B
It was so bad.
C
Yeah.
B
And I also realized that practically agents and humans are very, very different. And so you can't just have one threaded conversation forever with unlimited context. With OpenClaw, it just got so messy. You'd be like, hey, I want you to tell me what's on our meal plan tonight. And then next you're like, oh, okay, can you go build me a cool app? Oh, can you go search for these flights? It got so messy, and I felt like tasks needed to be threaded like they are. When you log onto ChatGPT or Gemini, there's all these nice clean threads.
C
That's right.
B
That's what I was miss. And I feel like openclaw didn't have a good solution except building more agents. So I built more agents.
C
Yeah, talk about your sub agent architecture and your agent per channel, because I think that's very sophisticated.
B
So I started with, oh, I want a bunch of agents, because I want one agent that can help me, you know, think of creative for the podcast, one that could do some research, one that could help family stuff. I had this idea that they all needed to be different agents, and now I don't have different agents because the concept of different agents doesn't even make sense. Every agent, your agent, my agent, they're all exactly the same. They're just assuming we're using the same model. If our agents both work on top of, let's say, Codex 55 or ChatGPT5.5 or Opus 4:7. Like, they are the exact same thing and the only difference is what context we feed into it.
C
Correct. Yes.
B
And so that context, originally I was thinking too much like a human, and I was like, well, if someone's being creative, I want all their knowledge and their lived experience in their brain. To be all about creativity.
C
Right.
B
But there is no lived experience. There's just what is in the session you have. And so for people who haven't gone too far down this path, every model has some number of tokens that live in its context. And every time you ask it a question, you're sending a bunch of information to it and it is giving you an answer. Anything that is not in that information, which by the way, that information could be, here's where to go find other things.
C
That's right, Yeah. I use a tool called a search memory, et cetera.
B
But if you don't give it that information, it has no idea what's going on. Whereas humans. It's kind of interesting. So if you take an average model with let's say 200,000 token context window, some are now up to a million. It's roughly a 500 page book. Okay. So that's about 200,000 tokens. 500 page book. Human brains keep in context. We're not the same as we can't measure it perfectly in tokens, but it's like 4ish things in context at any given point in time now, depending on our experience with those things.
C
Yeah.
B
And I was reading an example where it's like a chess grandmaster. One of those things might be the board, a newbie to chess. One of those things might be like what does the rook do? So we have varying degrees of what each of those four things are. The difference is that we have like two and a half petabytes of long term memory in our brain that we can tool call. That we can tool call in milliseconds.
C
That's right.
B
Which is like the equivalent of 12 million 500 page books. So like we have this immense ability to query all the stuff we know.
C
Yeah.
B
But when we make decisions and when we like act actually run calculations in our brain, we're using much, much smaller amounts of information. So if you were to say, gosh, I want to look at everything I've ever sent by email to this one person and I want to understand our relationship, you could literally throw all of those emails into one context and an LLM could process it really, really well. For me, I would have to read them and then I would have to like store experiences and then tap on them and it would take a really long time, it would be lossy. And so I've just started thinking, okay, so if we're going to structure how we work, we have to really think about it in terms of how these tools work, not how people work. And the whole agent model was great, but it kind of personified these computers. And it made me think, like, oh, I should set it up. Like, I would set up my team. I have the cfo. I have this. I have this. Yeah, maybe that's necessary. Okay. But what I started realizing was if you create workflows and LM seem to be really good at, like, repeating specific things, really bad at, like, abstract thinking on the leveling chart of, like, hiring managers versus VPs versus directors, they're not directors.
C
Okay.
B
But at the lower end of the experience, they're really good at these repetitive tasks. But you got to define them. And if you're going to define them and you know that the context is whatever you give it, then it doesn't really need to have another agent. You could just throw into the context. Here's a thing you need to do. Here's the style guide for this type of writing. Here's the past examples. And so, yeah, I don't know. I didn't need a persistent thread with 10 agents. I just needed a set of rules or skills for how to do things that any agent can hand off to another agent.
A
This episode is brought to you by bilt. You've heard me talk about BILT as the loyalty program that lets you earn points on rent wherever you live. And they just leveled up even more. As of 2026, homeowners can also earn up to 1.25x points on their mortgage payments. This is all thanks to bilt's three new credit cards, the Palladium card, Obsidian card, and Blue Card. All three turn your housing payments, rent or mortgage into flexible rewards so you can choose the card that fits your lifestyle without missing out on points and exclusive benefits. Built points can be redeemed at top airlines and hotels, Amazon.com purchases, future rent payments, and more. Built points have also been ranked by top publications as the industry's most valuable point currency. Your housing payment is already your biggest expense. Make it your most rewarding. Find the card that fits your lifestyle and apply today at joinbilt.com allthehacks that's J-O-I-N B I-L-T.com allthehacks make sure to use our URL so they know we sent you. Terms and limitations apply, subject to approval and eligibility. BILT cards are issued by column NA member FDIC pursuant to license from MasterCard International Incorporated. This episode is brought to you by Green Chef.
B
I've tried keto.
A
I've tried intermittent fasting. Neither stuck. And honestly, no eating trend has made a real difference the way just having good food already in the house has. That's why I've prepared dozens of Green Chef meals over the years. The chicken with garlic spinach rice is a favorite and we've reordered the Thai style coconut shrimp soup multiple times. What I love most is just not having to think. No deciding what to make. No wondering if we picked up a veggie for the side. Every week you choose from over 40 recipes, Mediterranean High protein or the new longevity line built around brain and gut. Health meals come pre portioned with organic produce and responsibly sourced proteins. Plus unlimited one on one nutrition coaching. Green Chef is the only certified clean meal kit tested by the Clean Label Project. If you can't tell, I'm a big fan so you should definitely check it out, head to greenchef.com 50All the Hacks and use the code 50All the Hacks to get 50% off your first month, then 20% off for two months. That's code 50AllTheHacks@GreenChef.com AllTheHacks okay, so just
C
taking you on the journey then. You started with a single openclaw instance that you messaged via imessage or telegram or whatever else. But then it started. There's just too much in the context window as you were context switching, human contact switching, it got confused. Then you move to subagents per sort of topic area, travel, family, finance, et cetera. Then you realize that was probably too skeuomorphic. So now you're sort of back to a single agent which has different skills. Each of the skills sort of corresponds to pulling in different things to the context window and different topic areas. Is that right?
B
I think that's pretty spot on. Okay. I would say it's not that I have one session every time. Now I have multiple sessions because there might be things that, you know, I want to follow up on. Like if I have a task, that task could either be something that just gets done once and then I don't ever need to come back to it. But it might be something where it's like, could you draft this thing? And then I want to have a conversation. So it's important to spin up a task in a way that you could continue having that conversation. Not that it dies at the end.
C
Yes.
B
But at the end of that task I don't need that person to like exist. I might have a Now take everything you learned from my revisions and save them to this. Like how Chris writes doc. But that how Chris writes Doc is actually relevant for all the tasks. It's not just relevant relevant to one specific thing. And so when I look at my workspace, I started with the open cloth folder and I was like, okay, let's start here. And I was like, okay, we need skills. Like skills are these repeatable tasks. And so you have a flight search skill, we have a go reconcile invoices skill, we have a go transcribe something skill. And they could be pieced together. I've got connectors which are just other things you might want to connect to. Here's one password, you know, here's Google Drive. Whatever things that I don't need you to replace, go get connected with them. Projects which are like, I want you to just know that there are multiple things happening if there's multiple skills and multiple things. Like, sometimes I elevate a skill to a project where I'm like, it's not an app yet.
C
Would like trip to Japan be a project
B
in my mind Project would be like finance, where it's like, there's just a bunch of stuff going on and I want to like, sub understand all the pieces and the channels is interesting because I might want to engage with AI tools in different ways. I might have some things that I want to be triggered by, you know, sending an email, some things that I want to be triggered by sending an imessage something that I just want to go create a session in a native app. And so every single one of these things needs to be separate. In openclaw, it's like there's one unit and it has a channel and it has skills. But here I might want to kind of piece it together and say, well, I want a channel where if I message you on Slack, I can send a message on Slack and it will use these three skills.
C
Yeah.
B
And then there's delivery, which is how the output gets sent.
C
That's right.
B
And you might want to click a button on a dashboard you build to trigger something, but you might want the delivery to be an email. So for me, I might want to go do an award search. And I can trigger that by using the Claude app. Like, the thing we haven't mentioned is that most of my interaction with this is not with custom tools. It's with like Download Claude or Codex, which is like the desktop app for the two mainstream providers, and use those tools on your computer to do all this stuff. Back when I first did this episode, I was like, here's how you can go use the command line and here's how you can go download these packages and it's like I don't think most people need that anymore to have a really, really both sophisticated and simple version of how to use these tools to actually add value to their lives.
C
Incredible. Okay, so maybe talk to me. So I understand the architecture. Now you mentioned the kind of award flight booking example. What are the other aspects of your life you've automated and maybe give us just a window into what you think the everyday consumer's life could look like. Because in a sense, you've always been automating your life. You've got a lot more leverage than you've ever had before. But I think one of the magic of these products and tools is that everyone will now have access to the kind of level of effort and sophistication that you've typically put into your own life for many years.
B
I will go there, but I know you've been thinking about this a lot. So I want to push you to kind of share, share your perspective on where have you been coming at this from? Where are you at? Where do you think we are?
C
Yeah. So okay. In the world of the two religions, I think I've been more about Codex cloud code than OpenClaw. I think OpenClaw is really, really interesting and I think, you know, I mentioned this a few weeks ago. I think the ergonomics of it are actually fascinating because it meets you where you are, which is imessage telegram group chats with your family which you told me about. WhatsApp messages. You can have this bots chatting with each other within these group thread. There's just something about the way you actually consume the model and the skills and everything else that openclaw seems to have gotten right with that said, I've just never been that productivity pilled. I probably run a relatively unproductive personal life, but I've been so much more interested in kind of making new things. And that's what the coding agents did for me. With that said, it feels like the two big breakthroughs we've had, there's been many, but the ones that I'm paying the most attention to are coding agents in 25 and personal agents open claw in 2026. I've been thinking about this a lot lately, like what are the kind of implications for society, the world? And I think one of the big implications is that you kind of have this zero marginal cost of doing work now or digital work. Right. Just as software I think is very famous and important for zero marginal cost of distribution, you now have zero marginal cost of doing Work because of computer use plus personal agents. So all of these things where apathy or sort of being under informed about a product area like finances is the most classic example where if you sat down with someone and looked at their credit report, or if I did, I'm sure we could identify for the average person a dozen dozen ways they can make improvements without changing their life at all. Like, I'm not saying don't drink the Starbucks. I'm not saying make different financial choices or change your retirement target age. I'm just saying with all of the existing constraints of your life, here are a couple of hacks that you can make to actually improve your score and decrease the cost of your credit, et cetera. But it requires motivation and information and that all requires work. And now that personal agents can do the work on your behalf, something that I think could happen in the near future is that every consumer lives a fully hacked life. You know, it's sort of completely optimal, as if you designed it yourself for them.
B
So I think the one thing I'll push back on is I've been playing the points and miles game for a long time. And I think when I probably got started, it wasn't as mainstream, right? Like, not every person went and had a points card and was trying to think about these redemptions. And then they added award search tools, right? And so now the average person doesn't have to go figure out how to log into the Air France website to search there and do this. So it's like, okay, cool, it's gotten better. However, if you have a United card, you might get a discount. And that's not going to surface on an award search tool, at least not today. And you know, I could tell you, for example, one crazy thing is that for domestic United flights, like just short haul flights, you could book them on United. It's often 10, 15,000 points. You can book it on Air India for 3,500 points. The award search tools don't search Air India. Air India is hard to even get points in. They're like not a transfer partner of a lot of programs.
C
But they are for United States.
B
No, no, they are a star alliance partner of United. But to get Air India miles is not easy. But like Rove miles, it's a hotel booking portal, like you could and a shopping portal. You could earn some road miles. You don't have to earn a lot. 3,500. And now all of a sudden you can book this $300 United flight for what's like $30, like 90% off and so everyone will always probably continue to be more and more optimized just as we've gotten more and more productive and more and more healthy and life expectancy and all stuff. But there will still be, I think, as this evolves, knowledge that isn't shared publicly. I wonder if knowledge gets less democratized over time. That's interesting because, you know, I don't want to share everything in the world with an LLM. So does everyone start, like turning off access these LLMs and to do conversations about things get more private? 90% of the content that I consume about points and miles and stuff is not public in that it's not going to show up in a Google search result. It might be buried in a Reddit comment. It might be on a podcast. It might be in a telegram or Discord group or a Slack group. And so I think, yes, people will be living more optimal lives. But I also wonder if those of us who kind of like doing what I do, maybe not as much you like, will still find ways to live a even more optimal life.
C
Well, I guess the question is there's secrets and then there's just things that are a pain to know. You know, if you had to go follow every telegram group chat and listen to every pod. Like, all of those things are highly doable by agents. So this is my point about zero marginal cost of doing work. They will do all that work for you. They will discover every secret that is discoverable. Of course, there will be secrets that are kind of whispered down shadowy hallways, and the agents won't be able to find those because there's just no path to them. But for everything else that is knowable, I think the agents will know them, will operationalize them, and the system will have to change around us.
B
I'm excited to find out how this evolves. Like, I don't know the answer, but actually, what you said is something that has been the kind of crux of a lot of these tools for me is that there have always been kind of three things. And I'll use health as an example, but I don't want to go too deep because I've already put together an episode entirely on this. But when I think about my health, there's kind of these three, or let's call it four components. I've got all the knowledge, and there's probably two tiers of knowledge. There's like the public knowledge, which if you search health things on the Internet, varying degrees of authenticity and everything.
C
Yes, yes, yes.
B
But then there's like the scientific Journals. There's the medical research, the clinical trials. Then there's what I'll call, like, it's not private knowledge, but it's knowledge that doesn't today surface well in things. Maybe it's podcasts, maybe it's member sites, maybe it's private YouTube channels. I don't know. I'm sure you and I listen to various podcasts that aren't private, but the information in them doesn't necessarily show up on the Internet. When you search on an LLM, maybe you pay for a course. There's another good example. So I have this knowledge. So I have a bunch of places I like to get knowledge on every topic. Health, this, that, and the other. Then I have all of my personal data. And that personal data for health is pretty obvious. It's my biomarkers, it's my pernuvo scan, it's my doctor's appointment records and all that kind of stuff. And then separately I have. Where do I go to ask questions. And most of the time I'm going to ChatGPT or Claude or Gemini, and I'm like, hey, what should I do that has no knowledge of, like, here's the people I trust to give me information and where I would get it. And then here's everything about me. And so what I said, I think we can start to build in. Every area we care about in life is tying these things together. And I don't know if they get tied together publicly and maybe they just get tied together by us, but that's where I think everything gets really fascinating. This is true. You know, I say health as an example because I've done a lot of health tests, I've crawled a bunch of health data and built a little database of all this health information. And then I have all of my stuff, and then I can go ask questions and it hits those and public, and that's where it starts to get magical. And that's what's not obvious today. A lot of people, my mom like, oh, let me ask. My stomach hurts. I ate this. What should I do? But it doesn't know your genome and it doesn't know that maybe you trust Andrew Huberman more than anyone in the world, and so you want his advice or whoever your medical expert is.
C
But you know, Chris, okay, so here's the real black pill in all of this. My counterpoint to actually the true sort of fully optimized consumer in the near future thanks to this technology. The black pill is if you look at what we've done for the last Two years is we've made it really easy for anyone to make a movie or a creative artifact and anyone to create a piece of code or software or anything they wanted to make. And Nikita said this, and I think he's right, which is like, funny enough, two years later, the exact same people who are doing all these things are still doing them, except with the new tools and technology. So perhaps you're one of the most high agency people I've ever met. You're probably top five in my world. And yet you continue to be high agency with these new tools and technologies. Like, does the mass sort of public actually have enough agency, interest, curiosity, motivation, even if the tools are right there to operationalize all of this?
B
I don't know. I think that where this kind of takes a turning point is when the tools can start to both do things less expensively. We should talk about the cost, but actually prompt it. So right now, if you go to ChatGPT and you say, hey, my stomach hurts, what might it be? It's like, oh, what did you eat last night? It doesn't say. Like, hey, here's a connector to connect to Apple Health so I can pull in that data. Who's your doctor? Can I email them and get your medical records?
C
It's a great point.
B
It doesn't say, hey, you know, if we really want to do this, we should track what you eat. Can I build you an app to do that that will live on your phone? And by the way, that's actually probably an easier problem to solve than many other things.
C
That's a product problem. Yes.
B
It's literally like, if these tools just. And by the way, it's like, oh, we're halfway there. Do you want to upgrade your plan so we can finish building? Like, it seems like a great thing for all these people to build, but that doesn't happen yet. And so today someone's like, hey, ChatGPT, how do I use all my points?
C
Yeah.
B
And it's like, well, you should search for flights.
C
Yes.
B
Not, yeah, let us evaluate everything. Let me connect to everything you have.
C
It's great.
B
And so I think once the connecting happens, it's really magical. So I'll give you one example. I built this app called Card Tool. It manages all your credit cards. And the other day I was having a conversation about paying taxes with credit cards with Claude. Oh, my God. Now Claude is tied into Card Tool.
C
Okay.
B
And so when I went in and I was, like, asking a question, it was like, hold on, let me just go look at what cards you have and what their available credit limits are. I didn't think, let me go open this app that I've built and collect this data. But because I'd connected card tool to Claude, it just knew that's a tool available to me. Let me go check the tool and let me give you a better answer. Because I have the data now. I've taken the time to connect all these things. That's like an onboarding thing. And it's a challenge that most of the things out in the world don't yet have those connectors. But I have to imagine in, I don't know, months to years, every single consumer product, service, data store, et cetera, is going to have some way to connect to every single agent tool and then it can start to do some awesome stuff.
C
What is so beautiful about what you're describing, what's got me thinking about is the fact that, okay, maybe a subset of people in the world have really high agency. And actually, Sam and Elon are maybe the two most high agency people in the world. And for them, funny enough, the chat box is actually the ideal interface because they just say what they want. But perhaps for much of the rest of the world, the agency has to be sort of externalized and the model has to be like, you know, a bit of a trainer. Like, hey, do you want this really good thing to happen? I will just do it for you. Let me like push you to like connect all of your data to do this thing. Let me phone your doctor on your behalf. Like, I don't even have to be high agency for that. I kind of just have to let it wash over me. And for all these good things to happen in my life, it's a really interesting idea.
A
This episode is brought to you by NetSuite. Every business is asking the same question. How do we make AI work for us? The possibilities are endless and guessing is too risky. But sitting on the sidelines is not an option because one thing is almost certain, your competitors are already making their move. No more waiting. With NetSuite by Oracle, you can put AI to work. Today, I'm using AI across my business. And the hard part is knowing which tools hold up when everyone's shipping something new. That's what I like about netsuite. It isn't another bolted on tool. It's AI built into the system that runs your business. The number one AI cloud, ERP, trusted by over 43,000 businesses. Bringing your financials, inventory, commerce, HR and CRM into a single source of truth from software and IT services. To healthcare, equipment, manufacturing, financial services and many other great American industries. NetSuite delivers a customized solution for your business. If your revenues are at least in the seven figures. Get our free business guide Demystifying AI at netsuite.com hacks the guide is free to you at netsuite.com hacks netsuite.com hacks this episode is brought to you by Mercury. Radically different banking now available for personal accounts. I've been using Mercury Personal for two years now and I love it. It's fast, elegant and designed to help you do everything you want with your money all in one place. You can send wires with no fees, set up sub accounts for your goals and projects, put your money to work with high yield savings and set automations to move money between accounts so everything is funded without you having to think about it. You can also spin up cards with custom limits for an assistant or a nanny. You can grant read only access to your accountant and manage shared finances with joint accounts built for clarity and control. All of this for one all inclusive subscription of $240 a year. Visit mercury.compersonal to learn more. Again, that's mercury.compersonal Mercury is a fintech company, not an FDIC insured bank. Banking services provided through Choice Financial Group and Column N A members fdic.
B
I don't think it's prescriptive enough and proactive enough. And that was like the biggest complaint when all these agents started. It's like, I want you to do work for me. Well, the reason is like it doesn't think, like it's not a human. And so over time it will start to get better. Oh, let's spin this up and see what are the things this person is accomplishing and what can I do? If you said I want to be healthier, it's like, here's a fitness plan plan. Why doesn't it text you the next day? And that comes back to from a channeling standpoint, I think you have to do it yourself. But you can, right? So I stopped using OpenClaw, but I still use iMessage. And so you can both use out of the box channels for Claude or you can build your own channels. And one of the things that I think this goes back to, like you have to want to do it. But I literally was like, well, Openclaw works with iMessage. I want Claude to work with iMessage. Claude has this channels feature so you can work on channels. So I literally just took the openclaw code and I gave it to Claude code. I Was like, build a channel that does exactly what this other thing does. And in like 20 minutes, I'm, I'm messaging an instance of cloud code. It just works. But you have to have that thought. That's the thing I think a lot of people are missing. It's like, I don't want this thing to work. Can I fix it? Well, here's a great example. A lot of people know that I've done tons of research on random stuff. So people call me all the time and they ask me questions. Now they could go answer these questions somewhere else. Right. Like, but they just know that I'm this person that's like, probably answered it.
C
Yeah. And you've sent me these tags, human chat, GPT. It's beautiful.
B
Once people realize, oh, that's how it works, like, their mind will be trained to be like, oh, let me just ask this question, let me do this thing. And I've seen it happen. And I'm going to give you an example because you're going to actually tell me how it turned out. This woman asked me, can I have it reorganize all my photos? Can I do all this stuff? And I wasn't sure if it could. She wasn't someone who uses these tools like I do, but she had the idea when we were talking and once more of those ideas happen and you see the output, then you'll just start to train yourself. Oh, I can share those ideas. I can get comfortable with that. And you did it. I did.
C
Well, you should describe what the prompt was or I will, if you like.
B
Yeah, go for it.
C
Yeah. So the prompt essentially was. And you're just talking about your show with Kevin, which I thought was terrific. The prompt essentially was, hey, can I have Claude or Codex go and look at all my imessages, all my photos, and put together a video montage that sort of interleaves and puts together, you know, sweet messages with my partner, photos of my family, and puts it all into this nice finished polish artifact. And you were saying on the show, oh, I don't know, that might be a little beyond its capabilities. And then you tried it and it worked. So I too tried it yesterday after being inspired by your pod and it was incredible. It like selected a photos, it found all of these old imessage. You know, I'm sure you've got thousands of messages with Amy, I've got thousands with Lindsay, and it went all the way to the bottom to find the very sweet and endearing ones. You know, one from Lindsay that said, hey, it's true. I'm pregnant. Things are so great right now. Just something that was such a nice digital artifact of our relationship that I never would have found otherwise.
B
So it found messages and photos?
C
Yes, and it selected great photos. I wanted to put the music on myself, but I said, hey, go and dig into my music library and give me 10 ideas for tracks that might be cool. And I found this remix of a Drake track. You know, sorry to all the Kendrick people. That worked great. And I threw it on there, and I am so excited to show it to her. I actually showed it to the kids last night, and they're. They're super excited, too. So it's just amazing what's possible. It's something that's richer and I think more touching than anything I would have made myself.
B
You know, I think of my daughter, right? She's so young, and she loves creating stuff. It's like that creativity that we probably don't have, we've kind of lost it as adults. And I think that these tools let us do that in a way that we're probably even more comfortable with than, like, working with wood or working with paper. And I'm just excited for it to be retrained. I will give a kind of caution, because when I think of the task you just described, it reminds me of why things often break in. These things, it's like if you describe everything you want all at once in one big blob. Sometimes I found errors and problems. And so I'm gonna encourage people when they're trying to be creative to do this, like, reverse prompt, where it's like, well, I'm thinking about this. How should I structure it? It's like using the tool to ask how to use the tool. And I always find that when I do that, I get much better output. And so if you want, you can send the exact prompt you used. I'll share with people. Thinking carefully about the prompt is something that I think we probably won't have to do as much in the future. But right now, it's like the difference between good and bad output.
C
Well, it's also very interesting. I think, for a lot of creative fields, people are unable to articulate what they want to see or hear or what they like. If I told you, hey, tell me what your favorite track sounds like, what does it really sound like? You'd be like, I don't know. It's. It's cool. It's high energy. It's moody. Like, you'd have all these words that were just grasping at what the song was or what the genre was or what the sound was. And yet if you heard it, you'd be like, I know it when I hear it. So you should actually be able to put things like that into the context window, which you can to some extent. But in the future, I don't think you'll have to have the language as sort of like they're the keys to unlock all these incredible artifacts you want to see. And hopefully there'll be a lower bar for that.
B
I find sometimes now if I want to do something in the style of something else, I, like, upload a photo and I'm like, describe this photo.
C
Great technique. Yes.
B
So then I say, now create a photo that sounds like this, you know, And I find that when you break things into discrete tasks, the quality is so much better. And so I. I did a whole session with Codex. Cause I ran out of my tokens on why and. And you know, it doesn't have episodic memory. So it's not like I did this and then I did this, and it just kind of like does it all at once. Prioritization is tough. So they've solved these with, like, plan mode. But it's like, so driven on software development that I think once the tools evolve to be better for human tasks, it's going to get a lot better. But the thing that I learned, building this kind of replacement for one of the processes the this person on my team who's no longer going to be here is doing is if you want something to collect information and then rank it and then draft it, oftentimes my instinct is like, here's 10 sources. Go get this and try to figure out the best and do this and draft it. And then I was like, okay, go get the information, then give me a summary of the information. Now run another task, take the summary of the information, and then go and rank it. And then take the ranked summary and draft it. And when you break it into discrete tasks, maybe future models won't need this. But at present, it's wild how much better the response is when you're, like, not trying to have one session do everything.
C
So since we're on this topic about family relationships, kids, we should get to that as well. You know, My question for you is, I understand what it looks like conceptually to have an optimized life, finance, health professional, perhaps. Do you think that this technology allows us to be more or less connected to our partners, to our children, to the world, to each other? Are we more or less connected than we were three Years ago. Like, maybe give me the philosophical and the practical because you've lived both.
B
I'm really torn on my answer to this question, because I was trying to figure out how to better communicate with someone that I work with. We both took this personality test, and I sent the transcript of everything we've done in our meetings, and I was like, what could we do better? And sometimes it just turns out that, like, certain managers have a style of work and certain employees have a style of work. We were the most incompatible possible. Right? Like, so that's not great. I was like, hmm, I wonder if I could write an agent where it's like, imagine this person's name is Sarah. I have, like, fake Sarah, and I DM fake Sarah on Slack. And then it translates everything I say into the version of it that I wish that I could say, but I don't have, like, the ability in the moment to say it in the way that it would be best heard. And then I was like, gosh, there could be a world where, like, I never even have a direct relationship with this person, but we kind of feel like we do. If that world comes to fruition, then it's like, our relationships are definitely not closer. So what I've been trying to do instead of that is every time I have a conversation with someone, I'm like. I send the transcript of the conversation. Like, tell me how I could be better at communicating. Tell me how I could improve this.
C
And does it work?
B
I think it does. Like, I think I've gotten to be a better manager. I'm gonna go deep on what I've learned doing this, because I'm recording, like, everything that happens. And so I could have all my personality. I could have all the conversations with my wife and all the texts we send. And every month, I could say, hey, go look at everything we're both doing, talking, saying, what could we do to be better? So that would probably make our time together better, which I think is great. Like, I want to have a more exciting relationship with people.
C
Okay? Two things put together. And this is such a perfect microcosm of the entire debate around technology and AI, which is you have an abstract concern that perhaps if virtual Sarah talks to virtual Chris, you never interact. That would be terrible. Meanwhile, on planet Earth, you actually are communicating with Sarah much more effectively thanks to this technology. You know, like, there's this abstraction of a fear, which maybe it could exist, but, like, it's just not what's happening. It's not how you're experiencing the technology in the world.
B
And honestly, I'm thinking about how does this affect our kids? Like what, how are they going to communicate? What are they going to do? There's a lot to unpack there. This is not going to be that unpacking.
C
But tell us everything.
B
I think that I've been able to improve so many things and be more efficient. And ultimately I think I'm probably a little bit too much of a spreadsheet productivity junkie to stop. But I think most people don't do what I do at all, at all. And so if they can free up some amount of their time each week, what can they do with that? I have a long list of things that I want to do with my free time that I don't have. And I've been able to make meaningful improvements in personal things, not just work. I feel like we always talk about, like, let's build a software to do this thing. But it's like spending time planning a trip, spending time meal planning, spending time, like all of those things. If you can outsource a lot of those tasks, then we have more time. Right? Like, I could spend more time just hanging out with my kids. We could spend more time on the weekend just going to the park. I mean, most, most of the weekends off for AI, but it can work
C
in both directions, you know, so, for example, maybe a future form of passive income. The same way like I call Chris to get his quick advice on taxes. There's like the Chris tax skill, which I can hire to look at my taxes and do a bunch of work and give me a bunch of feedback that I wouldn't otherwise get. And you've like written it, you maintain it. So perhaps there's a way to extend the one thing in the world most opinionated at into this technology in a way that financially benefits us.
B
I would love to be able to have some version of my lens on certain things to give to other people. And then I would love to have your lens on certain things. Like imagine I'm like reading this article. I'm like, what would a niche say?
C
So can we do that in the bigger world? How do you get compensated for doing that? Because in a sense, that is your specialization, so you don't want to give it away for free. And I think a lot of people would pay to hire virtual Chris and it would cost 1/1,000th what it would cost to get you on the phone, perhaps. Right.
B
That's a cool, a cool world because I think we all have different, like random specialties.
C
Yes.
B
Like I might want my father in law to teach me this, like, strategy in a card game that he's, like, played for 50 years. Yeah, I could just go spend time with him. But, like, conceptually, like, you know, if someone else came to me and they're like, I'd love to learn how to do, be a better cribbage player. I'm like, oh, you should hang out, my father in law. But that's never gonna happen. But, like, he's got a little, like, you know, you could basically virtually play with him and learn.
C
It's this thing I've thought. Thought a lot about this, which is that there's intellectual knowledge, like the stuff that you, like, read in books and maybe read blog posts about, like, knowledge that's quote unqu Important with a capital I. Then there's tacit knowledge. Tacit knowledge is just the knack for getting things done. The majority of the knowledge in the world is when you call customer support at United versus Air India. Maybe you sort of know what the transcript to talk to each is to get exactly what you want, or what the tone or what the bedside. There's just so much information that is never explicitly expressed that could now be captured and be used to benefit you and to benefit others. I think that's really beautiful.
B
You gave me this example, or someone did, about how could everyone just start to pay all these bills or fight all these bills. If you get medical bills, if you want to negotiate them, or if you want to submit appeals, you can do all these processes. So stage one is going to be everyone can just do that. You get a speeding ticket. Do you remember where it's like, hire a person and they're like, write the defense. It's like, yeah, stage one is, I want to fight my parking ticket. Just go do this for me. Stage two, though, is like, what happens next when all of a sudden there's just every bill is getting fought? Like, I don't know how that evolves, because you're seeing this in software. Like, the number of people contributing to software is crazy. And so I'm not a software developer, but I was using this one app as a teleprompter app. It's like a Mac app. And I route all my audio through this Rodecaster. It's like a mixer that we're using today. It ultimately means that the audio device on my computer has multiple channels, and this app didn't work with multiple channels. So I was like, well, let's download the source code for this thing. Let's tweak it to do this. And then I was like, let me submit a fix that got accepted, and now it's in the final app. And that's cool. But then I noticed, I looked, I was like, there were like a hundred people submitting these things. And so it's gonna be overwhelming when the insurance company has 8 million claims and appeals and fights. I don't know how that gets solved. But in the short term, I feel like anything that you're like, oh, would this be worth it? We always had this question, what's my time worth? Is this worth my time to go fight this thing? To go do this thing?
C
Yes. Well, now you don't have to ask that question. That's exactly. And I'll tell you a funny side story. Then we should talk more about this, which is Brazil is a country in which it's the easiest to sue anyone in the world by a long shot. So in Brazil, you can sue people digitally, and people sue companies. Like they sue the Brazilian doordash because their order was late. So companies get sued thousands of times per day for otherwise trivial things. And so, in a sense, you already have this ddos of sort of agents in the form of lawsuits. And they, of course, then have their own technology, but they've had to change the system on both sides to handle this sort of emergent behavior. And I think it would be beautiful if, for example, we had to change, I don't know, the DMV and the way that we handle adjudication of parking tickets and all of these other systems which we assumed were so ossified. Like, if you look at how society works, you know, I don't get too political, but you might say that the super rich get services and all the rest of us get products, you know, or they actually have a team that manages things and makes calls and fights on their behalf, and they never have to make the labor trade off. But now with these technologies, nobody does. Everybody's got a team, and it's incredible.
B
So I'm going to pause, and we talked a lot about how I've set things up and, like, tools, but practically, like, if someone's listening to this and they're like, I haven't dove deep into this, you know, I said, my advice is to continue doing what you're doing. You know, use ChatGPT or something in the web. But I think it's really interesting when you start using one of these platforms and letting it talk to other things. My advice to start to play with this is because we didn't get into the, like, tactical Is unfortunately, I feel like Gemini is a tool that I know a lot of people I've talked to are using. They have chosen to not integrate much beyond the Google ecs.
C
Do they not have the MCP connectors?
B
So Google Gemini on the command line, which is like opening up the terminal on your computer does, but the consumer grade version of it, which is like on your browser, maybe on your phone, they've built like three or four connectors.
C
But I see. And there's no marketplace or no marketplace. I see.
B
Whereas OpenAI has apps and Claude has connectors. And I went deep on both of them because I knew we were going to talk about this. I'm disappointed with OpenAI's version because they treat it like opt in of the connectors. So I'm having a chat and I'm like, let's also include notion in this chat. And the problem I have is that oftentimes like I don't know what I want to do when I first start. And then in order to add custom ones, you have to switch into developer mode on OpenAI, which anyone can do. But once you add this custom one, you switch in developer mode, there's no memory, there's nothing. And I have gotten so much value out of the memory in Claude and OpenAI in ChatGPT because I was like, oh, I'm thinking of doing an episode about Japan, which by this time this comes out will have come out. And I was like, what are some of the things I should include in this? And it was like, I don't even know why I asked it because in my mind I was like, well, it doesn't know what we did, but it does. Because every day I would pop in and be like, if I'm looking for shoes, what are some great places to get shoes in Japan? Like where should we go for X, Y and Z? So it actually had all this context and you lose all of that once you enable these custom tools. So if you can live in a world where you're using the default tools, then both of them are pretty comparable. But if you want to use any custom tools which might be something you've built or just something that exists. Yeah, I really like the Card Pointers app. It basically aggregates all the Amex card offers, Chase offers, they've got an MCP server, you can connect it, but they're not like in the marketplace, so you have to use it as a custom setup. That doesn't mean that it's like, like more risky. It just means they haven't like been approved to be in this marketplace. So I think Claude gets the win there for just making the connectors available. So I would use one of these platforms. Most of the services you'd want to use are probably already in there until you start building your own. And if you don't, the thing that I haven't heard anyone talk about, and I'm sure I'm not the first person to talk about this, because they big company is what Zapier has built.
C
Yeah, yeah.
B
So Zapier built this.
C
You're obsessed with this.
B
Well, so they built this thing. It's Zapier MCP server server. And the reason why it's really cool is they are in the marketplace so you don't have to do anything custom, but then you can go connect all these random custom things to it. And they are in the business of that. So if you're like, oh, I use this one obscure thing, there's a decent chance that Zapier is like 8000 connectors work with it. And so instead of having to go and like wait for company X to go build something specific for these tools, you could connect it to Zapier. And for now, you know you're gonna pay 10, 20, 30amonth for hundreds of calls, which you probably won't need until it's maybe saving you so much time you're willing to pay it. But until then, it's like a great stopgap for I want to do a thing and I can't quite do it and I don't want to go code it. I built this website for a friend that was like how to get started. And it was kind of like a fun, interactive way to get started. And he was like, all I want to do is build a tool that can check my email every day and label everything that meets this criteria. And I was like, oh, this is the easiest tool you could build. And now with Claude has routines and scheduled tasks. And so a lot of what OpenClaw could do, you could just do natively in the app. You don't need to code anything. You don't need Cron Jobs, it doesn't get messy. And you can go in and say like, oh, every day I want you to research this thing and send me a briefing. And I actually tested this. Like you can do a cloud Claude code briefing every day. Now what Claude can't do is send you an email. So you can go into Gmail and say, send me an email, but it can't. The default Gmail connector.
C
Yes, that's right.
B
Drafts it does only draft. Yes, but Zapier's does. And so Zapier can plug Gmail in to send emails. And if you're going to send an email every day, that's 30, 31 emails a month, it's not going to cost that, you know, like. So there are a lot of ways that you can extend without having to build. Now I want to push everyone to go build because it's fun and you could do it. But it's just amazing that if you download these apps, they now can control your computer, they can go browse the web. You know, you want to go build a tool that searches for hotels. Great. Easy.
A
This episode is brought to you by Fora. Imagine if your love of planning trips could actually become a source of income. Your years of comparing hotels and putting together perfect itineraries for friends turning into a real business on your terms. That's exactly what I've been doing. I am a travel advisor. And full disclosure, I'm also an investor in Fora because Fora makes turning travel planning into income actually simple. It's a modern travel agency that gives you the training, the booking tech and a global community of advisors all in one subscription with over 7,000 preferred travel partners. The trips you book come with upgrades, resort credits and late checkout. Every hotel, cruise or activity you book earns a commission and you can build it on your own terms alongside your job, your family or whatever else you've got going on. Become a Fora advisor today@foratravel.com allthehacks that's f o r a travel.com allthehacks and make sure to tell them we sent you foratravel.com allthehills thank you for being here today. You can find all the links, promo codes and discounts from all our partners@AllTheHacks.com deals.
B
They're all brands I love and use.
A
So please consider supporting those who support us.
C
Us. Let me ask you a tactical question about this. How do you manage logins? Like today I was downloading all of my health data from Stanford and you know, you've got to log in on behalf of Codex so that it can then operate the browser on behalf of you. And then every time you want to refresh it, you have to go log in again on behalf of it. So how do you sort of manage passwords logins, especially for your most sensitive data sources, your health.
B
A few options. I was thinking about this a lot because there's a few different ways to get this health data, some of which are not available to consumers but are available to medical companies and Apple Health. So we'll get to that. So one option is just one password. So I have one password set up and it's on the computer. I have one vault that the computer has access to. It's called like a service worker account. It's actually free. You don't have to add a user to your account. So I have a service worker account that has read access to a vault. And that vault, anything I put in there, then my Claude code session or my Claude cowork session or my codec session like anything can do it. I think the beautiful thing that I didn't mention earlier, the most valuable thing I recently did is every time I start working on anything that's going to create anything skills tools I start it with build this so that any agent coding platform can use it. And so when I built this award tool thing I built it in Codex and then I immediately tested it in cloud code and it worked. But it worked because I told it because if you don't it's going to say use the function X to go use a browser that is a codec specific function. And so for health data specifically you kind of have a few options. You can download all the data yourself for the historical and maybe rely on other things for upcoming. I don't love that a lot of these sites have logins. You can just put your login into one password and let it do it in the browser or Apple Health syncs to all of these platforms and so I have something that's like it's so silly but there is a shared Dropbox folder that is on my computer that has access to all my health stuff stuff. And then in Apple Health you can export and you could even. There was a shortcut related thing you could do but you could just go into Apple Health and export it once a month and it pulls from a lot of the records in all these other places. So you could use Apple Health as the hub. You could. And this is like sounds daunting to most people, but it probably isn't. You could build an iOS app that literally all it does is runs on your phone and syncs with Apple Health with the one permission that it just gets access to all your health data. So that's an option for when I collected this I kind of did it a little bit more manual out the gate but you could choose how much you want to give it access to. Can it log into your Stanford My health account? Like is that that big of a Deal? I don't know. You gotta decide.
C
I mean, I'd be fine with it. I just don't want to give it access to my primary one. Password. Vault?
B
Yes. So I have a service where, like a.
C
A separate vault. Separate vault with a separate service worker account.
B
Yes. Smart. And so the computer can access that vault and it can only read it, can't write it, so it's not going to override something. And that's changed a lot of the problems of, like, like, oh, I need an API key for something. Put it there, Everything can find it.
C
Okay, so now let me ask you a different question. Take me to the edge. What's not possible today that will be possible in three or six months? Presumably, like, what do you wish you could do that you can't? Or do you feel like you're just, you know, now you're looking for areas in which to apply this stuff?
B
What's not possible is to not waste time because these LLMs make so many mistakes. Okay, like. Like last night, one of the things we haven't talked at all about is how to manage this, like, on the go. If I gave people a lot of advice. Install Codex, install Claud. Then, like, what do you do if you have a question when you're on the road or something like that? Claude has a cool feature, remote control. That's like, better than it was when it launched. Not perfect, but you can start a thing on your computer and, like, control it remotely. Then they built a thing called Dispatch, where you can actually just like, tell your computer to start doing a thing remotely. At the end of the day, it requires some persistent device on. That's going to be an offering of all these tools in some period of time where it's like, if you want a thing that can run all the time, like, you can have a Google Cloud server, you can have a Claude cloud server.
C
You've got your codecs connected to remote terminal sessions as well. Right?
B
So this is where I was like, I can't believe this happened. So I went to ChatGPT and I was like, I really love how if I'm using Claude code on my computer, I can go to my laptop when I'm not home and control those sessions, or I can go to my phone and control the sessions. How do I do that in Codex? Because I hit my quota, I'm switching to Codex for the rest of the week. And it was like, there's no good way to do it. I was like, here are three articles about other people doing it. It's like, well, there are some ways to do it, but maybe they're not the way you wanted. Here's the limitations. Here's the. So what I had done was. And this is probably a little bit farther than most people need to go, but once you have a. Like a Mac Mini or any computer setup or any desktop computer that can run all day, if you're on another computer, I find it's better to just operate on that. And so why it's not enabled by default, I don't know. But there's this thing called Remote Connections on the Codex app in Codex that you have to add to this config file. And if you just Google, like, Enable Remote Connections, it sounds like it's a beta thing that you can enable enable, and then it lets you do what's called SSH for people who don't know where you can connect to another computer and run on that computer. So I did that, and I was like, I'm doing that. I would like to control that from another computer. And it's like, you can't do it. And I spent 20 minutes arguing like, there's gotta be a way. Like, just fundamentally, this should work.
C
I like this experience.
B
I gave up, and I was like, you know what? I'm just gonna create a new session from the other computer. So I go to my laptop, go to the config, I enable remote connections, and then. Then I put in all the details to connect to the computer, the Mac Mini in my house. And then it's like, what working directory do you want? And I chose this directory that's like my agent directory, the former Open Cloud directory. I choose that directory and then all the sessions just show up. Like, they're just there. So one of the challenges, the things that it's tough, is this whole context thing. I even told it to research it, and it couldn't figure it out. So I don't even know if that could have been solved. But a lot of times you'll ask a question like, can you do this thing? It's like, well, I don't know. Like, it doesn't know it can do the thing thing because it's not in its context or in its model. And so I think what I can't do now is operate in a world where models are, like, able to figure things out. And so you've got to give it so much instruction. And so it's like, if you think of when you've hired people, it's like when you first hire someone, you can't expect them to do the work. You have to show them how to do their job. These models can do things that employees I've hired have never been able to do, but sometimes at the fundamental level, they can't. And so they need a lot of instruction. And so when they're able to figure more out on their own through trial and error. And I think we're getting there. Like the browser use stuff where it's like, oh, I can't figure it out. Let me look, let me take a screenshot. Let me see if it's there. Let me move it around. We're getting there. It's just getting better every week.
C
You know, it's interesting. So Aaron Levy talked about this on X last week. Do you know what the Gelman amnesia effect is? Have you ever heard of this? So I'll describe it to you. So the Gelman amnesia effect essentially is when you read an article in the newspaper about a topic that you really understand, and you're like, this isn't right at all. And then the next moment you read an article about a topic you don't understand at all, and you assume it's 100 accurate. So I think there's a similar thing with the models where if you use the models for something that you're sophisticated at, you're like, ah, it's like 80%, but not quite there. And then when you use the model for something you're not sophisticated at, you're like, oh, my God, Profession X is totally cooked.
B
So terrifying. Example was like in the health stuff, if it gets something wrong and it sends you down a rabbit hole of like, you have, you're not going to know. You're not going to know. And so that is, I think the thing that I can't do is I can't rely on it. And so what I've realized though is, is if you change your framing from like, this is an agent that's going to go do this stuff to this is something that's going to take away the repetitive work I can do, then it becomes really, really interesting. If every day you want it to go check the snow report in three places and send you a message. Every day you want it to go look if this thing's in stock, or every five minutes, go look if this thing you want to buy is in stock. Like those kinds of things that are really redundant, repetitive, great. For the more complicated stuff that matters, like, there is still a component. Earlier I talked about this health thing. It's like you've got data, you've got the knowledge That I like. You've got my questions and then you've got a, like, person that you can trust to help you. Until the models are actually at a place where they can make those decisions. It's like, I still have a doctor. What I want to do is combine all of that and then take it to someone who can sanity check it. And the models are getting better at sanity checking, but you can't just rely. Like, you need to ask them. How many times have you said, are you sure that's right? And got. No, no, no, Totally. I'm so sorry. It was totally wrong. Like, that happens all the time. Time. And so I find that it's better at just repeating.
C
And this is why. Just the theoretical versus the practical. Right. In practice, you still need your doctor.
B
Yes. And I think that doctor will hopefully not need to do as much.
C
That's right.
B
I was thinking about this. When our daughter got sick, we went to the doctor and the only thing we needed to be in the doctor's office for was just for someone to put a stethoscope on and listen to her lungs. And so I said, I Google. I was like, there's gotta be electric stethoscope because we have an otoscope which is like, goes in your ear, in your nose or something. Like, we use the ear one, we take a video, we send it to a doctor online so that we don't have to find out if she has an ear infection. I've now done this enough that I'm like, I now know what an ear infection looks like. You know, like, I don't even need to send it in.
C
Not terrifying at all.
B
But I was like, there has to be like a remote stethoscope. So like a lot of those things. Great. And then on one hand, looking at the full context of a person is something that like a doctor can't even do. Right. Like, we talked about how many things you can keep in memory. Like, Dr. Can't keep your genome and all this stuff in memory. But a doctor also can do a lot that an AI can't and can sanity check it with lived experience. Because they've got that, what, you know, 2.5 petabytes of knowledge in their head about how medicine works. And until we have a. What would it be? Whatever 12 million times a million context window is, we're talking trillions of token context window. Like, it's just not going to have the ability to process on lived experience. Yeah.
C
Or a tiny context window with much better tools use.
B
Yes. One thing that I think is also worth talking about briefly is like the cost of this stuff. So right now there's basically the free version, the 20 version, the a hundred dollar version, and the 200 version. And then there's the pay for the tokens version. And almost every consumer that I know is using one of these monthly plans and that might be the free one. They might just go to Gemini, ask questions, and that's fine. I think it seems like we're at a point where these companies are starting to realize that they can't just lose as much money as they are. And so for people that aren't aware, like if you fully max out your 200amonth plan, my best guess is that you're using roughly 10 times as much usage as you're paying for. And I know this for a couple reasons. When I was doing this Codex thing, I finally got it working and I was like, oh, this is awesome. But I didn't realize that when it connects to the remote machine, it uses whatever login is there there, which didn't exist. So it used an API key. And for like 30 minutes, I was not using my 20amonth chatgpt plan. I was using the API and I used like a hundred dollars in 20 minutes. And I was like, wait a second, I've done tasks like this. It wouldn't have even used my $20 and instead I spent a hundred dollars.
C
There's definitely subsidization that's happening. There just is, you know, I mean, there's going to be a lot with token economics. It'll be really interesting. So one, as you're pointing out, I think some of the big labs are subsidizing. Many of the big labs are subsidizing coding tokens in particular. Then I think the second thing that's going to be interesting is just like, what is the value of an incremental token? So a token that you use to click a button on a webpage, probably not that high value. The token that you use to unlock some tax insight that saved you $100,000 hugely valuable. And yet both of those tokens cost you the same amount. And then there's also, by the way, other things coming like local inference. So what happens? Maybe we're all going to have a big GPU rack in our closet and there'll be a subset of queries that get routed there and it costs us the same as electricity. So there's a lot that you know. Well, let's see, let's see.
B
I, I only say this because I have two Neighbors one side of me and across the street and we're all like nerding out about this stuff and we're like, should we just buy like two massive studio Macs and then we'll have like our neighborhood local computer and we can actually like hardwire to it. Yeah. And so we ran the numbers and like the break even is like years away. Unless you are constantly using these things, it's years away away. And with the subsidies, it's also years away. If the subsidies go away, it becomes interesting. And I think we have to realize something that I think is really funny. So right now on most weeks I never hit the max on my $200 a month plan. And so every question I'm asking, I'm asking the most expensive best models. So right now it's like I'm asking Opus 47, I'm asking GPT 55. And you know, as much as I've argued that this is not humans, if we equate that back, it's like if I needed someone to wash my car, I'm not going to call my neurosurgeon to come in and wash my car. But effectively, like I really struggle with like what task do I want to use that? Like, I don't want to use the dumb model to like do anything yet. But we're going to have to.
C
It's a funny human quirk. So I use a bunch of the browser plugins. So for example, when I'm using the Cloud one one, I never want to use Haiku. So I'm like, oh God, that's the dumb model. You know, Even though haiku is far more sophisticated than 37 from a year ago, which was the most cutting edge expensive model.
B
That's so cool.
C
But something about using the third best model just like irks me. I'm like, I'm not a third best model guy. I'm a first best model guy. You know. Agreed. That needs to get reconciled.
B
But when doing what I'm doing goes from costing 200amonth to $2,000 a month.
C
Yes.
B
I've got to make trade offs. And I think as someone who's thought about this a lot, the question I have not found a good answer to.
C
Yes.
B
Is what is a good practice for model routing? And I think someone's gonna have to solve this. It's gonna be like when I ask a question, it's like we're gonna infer what level of effort you want to take.
C
Well, cursor and others try to do this already. Like I think there are many that are thinking about this routing problem. I also think that.
B
But it's not a consumer problem yet, because all the consumers are on some subscription. Yes, they're getting subsidized and they don't really care until they start using it like an enterprise.
C
So the only slight pushback I'll give you is that I think today we think of these things within our existing technology budgets. Just like I think many enterprises are like, okay, the token budget goes in it. But the truth is that these tokens are going to start to eat into our spend in every other part of our life. It's going to eat into your healthcare budget budget, into your parenting budget, into your finance budget. So I do think the ceiling on how many sort of dollars of tokens will be consumed by consumers is going to go up a lot. But also on a per token basis, consumers are going to need to get more efficient at some point.
B
It's crazy because I talk to my parents regularly about random things. And my mom, when it comes to how much money she's willing to spend on, I don't know, a meal or a flight or a hotel, it's a lot. And then I'm like, you should try this app. It's like, $3. No. Why would I pay $3 for an app? That's crazy.
C
That's right.
B
And so I think we struggle as consumers. Businesses are really good at this. Consumers are really bad at paying for time. It's like, oh, I want to hire a lawyer, I want to hire a doctor. I have to pay 300 to ask a question. And that's gonna get harder with these tools because you're like, well, I get, like, a pretty good answer. Or do I want to? Like, the delta between the answer you get and the cost is, is a lot of. I think we're gonna have to realize that as human tasks get replaced by tools, we should pay for them. And I had this moment the other day where I was like, gosh, 200amonth. So I ran out of tokens for the week. Not even the week. I ran out of tokens for the week. And so I upgraded my chat GPT from the 20 to the 200, as you should, so I could keep going. And I was like, this is crazy. I'm spending 400amonth. And why am I spending 400amonth? I am massively trying to automate all of these things this human did that cost thousands of dollars a month. And so I had this moment, and it was easier because it was more of a business thing. Right. Like, I was replacing a cost with something versus buying time. And I think buying time is harder for us.
C
And it's not just buying time. Kevin and I were talking about this, like, I think entertainment budgets, you know, so how many of the things are you building because they're fun to work on versus they're important to work on. And sometimes it's hard to tell, but I've definitely built a bunch of things that are just fun to work on. And if those things had to eat into my bottle of wine budget or my movie night budget, that would be fine because it's entertainment. So I just think that every dollar of sort of disposable income on the consumer in the consumer's budget is sort of up for grabs, at least on the digital side. Models might even eat into your restaurant budget at some point. We don't have a good mental model for how much value these things are going to create and how much we're going to pay for them in the future.
B
Yeah, I think that is something we're going to have to figure out. Until then, I'm glad that they're being subsidized as consumers.
C
Yes.
B
Amex and ChatGPT, they've got this like 300 a year credit. Really? So if you have a business Platinum or a business Gold, they have announced it. Part of the reason I hadn't upgraded my stupid Chat GPT is because I was waiting to use it because it was like, it's been announced, they said it's coming, it hasn't come out. So you get 300 bucks a year for every Amex Platinum, Business Platinum or business Gold, it's a business thing. So I don't have to hit on this in another episode really quick. Unfortunately, unfortunately, it's only for ChatGPT business, which only is effectively the 20 month plan. So, like beyond the 20amonth plan, I've got to pay. So for anyone out there who's like, I'd love the 20, 25amonth chat GPT plan, you've got 300 bucks a month. You do need to pay for two accounts. So, like, find a buddy. You guys sign up for a business, you set up two accounts. One of you pays for the first 300, next person pays for the next 300, and you go back and forth paying monthly. And there's a way to get value out of that. Or maybe you have two.
C
Maybe my open spot can do that for me. God.
B
But unfortunately there is no. And I get why there is no like 200 subsidized plan for businesses. So the Amex credit sounded really cool when I was thinking, like, gosh, I've got five business platinums and business golds. That's $1,500 of open AI credit. It is $1,500 of open AI Credit that can only be used on a $25 a month plan.
C
Just need 25 buddies.
B
I need 25 buddies. Let's make it happen.
C
Awesome.
B
It's gonna be fun. I'm having a lot of fun, dude.
C
I mean, the best ever. People are talking about, oh, token maxing. They're not sleeping, they're working harder than ever. Like it's a bad thing. But everybody I know who's deep in it, they're giddy, they're having more fun than they've ever had. And their digital lives, their professional lives, their personal lives, it's just a lot of fun.
B
I would say my exercise has taken a dip, but I feel like a kid when, like the new Zelda or Mario game came out. The only thing I hate is that it's not as social. But, you know, you and I, last night we'd hopped on a little call for a little bit just to share what we're working on. I think like, the more we can make it a little bit more social, but like, I don't know, it's just so much fun. I feel like a kid again, just getting to build fun stuff and enjoy. And I'm watching in some of the groups I'm in with other people who aren't deep in it with tech. Like, they try one thing and it's like, oh my gosh, I can't believe I could have done this thing. It's just simple tasks, hard tasks, tackle this, do that. It's never been easier to automate things. And the thing I'll close on is if you heard me talk about open clothes a couple months ago, you can do pretty much everything I described then in terms of function, maybe not in terms of flair, but in terms of function with first party tools without any getting, like building custom anything. And that is really cool. And I haven't played with like Perplexity Computer, which is supposed to be similar, but I don't think it works with any of the coding plans. So I'm like, you know, like it's. It's crazy that if you want to build tools that use AI models, if they're not first party tools, the tokens cost 10 times as much.
C
Yes.
B
And once that shifts, it's great for consumers because we'll get all this choice of tools.
C
Yes.
B
But right now you'd be crazy to go use non Claude OpenAI Tools or tools that integrate with them because the tokens just cost ten times as much.
C
Yeah.
B
So this has been fun.
C
What a blast. Chris. Well, thanks for having me. Let's do it again soon and yeah, and let's like check in in three months and see where we are on our journeys.
B
That sounds good. Thanks, Chris. Cool.
C
Cheers.
B
Some Follow the noise Bloomberg Follows the money Whether it's the funds fueling AI or crypto's trillion dollar swings, there's a money side to every story.
A
Get the money side of the story. Subscribe now@bloomberg.com
B
security program on spreadsheets New regulations piling up, an audit dread. It's time for Vanta. Vanta automates security and compliance, brings evidence into one place and cuts audit prep by 82%.
A
Less manual work, clearer visibility, faster deals, zero chaos.
B
Call it compliance. Or call it compliance. Get it join the 15,000 companies using
A
Vanta to prove trust.
B
Go to vanta.com calm.
Host: Chris Hutchins | Guest: Anish Acharya (Andreessen Horowitz)
Date: May 13, 2026
In this episode, Chris Hutchins and longtime friend Anish Acharya dive deep into the evolution of personal productivity and life “hacks” through custom AI agents and tools that are rapidly replacing the high-maintenance spreadsheets they both once relied on. With their characteristic blend of technical curiosity and pragmatic maximization, they share how these AI workflows are transforming how they manage money, travel, health, and everyday efficiency, and debate the broader implications for society as these tools become more accessible.
Chris has moved beyond “openclaw” (his previous favorite open-source agent framework) to using Claude Code and OpenAI’s Codex, optimizing for platform agnosticism and flexibility. He warns about the risks of single-platform lock-in given how fast the AI ecosystem changes and subsidies disappear.
“I want a platform that allows me to not be tied into one thing... People are pulling the plug. People are going to raise their prices, they’re going to stop subsidizing and I want my skills to work wherever they are.” (04:41, Chris)
Discussion of agent design: Chris started with “subagent per topic” (finance agent, travel agent, family agent) but now realizes with LLMs, it’s less about ‘personified’ agents and more about dynamically feeding context and instructions, since the agents themselves are contextually identical.
Chris details his “agentic” OS:
Modern first-party tools make agentic workflows extremely accessible, removing much of the developer pain once required.
“You don’t need to code anything... most people can have a really sophisticated and simple version to actually add value to their lives.” (16:14, Chris)
Anish contrasts his own less-productivity-obsessed approach, emphasizing AI coding agents for creation. He observes two “breakthroughs”:
a. Coding agents (2025),
b. Personal agents (‘openclaw,’ 2026).
They agree that personal AI ushers in a future where every consumer can live an “optimized, fully hacked life,” closing the gap between life-hackers and the average person.
Chris pushes back, noting that different depths of knowledge and access (Discords, Telegrams, podcasts) still create “secrets” not easily accessible even to AI agents, and wonders if total democratization is possible.
The conversation is energetic, nerdy, transparent, and peppered with friendly jousting. Chris and Anish alternate technical analysis with practical advice, often acknowledging the limitations and deeper social/philosophical questions while maintaining infectious enthusiasm for how automated, agentic workflows can empower anyone (not just die-hard spreadsheet nerds). The tone is humorous and self-effacing; both hosts admit to “token-maxing” and “productivity junkie” tendencies, but also focus on the broader, democratizing implications of what’s coming next.
Summary Usage:
This episode is a treasure trove for anyone interested in practical AI, personal optimization, or the evolving intersection of technology and everyday life. It offers tactical insights, architecture breakdowns, societal reflections, and a glimpse of where the “AI life OS” goes next.