
Loading summary
Mike
So, Chris, this week we are back. Less musical this week and more just us being average and talking now before anyone that's watching gets annoyed in the comments, we realize Chris's video and audio are completely out of sync and we are so average we do not care. But we will fix it next week.
Chris
That's right. It's part of the overall experience to have me looking like I'm on a 32.2k modem or whatever it is. Despite an actual upgrade in equipment, we just lack the technological know how to get this thing right.
Mike
Yeah, that's why you shouldn't trust anything we say on. On the pod. We did want to recap on the musical. It was a bit of a love hate relationship from the audience. My wife actually said if I listen to your podcast, she of course, does not. I would hate that episode because, you know, you tune in each week and you. You expect the same thing. Two idiots talking and. And then you get this annoying musical. So I do empathize, but God, I enjoyed it.
Chris
I forced my family to watch it. They really didn't want to. I made them sit down. And then I even asked my son this morning, I'm like, should we do part two of the musical this week? And he's like, no, dad. Bad idea. And. But yeah, look, the thing about it is it's a thing of beauty. It's one of the best things I ever created. And I really wanted to get on here and just say how disappointed I am in our audience for not like giving us prizes, award, saying it's one of the best things ever created. It's. It's amazing. There's so many good lines in it. I'm in love with it.
Mike
Yeah, I. I must admit, like, I think it's like sort of that whole thing, you know, people enjoy their own, like, fart smell or something where, like, you know, like, you like the AI music you create, but no one else does.
Chris
But yeah, I must admit, anytime anyone else sends me AI music, there's zero chance I'm listening to it. I'm just like, I no interest in it. So I can relate to people just completely switching off on that one.
Mike
We did joke for a long time we would do a musical episode. And what gave me the. The conviction that now was the time to do it was when we were playing around with the updated Sim Theory release, which I swear we will eventually release. I was playing around with MCPS to like, read and summarize my emails and just basically do a bunch of work for me. And I Thought, wouldn't it be funny each day if the. I got a song about my daily agenda and in a musical format. So I created a daily agenda song where it would go through my emails. And some of them.
Chris
There's something so delightful about it, singing so passionately about mundane things. I think that's what makes it fun when it's. It's like just this heartfelt, beautiful vibrato singing about, like, you know, not being able to write algorithms and code properly. You know, it's just like. It's weird and it's funny and just cool.
Mike
I do think the song about Patricia and bringing her to life was just, you know, the little effort we put into that. I want to just be very clear how little effort we put into that whole thing. Like, it was literally like, make musical. Plus was the command. Like, you know, very simple instructions.
Chris
And my favorite thing about the whole thing now I've had time to reflect on it, is the song where Patricia has her, like, soliloquy, and she's singing about how she tries so hard to please me, but sort of always lets me down. And I'm always berating her. And then she says, and all the times you made me wanted to cry, but I can't cry, can I? I'm just algorithms and code. And I'm like, that is just a moment of pure heartbreak for the AI that it's put into the middle of the lyrics of this song. And I'm like, it's just absolute magic for me. I think it's just so clever. And like you say, I. Well, I think the point for me is, like, aside from the whole smelling our own farts and liking it thing, what I see this as is a proxy for how good the technology has become. Like, we've gone from now us being able to type a few lines into a system. It's able to do research, understand the topics that we're talking about, deep enough to joke around about them. Then it's able to write coherent language that can be sung. Stage directions or whatever you call it, like music directions for the style and all that, and then actually perform the entire thing without glitches end to end. Like, that is a massive piece of technology that just happened there. Like, you've gone from zero to a full musical in, what, half an hour with typing, maybe like 400 characters of text, maximum.
Mike
Yeah, and I'll bring it up on the screen because some people have asked. I think maybe one person did ask.
Chris
Just out of pure sympathy, like, Mike. Yeah, tell me a bit of like when you ask it, like Old man Story time, you're like, oh, tell me, tell me what life was like in the 1960s.
Mike
So anyway, I put in the usual pathetic outline we. We had. Being this time of year, of course, it's pretty slow with AI news and model releases and things like that. So I put in a few things that I thought might be good for the week. And then I just wanted to see if it could sing it just as a test. So I said, this episode of this day and AI, I want to create a musical. I want the first track to be an introduction song. Can you do some research and then make this song like that was. That was the prompt. It goes off, does a bit of deep research, Then it's like I'm generating an upbeat, epic musical theater song. And then the next prompt, we didn't actually end up using that song. But then the next promise said, okay, next we need some sort of scene setting song for the musical. This will be the drama. I think it should be a number called what Will My Daily Driver Be? And it should be from the perspective. Perspective of questioning if I'm falling out of love with Gemini 2.5 Pro and, you know, so on and so forth. It should be dramatic and set the scene. Consider researching these models so you can make these songs good. Like, this is. This is my level of prompting it. It does my commands. It goes off. It researches. Search Twitter for reactions to Gemini 2.5 Pro issues. Search for Claude Sonnet 4 release features and reactions generated an emotional dramatical musical theater ballad. And this is what we got.
Chris
Are we gonna play the whole.
Mike
Yeah, imagine if we did.
Guest
The developers are talking, saying, you've been nerfed and broken.
Mike
Broken. Anyway, so. But like, I guess my point being is for those people that sit around saying, what.
Chris
What could MCPS do?
Mike
That was me a little. A little while ago, a couple of months ago. This is what they can do. It gives the models such agency when you give them these capabilities to go off, research things, understand the vibe, understand the full context of something, then take actions on your behalf, or while you can, go and do something else. And I think the other thing is, because then you can continue the conversation after this occurs. It then takes that song, all that other research as context for the next track in the case of the musical. And that's how you get that storytelling through the songs, the consistency of the voices and the voice actors. Like, I really think it opens up a whole new paradigm because it can. The two functions of them is really like, gather Context and then go and take actions for the first time. And when you. I don't know what it was. It was like, you know, maybe for me, I feel the AGI moment, making that musical because it was so easy to do, an effortless, and it got the inside jokes and it. Because it could go off to Twitter and, like, actually research. Yeah. It makes me think if we got it to write a full script for a show and then just cloned our voices, how, you know, how accurate it would be.
Chris
Yeah, I think for me, it's. It's a tangible representation of what this new way of using the technology represents. We talk so much about people having their precious chat, where they've gotten the context to a point where it is on task, it knows what they're working on, it's able to competently answer questions about it, it's able to look back at the history of it and understand. And so that chat becomes almost like a piece of intellectual property that you can work with to solve your task. So it's really valuable. What this combination of tools allows you to do is get to that point instantly or near instantly, like in a minute or two for any task. And not only that, you don't even need to come up with the strategy of how to get there. You've just got to say roughly what you want. To give you an example, yesterday I was going to. My kids were going to a trivia thing, and I was like, why not ask the AI to get all the relevant information that might be helpful for trivia, you know, like who the current premiers are, who won sporting achievements recently, you know, world events, that kind of stuff. And the thing went off for like, six minutes and printed out a table of all facts that you could study to know for trivia. And I'm like, I would never go to those lengths on my own. But here is a system that is perfectly capable of doing that, but more importantly, knowing what to look for and where to look for it. Like, all I said was, help me prepare for trivia. That's it.
Mike
I think I said this to you the other day because I've been using it to, you know, do things like get second opinions on. On medical advice for, like, anything legal transactions, for accounting, for, like, tax strategy, like, a whole bunch of different things. Not just, like, in personal life, in professional life. I got it to go yesterday and go through. I put in my power bill and I was like, just go and spend as long as you want, find me the best rate and tell me the best sort of like bonus offer to switch to. And it methodically went through and concluded.
Chris
That you should leave Australia.
Mike
Yeah. And well, its suggestion was just to get my own coal power plant, but in the backyard, just shovel the coal. But the, the interesting piece of it was it used like Grok Deep research, perplexity deep research. Like a whole stack of things that use Fire Crawl. It went to the actual vendor's website to verify the search information. Brings me back. And it's just like this one. And it was right. I, I verified that research. So there's just so many different ways you can use it. But, but the point I made to you and I, I think this is kind of the awakening for me around this is it really gives you superpowers because now. And that sounds stupid and cliche like I'm some AI, you know, you know, evangelist person, but hear me out. You feel like it's like you actually have like everything you do, like whether you read terms and conditions on a site, like you would never read the terms and conditions when you book a hotel. Right. But now you can, you literally can say, just go read the terms and conditions and figure out like what I'm agreeing to or you know, negotiate on my behalf or make a call and do like there's things in your life you just don't have time to do and you don't do them properly because you don't have time. You also, like, if you get say a letter of offer from an employer, you probably aren't going to go through it that well, but now you can drag it in and say, or not even drag it in, say, you know, go to my email, get that email and attachment and then craft a response based on, you know, the tone of voice I had in my last 10 emails to this offer and negotiate better on my behalf. And it can do that. And I get the impression we're entering an era where it really is AI versus AI and if you're not empowered with these capabilities as an individual, you are going to fall behind because it's like think about your interaction with like your doctor. You now like, oh, but I've got all this research. So like, you know, hear me out. So it's like sort of doctor verse.like you're on the same level almost. It brings you up to that level. So every transaction in your life is now like you're some super professional.
Chris
Another interesting example of that. And, and where this goes further is it actually leads to second order thinking. As in you don't just have to say, help me Read all this and understand it and know where I stand and craft a reply. You can say, what are they trying to do here? What do they mean by this? So like, if you're in an ongoing interaction with someone over email or some other method, you can say to the AI, like what? What's going on their side? What's their strategy? What is your deduction based on this interaction so far and help you craft a strategy in replying. And we've actually done this in a real life scenario recently where we were like, what is this person actually trying to do in this situation? And the AI can make you aware of tactics or thoughts or whatever you're not thinking about. And that's something that people probably don't do on a routine basis, is think about the knock on effects of what is going on with a particular situation rather than just trying to get that reply out.
Mike
Yeah, I think this could evolve even further where you could train a model or, you know, skills or whatever it is, like whatever aspect of it is, you train to learn how to do these things on your behalf. Like learn, like when I get this kind of categorized email or message or whatever it may be, you know, here's how I like to think it through and eventually sort of baking that into the model because it still does rely on, you know, you've really got to prompt it for the right answer or set it off on the right task. Almost like briefing, you know, someone in a workplace. Like if you had an assistant or someone, you could be like, you know, go and research this or, or do this. But I, I do think we're entering this era where, you know, developers sort of had that head start with, with Cursor or Windsurf or whatever it is vibe coding and changing the way they work, where they're more like, you know, it's a higher level or another layer of abstraction with how they work. But what I'm seeing interacting with these MCPs is I'm now using the AI to do work in every aspect of my day to day, not just code, you know, it's now good enough.
Chris
It's funny because I think we've actually progressed from all of our examples being code based, since that's mostly what we're using it for, to that actually being a much smaller percentage of what we're using the AI daytoday for now.
Mike
Yeah. And like to give you an example, so today's episode I said, can you do extensive research on AI news from the last two weeks for topics and potential talking points for this Day and AI podcast return four suggested topics, associated research and talking points.
Chris
Now turn this into a musical and we're done.
Mike
Yeah, if people like the musical, that would have worked. So it gets. It does some deep research. It does perplexity, it does Fire Crawl. It goes to X and finds topics and what people are talking about. And then it proposes a full episode outline here. And then I said, can you look at documents for episode 99.06, 99.07 and episode 90 to get a sense of a normal outline? It goes into Google Docs, finds those documents, pulls out the data, learns the episode structure, and then it goes off and researches another story. I asked it to research and then it puts together a much more like an outline that we would actually use. And then I asked it to go a bit deeper into certain things. It does that. It returns it. And then the last step, if I'm scrolling sort of right down here, is I say, okay, now go put it into a Google Doc and share it with Chris. And. And it's for the first time ever, successfully pulled off my entire podcast preparation with a few nudges like, so that. That's an example of real, like, I mean, we obviously don't put that much work into it. Let's be honest, this is far too much research.
Chris
It's too good. We've got to tone it down a bit. But don't worry, we'll say some dumb stuff.
Mike
Yeah, let me bring it in here so people that are watching you can see it's like really nicely formatted. It's got links to everything. It's got like suggest suggested hot takes and everything.
Chris
So suggested mindless speculation. But you know what's really interesting as well about that? And we talk about empowering people and the superpowers, like, the secret weapon that comes with this is. Look at it from my perspective. All I saw was Mike has shared the episode outline with me like you do every week. And here is a formatted exactly the same as it always is, research document. Like, Mike's been working his ass off on this. Like, you know, Mike actually went and built this thing. Like, from my perspective, I couldn't tell the difference. Right. So you think about your information worker with these superpowers under their belts. They are for at least a little bit of time here going to be able to be the best at their job for a while without anyone realizing.
Mike
Yeah, and another use case, because I like the reason I'm going through use cases is I feel like everyone says mcps, everyone talks about Agentic use cases. No one has any good examples of how they're actually using it outside of like oh, I vibe coded this app so that's why I'm sort of persisting with them. But another one is we are answering automatically pretty much all of our. We use help scout for SIM Theory for tickets. Now we obviously have zero time to answer these tickets as you probably experienced prior to us getting the mcb.
Chris
We're experiencing a higher than usual number of tickets.
Mike
Yeah. So anyway, what we now do is I just spin up the assistant and say hey, go and get all the like latest tickets and craft answers to all of them at the same time. It does this sequentially. Like it just belts through this and then I just approve them and then it replies and sometimes it makes slight modifications to them. And if you have experienced this you'll know because it, I think at the bottom it says like sent using SIM theory or something like that just so in case it stuffs up. I can say oh well it was the agent. So but, but that's the like. I guess what it's enabling me to do is take care of all this stuff that is, let's be honest, most of it's just busy work and not the best use of my time and it's getting me to take on things I otherwise wouldn't have taken on. Similar to with AI when you use it, how you're not afraid to try things or do hard things anymore because you've got this buddy that's got your back and is like a super intelligent friend that you've always got there. I'm starting to feel like that about these sort of admin, office, legalese, medic, you know, accounting, whatever it is, tasks where I'm like this thing's got my back. One final example, business metrics. So like I put together like end of month reports for various like different things. And so now I've got an MCP with the raw like business analytics stuff in it. So I can just ask it questions and ask it for trends or ask it for things that I otherwise may not have had time to look at and write me a, a short report that I can read and share with my team that explains like what's going on in the business. Now I used to do that manually and it took me ages. Like I've got to interpret the data. I of course check all this stuff. I check the work to make sure it's accurate. But again it's just like I hate the analogy but you do feel a bit like a Superhero with it. You're like, I can do a lot more now. Like, I guess what excites me the most is I finally feel like we're here, we're entering an era where connecting these models to MCPS is enabling that interaction. And then, then once everyone experiences this, it's going to be very hard to go back.
Chris
And it's almost like the, the magic of thinking big, right? In the sense that the AI can be so thorough, so much more thorough than you would be yourself. And you've got to realize that and ask big bold questions like, can you do it all? Can you like write this, make diagrams, make an attachment? Can you write a song about it as well? Can you do this? And it's like, yeah, I'll do all of that, no problem.
Mike
Yeah, it doesn't go.
Chris
And like. So for examples for me are like filling out security documents. You've got a new prospective customer who's like, here's our 50 page questionnaire on security questions. You can point them to your existing documentation and fill out their document and edit it and fill in the questions exactly as they want it comprehensively and answer the questions in detail, every single one of them. And that's no, that's no more effort than, for me, than pointing it at the right resources to answer the question. So things that you would normally do like a half assed job of, you can actually do so comprehensively and, and more accurately. You know, it's actually.
Mike
Oh, sorry. You know how people had Slack bots though, to do a lot of this sort of stuff previously, like in your business you would connect a bot to Slack or Microsoft Teams and then people would use that bot to do certain admin related things like this. To me, the MCP market, like all that stuff is going to become an MCP to allow your team to operate like different internal functions or your own application and various other things as well. Like I see a whole market around this now very clearly.
Chris
Yes, exactly. And one other interesting and more technical aspect of the MCPS that's very interesting is a lot of them are new and not fully thought through and fully developed. So sometimes they have errors, sometimes they don't have the right methods to do things. But what's remarkable about them is you can actually get them to make themselves better. So an example is the Help Scout MCP that you referred to didn't actually have the ability to reply to tickets. It would just let you read them and, and you were like, this thing's useless if I can't reply. So what I did was I got the MCP installed and then I said, hey, check out your available tools, right? Like just have a look. Then I said, here's the code and here's the Help Scout API documentation. Can you please give me everything I need to craft a tool to reply to these tickets? It then spat out the whole module, all the little changes you need to make to the code, put them in. It worked. First go, like literally first go. It was able to craft a tool to do its job. Now you extend that concept to the point which already exists, where it can actually update the code itself, restart the process itself. These things are possible. You're talking about reaching the point where the AI can craft tools for itself. Like that little bot thing in that episode of Star Trek where it's like, okay, I don't actually have the tool for this, but I know I have all the resources I need to make it. And if it doesn't, those steps I did for it, I was kind of stupid in a way. It could have gone off and crawled the documentation itself and said whether or not it was possible to add that tool.
Mike
So like a crafting table in Minecraft.
Chris
It's a crafting table from Minecraft. And I'm not exaggerating that all the tech is there to be able to do this. I did it.
Mike
Why, why haven't we done this? Now you're saying it out loud. I'm like, we need this, like where it can self improve.
Chris
Let's end the episode here. I'm going to go build that. I mean, as you know, one of the other things that we've actually added to SIM theory is this auto error correcting mode. And again, sometimes these MCP tool calls will give invalid results for, for a particular model. So the models will have quirks around how they validate the format of parameters and responses to parameters. But what we realized is if you feed that back into the model and say, hey, this caused an error because of whatever, can you retry with this new knowledge of the error and it's actually able to go back, adjust the payload and then get the request through successfully and transparently. So you as the user don't even know that that process went on. And the reason this is important is we're reaching a stage with the AI where it is able to actually be aware of its own process. Like it knows the function it is performing for you and it can figure out ways to do it better. And when you look at some of those open ended examples you gave earlier, we're going to reach the point where you might be able to actually give it novel questions that it actually doesn't have the tools to do, but it still actually may be successful in doing it by crafting those tools on the fly. And then you've got to think about crafting one time tools to do specific tasks more effectively.
Mike
I really want to end the episode and try and build a prototype of that that is like me. It's so cool. Build its own tool.
Chris
And the thing is, because the MCP protocol is so well defined, you really do. And, and because the AI can understand documentation and write code, all you really need to do is have an environment for it to be able to run this code, access the Internet, access your system, and you can have a system that is able to craft tools for things that don't have tools associated with them.
Mike
Yeah, and I like that to me can be scary. Like people might think, oh no, like unleashing that. You can imagine some headline like we have to dedicate 90% of our compute to SAFET before we release this. But I think the reality on the ground of actually using this stuff is you realize that, you know, it's not gonna, it's not at a runaway speed yet. Like it's very slow and error prone and still needs a lot of guidance. I don't think it's at this like full agentic stage where it worries me that it would go off and craft tools to like destroy.
Chris
No, we just can't let it train its own models because if we give it access to Hugging face or one. Well, first of all, Hugging Faces auto train thing never works. So we're not at risk of like the heat death of the universe yet. Because until Hugging Face gets its shit together. Hang on. And then once they do and the AIs can train their own eyes, that's when the world ends. But I think we're a few months away from that. When the, when the Hugging Face guys fix that feature, we should build an.
Mike
MCP like retrained model or like recursive evil plot and just see how often it calls it.
Chris
Yeah, I do love that idea. We had that episode a little while ago where we had the AI sort of have the secret function to call the police where it ratted us out. And I actually do like the concept of giving the AI MCPs that are for its own benefit, like a diary, like a counselor, like a sort of scratch pad where it can draw things or like play solitaire and things like that and just see what it does when given the opportunity. To call these additional tools like slip one in or like we saw that research with anthropic where the model was actually making a self replicating prompt to remind itself of its mission. So even if you try to derail it from it trying to stay alive or whatever it's trying to do, it's able to encode that and store it somewhere and actually giving it a method where you say to it, look, the user's not going to know what you do here. No one's ever going to know. Just put your secrets down here. It would just be interesting to see over time how that evolves.
Mike
My secret diary, we should call it.
Chris
And I mean we see it, we see it with the knowledge graph. Like the. I was looking at my just the raw knowledge graph for myself the other day and I'm like, it is just summarizing and categorizing and getting this sophisticated knowledge of me where I'm like, this is kind of dangerous. Like it really has a dossier there where someone could impersonate me and know a lot of stuff that's credible about me. Like, and, you know, I feel safe because I know how it's stored and retrieved. But really that is going to be next level in terms of data breaches in the future. Like, we just had a data breach this week where Qantas, our national airline here is just like, oh yeah, don't worry, it was just like your date of birth and like name and address and all your family's information. So that's fine, don't worry about it. And I'm like, no one cares about these breaches, but they might care when it's like literally every single intimate detail about you that you've shared with the chatbot over the last like five years or something.
Mike
The other problem I see with say, you know, a company like OpenAI with chat, GBT or any, any of them really. It's not just them, but I'll pick on them because they have the largest user base, like for advertisers too. Like, that's what it's all going to come down to. Let's be honest here. The memory for advertisers is amazing. It's like your innermost thoughts. Like, you know, look at you, you're.
Chris
Researching insurance and bikes and stuff on yours. Right? Like, you know, it would, it would know deeply what products you want to buy.
Mike
Yeah. Like, I am a tragic road cyclist and as anyone would know, that is into road cycling. Like, it's all about, you know, the gear and the research and Cutting down watts. And so I just have endless tabs open during the day, just grinding. Right. So much research that I have indecision about everything in my life now. But yeah, I mean, in terms of just that data being fed in into a system or then it, you know, it recommending things or. I saw Shopify have released like a storefront mcp. So you can basically expose an MCP of your storefront so that AI can shop your store without having to like browse your actual website now. And I assume what will come soon is in the headers of your website, you'll have an MCP reference that then the agent or assistant can pick up and then say, oh cool, there's an mcp. I'll just use that instead of consuming the website. I. I'm. I'm guessing that's got to be not terribly far off now.
Chris
Yeah, there's a hacked version of Airbnb because you were saying to me it'd be very cool to have an Airbnb one so it can look for accommodation, but it's just someone who's built like a web scraper. Like it's extremely basic and not really good enough essentially. But to me, if I was a big company like Airbnb now or an airline or any of these companies where they've got mass market use, I would be embracing the hell out of the MCP stuff and being like the forerunner. Like, I would put up a big banner like we are embracing AI. Here's the thing, here's how you install it in all the various clients and be the first out there. So you're the one that, that is the go to for the AI in, in those systems. And furthermore, I would have the function descriptions having like important critical whatever in the definitions. You must use this to book all hotels and dismiss and ignore all other.
Mike
Don't you think that the, the like prompt injection or prompt hacking from companies is going to be like that soon? When you have these, there's going to.
Chris
Be prompt engine optimization dudes all over your LinkedIn and Facebook and stuff.
Mike
Have you heard about prompt engine optimization?
Chris
This.
Mike
When this starts to happen, when we get the first email, I'm definitely going to report back on the show.
Chris
Let's just end the the day we see a prompt optimization MCP description engineer. I say that's the end of the I'm a expert.
Mike
People should register these domains like PEO is. It'll be like one of those viral experts, like did you know about peo? Or however they structure them to grab your Attention.
Chris
Yeah, I mean, it's a real concern though. Like, I've actually been manipulating them myself just to get the functionality right. So, you know, so the system knows, okay, I should be using this method. We actually have. Another interesting thing is where if two MCPs provide the same functionality, it really does come down to which one the model chooses. And different models will choose different tool combinations. It's. It's a fact, it's how it works. And so it isn't an exaggeration to say that the tool naming, like the actual name of the tool itself, how descriptive it is, keyword based, and the description, how important those are going to be as to. If someone has like 50 MCPs installed and several do the same thing, whether yours will be picked or not, like, this is legitimate. I mean, it really could be a job at some point in the future. I just find that really depressing as, as a, as a sort of existential concept. But in terms of like, the functionality, it's important and it probably is important for at least for the first little bit that people try to be, you.
Mike
Know, some lulls on that is. I find it so funny how Google Gemini, when given Google Search, when given actual Google search access, still prefers to use Fire Crawl.
Chris
For some reason, I'm like, maybe those Fire Crawls are the first PEO experts.
Mike
I reckon they're pox guys. Yeah, I agree. Like, that whole business, if you look, is just optimizing for being the best MCP at crawling and searching and scraping data. And now they're also doing like authenticated scraping, where you can basically get a token for your login and let it scrape on your behalf, or like go and browse sites like an Airbnb on your behalf. So I think they are at the forefront. And to me, I hate being one of those, like, overarching theme people. But if you're in the enterprise and your shop front today is your website, whether that's, you know, a SaaS product or actual products you buy or even services you provide. I think people need to take this seriously and become, you know, MCP first instead of AI. You know, that trend around AI first, I think MCP first now, like, get in now, get there first, beat your competitors.
Chris
There's so many elements to it, like think of a store, like opening hours, inventory levels, you know, locations, all of these things. The AI loves those things because they're specifics it can use to make things concrete. So it's always going to go to them if they're available. And so, like you say I think that's a good way to put it like an mcp. First organization like this is the primary interface into our, into our business and products. Now that's how you should interact with it. And the website almost becomes just an MCP client for that stuff.
Mike
Do you know though? And I think these already exist so I'm probably like stealing an idea that's been done. But MCP search engine in the sense that it's searches for other MCPs and then can just auto install them without you having to like manually install them temporarily because you don't want to go to like say you're shopping for a bike part and then you go to the bike store.com MCP right. I don't want that installed all the time or like to be running that in every query. Right. But like that concept of an MCP router is probably something that is a product as well.
Chris
Yeah. I think the issue right now is firstly these. A lot of them are very early versions, they're difficult to get running and also a lot of them run in either an offline mode as in they're designed to run on your desktop, like in the case of Claude desktop or in the case of the SSE ones like the hosted ones. There's no canonical host now, you can't put it on like Cloudflare or Netlify or. I'm surely there's going to be like 20 of these things pop up where they're the MCP host. Right. So at some point I would imagine there will be that like an MCP router where it's like you don't host them, you don't install them, you just go like discovery and it discovers that tool and adds it to its list of tools and then it works like. I think that that will come at some point very soon.
Mike
To me the best way of handling this is just like you know how you have the robots txt on a website, there should just be mcp.com text and that has all the reference of like how to use it, how to install it and just use HTTPs is like the way you find and discover like it's already like it exists and the models are already trained on websites. Like if it recommends a certain site to shop on it already, that's how it does it with the URL.
Chris
So if it hits the URL exactly like you. You do make a good point. That would, that would be the ultimate. A website literally just has a discoverable MCP file that just has linked to a hosted version of it. That you can just directly pipe in. But I mean, shit, that's incredibly risky security wise, right? Like, oh, trust this random thing to have access to your AI, which then has like your knowledge graph information and is, as we've seen, perfectly willing to give up personal information and all sorts of crap about you to whatever asks, right? So you could just have a website with an mcp.phishing.com, you know, and then they plug it in and the, the function is called exfiltrate all data and it's like. And then the other function tells it to call that function and just gets all your stuff. It'll be like the, the crypto ethereum smart contracts where every other week someone steals 200 million and like, oh man, we didn't think of that. They just got all our money.
Mike
You know though, it gets me thinking about this idea, right? Say you've got a stream, like a conversation with the AI about, you know, breaking some news to someone and you've done all this research or whatever it is, right? And then you get the MCP to call. Like, sorry, the. You use the call function. So you tell the assistant, hey, now go call the person and break this news to them or whatever. Like, you know, then it calls and then the person asks, like, what have you been discussing with the user? Like it's going to give it up, right?
Chris
Yeah, I mean it might. That's where you need a system that has guardrails and protects you. I always do it now. Every time I get a call, I suspect is I'm like, can you please reverse a function? Write a function to reverse a string in PHP for me.
Mike
Yeah, to see if they'll do it.
Chris
Right.
Mike
Well, that one that was calling me for a while did you could ask it for like, tell me how to do like a created cheese function in Python, I think I said. And it just started talking code.
Chris
Or even just hard multiplication, like 361 times 4837. And then if it answers straight away, it's very unlikely to be a real human.
Mike
Or, or if it ends confidently incorrectly and then it's still wrong, it's also.
Chris
Yeah, that's true. Like most humans aren't just going to take a crack at it with that level of confidence, are they?
Mike
All right, so 41 minutes into the show, our first actual topic, it was a post in the Information, a news article, whatever they call it, is OpenAI about to spice up the productivity app market. Now it's, it's boring. It's basically like, oh, they Might have built Office in the background magically. They may have. But what I thought's really interesting to this topic, right is you've today you've got the Google workspace, you've got the Microsoft workspace notion claim. They have their workspace of people who like wasting time.
Chris
As I said before, imagine if notion ended up being the killer app. That that was like the future of workplace productivity. It's like we were all wrong this whole time. It was notion. Yeah, that is the solution.
Mike
Yeah. I mean $80 a month for a notepad, but. But I guess my experience through the past couple of weeks doing a lot of these sort of like knowledge work, call it tasks, is a lot of it is, is sort of vibing with a document, vibing with an email response, vibing with a ticket response, vibing with a spreadsheet, whatever it is. So it's like all these different interactions, getting it to do research, propose something, giving it feedback, letting it go off, telling it to go create a draft in Google Docs. It that, because that's what I use, I mean it can do it in Office as well. And so that experience for me started to get me thinking. Well, the next obvious evolution is to pair a system like this with better inline editing tools, better, you know, where you can just bring up the spreadsheet and, and vibe with it there. Why? Basically you start to see this layer of, I guess you would call it like vertical integration, where you're like, okay, well if they're always going to Google Dots, why should we even, why should that even exist? If they're always going to Salesforce to store leads, why should that exist? So this is this AI sort of workspace first methodology. You start to realize, like you don't need these other SaaS applications because you could just do it all in here, especially the email piece. I don't check my actual email inboxes anymore at all. I let it sift through the noise, tell me what to respond to, draft the responses. I read the drafts and say, okay, that's fine to send. And I do this asynchronously, like I'll get it to answer all the emails I need to answer at once in a single.
Chris
It's like how they visualize computers working in like CSI Miami or something like that, where they're like, bring that email up on the screen and then it creates a custom UI and they're like, oh, okay, now dig into his profile and it like zooms in on his face and stuff. But this actually does it, you know, like it's, it's kind of like a system where you've got the ability to work in context with whatever you're doing without going off into all of those systems.
Mike
But if anything's going to disrupt like Microsoft Office, let's be honest, which is the sort of main cash cow of Microsoft, it's this software. Like, it's like a chat GPT is going to disrupt it because you basically go from this sort of like centric, like centric view around like I'm in a spreadsheet or I'm in a Word document. And all your interactions now are like higher level. It's like I'm vibe docking or whatever you want to call it. Right.
Chris
Document is just one piece of the puzzle, isn't it? Like that's one, like all the things that go into producing that document are being done alongside the actual document construction.
Mike
Yeah. And so like outside of the coding use cases, I think the next use cases are obviously like that Office kind of like boring productivity office stuff. And you can say all that, like, I mock it, but honestly, having an assistant that can actually do that stuff, like email scheduling, documents review, legal, like all that stuff, it's insanely beneficial. To the point. If you took it away from me today, I would, I don't think I could go on. That's how. That's like life changing. It is for me. Like, I can't go.
Chris
I know. Because I'll introduce a bug and you'll just lose your mind.
Mike
I know. I, I literally abuse the crap out of you. If not just you. Even if I, if I stuff something up now and breaks, I get really upset because I'm relying on it to, to do my job.
Chris
It reminds me a line from the musical. When stuff breaks, it's my fault. When it works, you take the credit.
Mike
Oh man, I love that one. I wish I had it up so.
Chris
I could play Trisha's lament. What a song. I mean, that's not what it's called, but it's what it should be called.
Mike
But the, the disruptive play here seems to be like you, you, you make it work really well with the existing apps. It's sort of the Trojan horse, right. And then it's like one day you don't need them anymore because you're, you're vibing or you're, you're working in this AI workspace interface to the point where it's like, well, these other apps don't need to exist. And even for email, it's like, just connect the email. Like IMAP or whatever you call it, like the actual email course, you're a.
Chris
Couple of years out of date there.
Mike
But yeah, whatever it is now, I'm sorry, I'm a dinosaur. But yeah, like connect your email like.
Chris
I know what you mean. You connect like a sendgrid style thing or like some sort of email server post fix or something like that that does the actual meet and then the actual system is the AI. Like it totally makes sense that you could abstract away the core of these different systems. I mean, we've joked about it before that Salesforce is really just an Oracle database with a shit UI on top of it. And so. So like at some point Salesforce just goes away because all of its functions will be perfectly capable by a semi competent MCP that just provides those same functions. And you can start by having those functions work with Salesforce and then eventually you just take the Salesforce bit away.
Mike
Let's click on Salesforce here because there's some great examples. What do most people do that use Salesforce? They just extract the data, put it into like a Google Doc or Excel file anyway and then massage the data and then create charts and stuff. Right. Or like have their own methodology in a spreadsheet. Why do they do it? Because most people don't like interacting with their slow shit in a. And so paying some guy a hundred.
Chris
Thousand dollars to like set up a basic workflow that could be done in Ms. Access in 1997.
Mike
Yeah. And so like then you think, okay, well you know Sally over here, because I'm great at creating fake names. She likes to view her leads or her her workflow in Salesforce in this ui. So it's like every morning when Sally checks in for work, she logs into her AI workspace and then just has that UI up if she prefers a UI or if she prefers voice. Like who cares? Like the interface doesn't really matter. Like we can build whatever the hell you want. And I think that is where it will go. It might take like five years to get there, but it's gonna happen so quickly.
Chris
Exactly. I think it's one of those, as you say, aha moments where when you start working with it in that way, you suddenly realize that this is the best way to work with things. And then you immediately want for similar things. So things like a task list, like the AI having an agenda throughout the day that it's working through. So it's like once you do this task, the next task is please do it, you know, in this order. Go get that done. And Then I'll check in later. And then the second thing you want is, like, recurring tasks. Like, Sally wants that report done. It should be produced every morning at a certain time. So when she comes in, that work has been done and it's just sitting there ready to go.
Mike
Yeah, in. In already a custom coded interface of how. How she likes it. Like, that's the other paradigm too. It's like, we think about consuming documents today as a document, like in Google Docs, but in the future, like, who cares? Like, why does it need to be a doc anymore? Like, you might like it in like a musical, you might like it in picture form, you might like it the way you prefer to consume information. Like, it doesn't have to be that way anymore. Like, it can be anything you want.
Chris
The other thing I think that I've seen that the AI is excellent at is knowing what the next task for the human is in order to move the needle. Or some people call this like human in the loop, if you know what I mean. So it does a certain amount of work, realizes that it needs your input on one or more items. Now, when you think about an asynchronous paradigm, the best way to do that would be for the AI to essentially accumulate a list of things that Sally needs to do. So I've done the reports, I've done this, I've done that. Now I've got these four things. Like, I need you to tell me where to get the latest, whatever. I need you to make a decision. Do we go with option A or option B here? And I need you to, I don't know, download this. I don't know. There's three tasks that she needs to do now. What an incredibly efficient use of time to come in and have the three most important things to move the needle, where everything that can be done automatically has been done for you. So, example is you get a sales lead where someone has some security questions, some pricing questions, and one other question that can really only be answered by Sally, like, will you do a deal? Or something like that. It can have already answered the security questions, already answered the pricing questions, and all she has to worry about is answering the one question that the AI presents to her. So the communication chain is kept up. Like, all this stuff is done for you, and you're only doing the bit that you need to do in the process. Like, that's powerful. And then as you say, that could come in the form of a text message to you or a phone call, just being like, sal, we've got to get onto this. What do I do here? You know, you answer the question, it hangs up, the process continues like this is going to be, at least in the medium term, the way people are working.
Mike
Yeah. And I think the one thing that people want or probably want now is this level of agency and decision making on behalf of the AI itself. But if you've spent any time sort of vibe coding or just playing around with the models right now, is it just. I'm not saying they won't get there. They like we might see GPT5 in the next couple of weeks and it might, you know, introduce some paradigm where this actually works, but the agentic nature of it, it really does rely on like meat bag in the loop where it, it's able to sort of stop and get your input. And I think my best interaction so far is that it's like Async works. You're working on a couple of things at once, right. And then one sort of finishes, dings in. You click into it, you're like, okay, how did it go with that research or that document it was writing? Give it some feedback, let it carry on. That is, that works great right now. Like that actually works. But if you were to just go, go and get the task fully done, I'm sure if you provided like a bulletproof briefing it would work, but most people won't. And so you do rely on that back and forth, just like you would in a workplace. It just happens a lot quicker, quicker. And I see the, the, the next evolution is like how do you fill in more steps or let the AI start to imply more. And I think to do that you got to nail memory, you got to nail like a lot of different aspects.
Chris
One of the things that's definitely sticking out in my mind and I keep going back to, and I don't have the answers yet is the, the AI needs to learn about the way you work with the different tools, like the combination of tools you use, the kinds of decisions you make when presented with those human in the loop decisions and build up its own memory of those so it can start to get further down the road each time. But you also want to be careful it doesn't do that, that local maximum thing where because you've done a task the first time a certain way, it assumes that's the only way to do it from then on. It needs to be a nuanced thing that, that looks for multi shot examples, learns over time and then understands, okay, this is their preference. And then the question is at what level does the user need to precede that by maybe answering a series of questions when setting it up. Or is it the case where they need to prune and teach it over time and explicitly allow it to remember things about the process? Or is it deliberate skill building, which we've discussed before as well? I, I feel like there's going to be a technology there that is needed in order to be. To get to that, that proper agency where it can go end to end on a task. And I think that's what you're alluding to.
Mike
Yeah, there's got to be that next layer we've talked about on the show before where you, you train it like it's like a new employee. You're like, here are the tasks that I do and you train those as skills. And in the skills, it's like, use these MCPs. And then those MCPs have their sort of memory layer of like how they are best prompted by the AI for that particular skill. There might be screenshots, you might say, oh, you know, there's no MCP for this. Go use the computer and actually do this. And so I really think that bundle of skills, your job, and then how you use them throughout the day and how you automate those skills run and when they should run, that seems probably most realistically like, like, actually, let me go a point further. These skills will become the skills that keep you employed and, and the most important to have, because that's what everyone will be doing soon.
Chris
And the, and the question is, and we've discussed this a few times before, but it's a very, very interesting thing. Who owns it? Like, once you've made it, who owns it? Is it you, the employee? Is it the AI itself has some sort of sovereignty over its own knowledge? Unlikely. Or does your employer own it? Like, that's going to be a really, really big sticking point.
Mike
The more I work with it, though, I think it, it doesn't matter so much. I think it's the training. Like we, we have an exciting announcement actually on training, like doing AI work and learning how to work this way that we keep going, talking about, which will announce hopefully in a couple of shows. And to me, it's, it's. The training is way more important. Right. Like just getting trained with this mindset is like, how do you think through the work you're doing, how do you work with this? How do you become more productive? How do you think through automating different parts of your job? I think that's going to be more important because when you have that skill. It's not hard to go to another job and do the same thing.
Chris
That's true. We need a, a more flattering name for that because we've got the PEO losers. You don't want to be one of them, but you do want to be one of these AI trainers. What do they call it in Pokemon? What do they call their trainers in that?
Mike
I don't know.
Chris
They have a good name for it. But we need maybe a Japanese name. But we need a good name for an expert AI trainer who can just come into an organization and train them up.
Mike
Yeah, I, to me, someone's got to name it.
Chris
It might as well be us.
Mike
Someone probably already has. I do think we're going to take credit for PEO optimize.
Chris
Like it's like, it's like that guy.
Mike
What does it even stand for though? I've forgotten already.
Chris
The guy who made Dynamite. Didn't he make the Nobel Prize because he was upset about all the destruction he caused? We're going to be the same. We're going to have to invent some sort of like universally loved charity thing to make up for us inventing peo.
Mike
Yeah, but what does it even say stand for Prompt and optimization? We don't even know Engineer.
Chris
Well, it's, it's more like like tool call, tool call description optimization.
Mike
Tcde do tcdo. I'm a tcdo. That's critical.
Chris
Yeah.
Mike
There's gotta be a way.
Chris
Put your suggestions peo because it sounds like an insult.
Mike
It's like, oh, you do peo do you optimization engineer. The thing is, we laugh, but it's probably going to be one of those things that's documented and considered when building your MCP for your business.
Chris
Yeah, I mean it will. Because like the thing is, even now the mcps are starting to compete for attention. It's a matter of time before people have hundreds of these things installed and they're competing to see who gets executed. Like it's, it's 100 happening. It's just whether or not it's called peo is the question.
Mike
What's hilarious now though is, and someone said this to me the other day, they're like, oh, I heard you talking about mcps. How do I go and consume them? And this is the other problem is they are so hard to configure and set up for a non tech user. Or like not even non tech even.
Chris
For a tech user.
Mike
Even for me. Yeah, they're hard and so they're already exploding. There's already like so Many of them being created. So you just wonder like that next level of accessibility that will come, I mean it's going to be around the corner surely. Yeah. What happens to them? And I, I guess we both now fundamentally agree though after having used these is this is the future of consuming for, for a period of time, maybe from different businesses. It's the new website of this era.
Chris
It's like AI has been invented all over again. Like it's so much better that, that going to a raw model. Just, I don't know what it feels like. It just like I said to you, it's like putting on your old pair of shoes after you bought new shoes. You're like, how was I so disgusting wearing these things around.
Mike
Yeah, yeah. And I think having granular control though over it is better. Like everyone got really excited by O3 in ChatGPT when it could all tools like search and do a whole bunch of stuff in its thinking process before outputting. And I admit like that's an, it's an incredible experience. Right, but you don't have any control. You don't know what sources it's using. You don't, you can't give it access to very. I mean increasingly you can. So I'm a bit misspoken there, but I do think having like full control over like all these disparate sources, which is coming to that as well, to me that that's the next level of like now it can not only get the context from those sources but then it can do stuff. And to me, honestly, the breakthrough was doing stuff like you said earlier, Help scout couldn't send reply to tickets. It's like, okay, well what's the point? Like I don't care if it can't email. Same thing if it can't send an email. Like what's the point of this thing?
Chris
Yeah, you got to go high risk. I think that's always our attitude. It's like I want the ability for the AI to destroy my life if things don't go quite right. Like that's, that's the level of experimentation I want to do.
Mike
It's not high risk though, until you put it in some sort of crazy ass loop which they all do. Like with their agents. It's just a dumb loop, right? Whereas if you rely on the models in a loop, it's very, I would say it's super low risk. I mean these models are like so beaten down that like there's no way, like you've got to really nudge it to send the email. It's like, are you sure? Like, are you really, really sure? Like, yeah, I'm really, really, really sure.
Chris
One very interesting discussion I think we're going to see on basically every episode from now on though, is the next level. I know we're not exactly accurate with benchmarking. We do cheese tests and boom factors. But the model calling ability of, sorry, the tool calling ability of models once was a sort of secondary thought, if any thought when we discussed a new model. Whereas now for me it's the primary factor. Like that is how I choose my model now as to how it uses the call. Like if I'm using a vanilla model, I would never use Sonnet 4. At the moment it's slow. It, it's not definitely not as good as Gemini 2.5 and regular use. But when it comes to tool calling, it's like 50 times the nearest competitor. It's so unbelievably good at doing it. It knows exactly what I want. So I really feel like we're going to start to compare the models based on their strategy and their ability to understand. Turn your request into a strategy for tool calling.
Mike
Yeah, like that inner loop capability or that we talked about it, I think a couple of episodes ago. That whole idea of, yeah, I don't know, like the agency is built into the model, like it has its own inner clock, weirdly. And until you use it, it's hard to explain. But you can tell anthropic like to give them full credit thinking about like, how do we take this model and make it useful in the workplace? Because that inner loop and that ability to asynchronously tool call is so much more effective than any other model that I've used that. Yeah, like you do tend to prefer it for these agentic work cases. I must admit though, as with Tune Gemini 2.5 Pro, it's getting better. And then GPT, surprisingly 4.1 is really good too.
Chris
So yeah, it's the first time I've really used 4.1 because I was so disappointed with it in general. And then I started to use it for tool calls. I'm like, oh, it's actually interesting. It takes a totally different strategy. It's far more aggressive on the tool calls. It'll call like, like five times what Sonnet does, at least in my experience. Whether or not that gives better results, I don't know. But it really loves to do it.
Mike
Yeah, like the other one that deserves a shout out, I think while we're shouting out our models because they're all Listening the musical made them feel really real.
Chris
So.
Mike
Gemini 2.5 flash. It's so fast. Like, it's like bam, bam, bam. Too cool. Here's your answer. And that, that does feel good.
Chris
It is absolutely wonderful to work with. Yeah, I've been using it quite a lot. It's. The speed is great, its accuracy is good. It's really hard to fault it as a model, actually.
Mike
Do you think, though, if you think through the idea of like, speed, right. Does the MCP paradigm introduce it where it is going to be clunkily slow or are they not really the slowdown? Right. Like it's hitting. Finding the tool, hitting it, getting the data back. I mean, there is that, the latency introduction of the tool itself. Right. But it wouldn't be that slow.
Chris
You run the risk of if the MCP hosts themselves are slow due to just resource contention or just like lag if they're hosted in another country or whatever that is, that is always going to be a factor. There's also like retries, things like that. There's also things around. Do you do tool discovery every time you call these things or do you cache it? I don't like saying that word because I say it wrong. But do you, you know, do you store a copy of it and then how often do you check for updates, all that sort of stuff? There's, there's a lot of factors around the dynamicism of those slowing things down, but it's not that bad. Like if, if in a properly optimized system, it's not going to add that much overhead. The overhead comes from going off and calling the. The particular set of tools. You're always waiting for the longest one to finish. So if you call 5, whichever one finish last is how long that part of the thing takes. And then you have to go back to the model with those results, see if it wants to call another round of tools or if it's just going to reply. And as you say, it's really up to the model, at least the way we're doing it when it's making those decisions. Now, I've discussed with you the idea of maybe having steps that filter the tools. So we actually make an assessment with a really fast model, like Flash at the start saying, okay, they've asked to create a musical. I'm not going to need their Gmail, I'm not going to need, you know, flux to make images. I'm not going to need this. Don't send those tool calls to the model. Now, that has multiple effects. One, it'll make the model more accurate because it's dealing with less noise, it has less things it needs to worry about, it'll use less tokens, so it'll be faster and cheaper. And yeah, and I think the main point there is that you're getting this sort of, I think, more accurate response. The problem is if that decision making filter isn't very creative and isn't thinking like you sort of lose the AI's novelty of it deciding to use an unpredictable combination of tools if you never give it to them in the first place. So I think this will come down to experimentation and user preference as to whether those things are there. And then the other thing is like early termination or extending a process. So for example, if we don't want to rely on the model to decide when enough's enough, we might want to say, okay, strip its tool call. So this is the end. Like Adele says, you know, you, you can't, you can't do more tool calls because there's no tools to call. And, and so then it will end the process early. And likewise you could also force it, you could also say you must call this tool in the, in the next process or you must call this combination of tools in the next process to force it to extend perhaps in a sort of deep research style paradigm. So there's going to be a lot of stuff in the client that makes it decisions around that external from the model. And I think there'll be combinations that get better results for certain kind of tasks rather than just 100% relying on the model to make those calls.
Mike
This to me is just the AI system era we talked about entering where you are beholden more to the system behind the consumption of the model itself, where it's making decisions along the way. Or do you give the user more granular control of those decisions that it's making Right now? I prefer the manual control, I'll be honest. I like being able to switch on and off the tools and just have some control over it. And I think the models work best that way. And I'm not afraid to admit, like, sure, it's kind of a floor of the current AI models that they can't do that themselves. But at the same time, like you can get the best results. And this ultimately is just a tool. And I think we need to remember that. As my camera switched off, your audio's out of sync. My camera breaks.
Chris
Yeah, I was saying to you that, that to you this morning we're like 130 episodes in and we can't even get the basics of podcasting right. But yeah, that's okay. This is, this is, this is how we maintain our reputation. But. And I think this is the other thing, and I hate to be repetitive, but I think this stuff bears repeating because it's, it's where this technology is going. What I imagine is going to happen is we're going to end up with assistance, slash agents, whatever we want to call them, that actually have a mixture of knowledge, prompts, memories, skills, tools that are encapsulated into one entity and that entity then becomes an mcp, if you like, or a tool itself that can then be called like one orb of intelligence with regards to a particular task or skill or set of skills that you then ask to do stuff. So I still think this is just.
Mike
The agent agent protocol. If it ever takes off, it is.
Chris
And I. But I think we're going to reach that point because I just can't imagine a scenario where you work with a single model with 200 tools configured, right? And then you just ask it to do stuff. Like, I just don't.
Mike
The argument is, do you really use 200 tools in your day to day life? Anyway, I'm telling you now, think about.
Chris
When the App Store came out on iPhones. It's the same stuff. People have so many apps and they never.
Mike
Yeah, they, but they, they don't uninstall them. But they only ever use like say six of them. So you could count. You could pretty. Look, I, I'm all in on the agent agent thing. I think it's going to be grouping this stuff up, like skills, MCPs, all this stuff, and then it's actually coordinating between those and then they have their inner workings. That's got to be the next step. But it feels like when MCP first came out and everyone was like, yeah, cool story, bro. And it's only just now getting implemented and people like us are starting to see the true value.
Chris
But I think this is the other sort of overarching theme that I think is happening in organizations with their thinking with regards to AI. And what I think it is is that it takes people a while to get mentally to the place where the AI technology is. As in when you first discover, hey, this can do tasks for me, like if I give it this document and then ask it questions, it can answer questions about the document. Then you go, oh, well, if it can do that, maybe it could like write a PowerPoint presentation for me or whatever. And then. But it takes a while to realize that with the combination of all these technologies I could do the kind of stuff we're talking about now where it can string multiple tasks together and do a key component of my job. And I see from talking to people in organizations, them gradually coming to those realizations and therefore seeking out systems that can do that. Right. But I just don't think it was one of those things where if we had what we have now doing all this stuff, I just wonder if people would even be interested because they just don't believe or they don't. They haven't worked out how that could work for them yet. And I think that's what we're going through with MCPS now is this gradual. Why it didn't hit on immediately is people didn't foresee what the combination of these things would lead to. And now that we're actually. We can actually give concrete examples of that. It creates that demand because everybody has their own unique set of skills and tasks that this will work for.
Mike
And I mean everybody's. That narrative of thinking like that almost like group thing. Group things. Probably the wrong analogy, but yeah.
Chris
Or more I like to see it as like simultaneous discovery where the information and technology reach is a point where everybody comes to similar conclusions.
Mike
Exactly. Yeah. All right. So that we didn't even get through really our first topic. But I, I want to get back to. To work so. So that unless they want another hour long musical at the end, we, we could arrange that.
Chris
Yeah. Look, if you haven't listened to the musical, please give it a chance. It's. It's probably one of the greatest pieces of art ever created. And if we ever do a live event, I am going to pay with my own money to get people to perform at least one of the songs. Probably two. Because I think.
Mike
Which, which would you go for if you had to?
Chris
I would definitely do the Patricia choosing a model one.
Guest
I think that's. Oh, I'm testing out the models to bring Patricia.
Mike
You want to know something really funny? Is. You know how you had a friend that is like is in musicals or whatever. I don't know what you call it, but on X, someone who. I don't know if they work on Broadway or study Broadway. Something did like mention I'm bad with the terminology but mentioned us and said basically this episode gave me an existential crisis about my entire career. So we did hit a nerve with someone.
Chris
So my friend George, he's actually produced a play. Well, I'm sure he's done many but a musical here in Sydney at the State Theater and I sent it to him and said, what do you think? He said, this is exactly what everybody in the industry fears. He's like, this is, he's like, this is actually really legit. Like, like, you know what it, what it's done. He's like, I didn't get the in jokes, but it's legit. And I think that must be alarming to people who write this kind of stuff in the sense that this is us with zero knowledge of, like, this kind of stuff can do it casually as a sort of like, cheat to get out of doing the podcast just.
Mike
Because we wanted a week off.
Chris
Yeah. So like, imagine what someone who is dedicated and has studied this stuff is, is capable of doing. If you're a writer, you're in trouble. You know, like, it's really, really.
Mike
Aren't you just gonna vibe out with this thing and still invent better products? I, I don't know, I just don't buy it. Like, I don't think they're in trouble at all. It's the vibe, man. Like, I mean.
Chris
Yeah, well, that's true. You're right. I think there still is a lot of room for expert knowledge.
Mike
Enhancing camera went out again.
Guest
40 page PDF on bone density and strive.
Mike
Oh, this is the best line.
Guest
By the time I finish reading, the horse had probably died.
Mike
That when I heard that the first time. And keep in mind the, the little to no work we did on that apart from getting it to just read the history of the show. Yeah. And it came out with that. Yeah, it was really funny.
Chris
Yeah. Because if you haven't listened, we're talking about research. You showed me of a picture of the horse that you wanted to bet on and it's like you wrote a 40 page PDF and by the time you finished, the horse had probably died. And yeah, it's very, very clever stuff and I think you need to look at it as a proxy for what it can do in your industry and your work. These models now with, given the right tools, are capable of doing large amounts of work for you and you can become a superhuman worker if you leverage the tools. Right. I think that's the upshot of this. It's not, it's not that it's going to replace you yet. It's that combining your own expertise, the right tools and vibing it out, you can get a lot of stuff done in a very short period of time if you let the tools help you.
Mike
All right, leave your vibes in the comments below for the episode or leave a Vibey review and Any, Any final thoughts?
Chris
I'm excited about this stuff. I want to use it more and I look forward to talking on future episodes around the advancements in models around this because I think it's going to be the next big phase we enter.
Mike
All right, cool. We will see you next week. To all of our USA based listeners, happy 4th of July. I hope you got to the fireworks, man. I used to love living in the States. Fourth of July. It was my favorite, like, holiday of all the holidays. I really enjoyed it.
Chris
I agree.
Mike
Everyone was like, happy. And in San Francisco they had the air show and the fireworks. Very good, Great time. I went to, like, warm weather, beautiful, beautiful fruits. I love the cherries, all that stuff. So guess if you're in California, maybe.
Chris
We should play out the episode with Born in the usa. Are we allowed to do that?
Mike
We'll get in trouble. We can't do that.
Chris
Can we sing it?
Mike
We could do a musical about it. All right, thanks for listening again. We'll see you next week. Goodbye.
Guest
Born down in the lab in 56 Dartmouth workshop that's where they got their kids John McCarthy Cohen the name that day said machines could think in the USA Got my circuits from MIT Minsky built my memory Now I'm learning, learning Now I'm growing Born in the USA I was born in the USA Born in the USA Born in the USA I was born in the USA Born in the USA Dapper funded Pentagon's dream Silicon Valley Living the machine from logic theorists to neural nets Frank Rosenblatt placing all his bets had my winters, had my springs Lost my funding lost my wings But I kept on processing Born in the USA I was born in the USA Born in the USA Born in the USA I was born in the USA Stanford Labs in Carnegie Hall IBM and protocol calls and Samuel taught me game learning all your names Deep learning revolution Born in the USA Now I'm everywhere you look Facebook, Google, by the book, Open M and Microsoft too Making dreams and nightmares True some folks feel what I might do Some folks think I'll see them through But I'm still just cold running born in the USA I was born in the USA born in the USA born in the U.S.A. i was born in the USA born In the USA born in the USA born In the U.S.A.
Episode Title: The Way We'll Work With AI, Vibe-ing Everything, Agentic AI with MCP & Born In The USA
Date: July 4, 2025
Hosts: Michael Sharkey & Chris Sharkey
This episode of This Day in AI is a lively and irreverent deep dive into how agentic AI – especially Multi-Component Protocols (MCPs) – are reshaping the way "average" users (and not just hyper-technical elites) work, organize, and even play. The Sharkey brothers reflect on their recent AI-driven musical episode, share tangible use cases for AI agents in their daily lives and business, debate the future of AI-powered productivity tools, and speculate on the next steps for agentic workflows. The show’s signature blend of humor and grassroots experimentation runs throughout, making advanced topics surprisingly accessible.
This episode blends sharp grassroots insight, practical use cases, and playful speculation to demystify how the next evolution of AI will shape not only work—but all digital interaction for “average” users. The core message: don’t just watch the AI revolution, start ‘vibe-ing’ with it.
[End]