Summary7 min read

The Vergecast: "I Just Want AI to Rename My Photos"

Date: November 30, 2025
Host: David Pierce
Guest: Thomas Paul Mann (Founder & CEO, Raycast)

Episode Overview

This episode launches a two-part series exploring how makers of AI-powered software are thinking through the possibilities and pitfalls of integrating AI into their products. Host David Pierce talks with Thomas Paul Mann, CEO of Raycast, about the real and practical ways generative AI can change how we use our computers— focusing specifically on workflows, device control, and just how far we are from a world where AI automates all our digital busywork, like renaming your photos with a single prompt.

Key Discussion Points & Insights

1. The State of AI in Consumer Apps

(00:32)–(02:47)

The tech industry is racing to embed AI into everything, sometimes unnecessarily.
Genuinely useful applications exist, such as reliable transcription (OpenAI's Whisper), organizing tools (Todoist "Ramble"), and AI-centric search (MyMind).
The central question: Where should AI augment our workflows, and how do we balance utility, complexity, and privacy?

2. Introducing Raycast and Its AI Ambitions

(05:58)–(07:42)

Raycast originated as a Spotlight alternative for Mac, allowing quick app launching, text expansion, and window management.
Recently, Raycast added the ability to run AI models and even let AI interact with your apps and files— for instance, asking Raycast to grab browser tabs or find recent files.
The app is uniquely positioned, sitting at the intersection of user commands, app extensions, and now, AI-driven automation.

“Raycast itself is like a massive search box ... so you can just type something in … and it just used to be very static text ... but then it felt quite natural to extend this and just put in natural language like a prompt and then get going.”
— Thomas Paul Mann (06:44)

3. Early AI Integrations and Evolving Use Cases

(07:42)–(09:55)

Early days: Integrated OpenAI's GPT-3 right after it launched—noting demand from users for more natural ways to interact with their devices.
Initial predictable tasks: answer questions, summarize content, search the web.
Obstacles: Model hallucinations, finding solutions for accuracy, and layering in web and local data access.

“As with every new technology, people kind of adapt to what’s possible and then they take the next step and push the boundaries.”
— Thomas Paul Mann (09:35)

4. To Build or Integrate? The Model Dilemma

(09:55)–(13:21)

Raycast decided not to build its own LLM, but rather integrate multiple external models (OpenAI, etc.), as users have varying preferences and use cases.
Some tasks require fast models, some need more depth—model choice is both technical and personal.
Raycast fine-tunes prompts and workflows for reliability in their context.

5. The Push Towards Agentic Workflows

(13:21)–(18:51)

Raycast envisioned early on an "agentic" system where users simply describe tasks and the computer acts.
Reality: Reliable agentic workflows are difficult—AI remains unpredictable. Hybrid UI (text box + suggested actions) is necessary.
Discoverability is a challenge: users must both trust and understand what’s possible with natural-language prompts, echoing earlier voice assistant frustrations.

“Prompting is still like a skill thing ... as those systems become much more proactive, I think this will be better.”
— Thomas Paul Mann (17:39)

6. Model Routing and Orchestration

(18:51)–(22:44)

Current frustration: Users must pick the best AI model for each task.
Ideal future: Raycast (and similar tools) should auto-route prompts to the most appropriate model for each use case—abstracting away complexity while letting advanced users customize.
Orchestration across access points (apps, files, extensions, and models) is where Raycast aims to differentiate.

“We started basically now abstracting that away ... the best experience is you sort of have an automatic mode which just does what you want.”
— Thomas Paul Mann (20:14)

7. The Path to Reliable, Everyday AI

(24:47)–(29:53)

Sustainable AI integration is about building out incremental, reliable workflows—avoid science-fiction "100% agentic" promises that break on first use.
Focus on improving repetitive, mundane tasks like reformatting text, spelling corrections, and—crucially—file operations like naming and sorting photos.
The “last ten percent” of reliability remains elusive; demos can mislead, but even 90% solutions provide value.

“We’ve all seen the shiny demos in launch videos and then they fall apart the moment you use it ... It’s science fiction, right?”
— David Pierce (26:07)

Memorable Segment:

Renaming Your Photos or Cleaning Up Files—Are We There Yet?
(29:53)–(33:26)

David underscores the real-world need for AI-powered batch operations, like renaming photos based on date/content—a task Raycast is “90%” able to do now.
The vision: The rise of disposable, personalized tools called into existence via prompts, rather than ever-fatter utility catalogs.

“You can do this today in Raycast, we have that ... And then the 90%—every now and then, it doesn’t work.”
— Thomas Paul Mann (31:26)

“What if you could have this app just by asking AI and it builds this little app for you, and then you have it for yourself ... Software becomes malleable and you can change it ad hoc and it becomes just what you want.”
— Thomas Paul Mann (32:10)

Notable Quote

On AI’s role at the OS Level:

“If you purely think from a user’s standpoint, AI should be on the operating system level. It just makes so much more sense to be there instead of in every app ... from a user’s point, the best thing is if you have a smart operating system that helps you to get your job done.”
— Thomas Paul Mann (61:22)

Safety, Privacy, and the Human in the Loop

(43:02)–(50:35)

Raycast takes extra care with security and trust due to its deep system access; users are alerted before destructive actions, human override is always possible.
Not all AI-powered features should be built. Example: discarding plans for an AI to always-actively monitor the screen for distractions due to privacy and user unease.
It's a value exchange—users will give up data for utility, but developers must responsibly draw lines and retain user control.

“We put a lot of effort into making [Raycast] super stable. ... For that it’s even more important to have the guardrails right.”
— Thomas Paul Mann (44:43)

The AI-First Workflow and the End of Apps?

(53:11)–(62:10)

Thomas: “My brain is completely rewired and it’s like I’m prompt first by now ... I basically just put things [into Raycast] ... and then iterate on that.”
Raycast is used to orchestrate across browser tabs, apps, note-taking, and even code snippets—lowering the barrier for non-coders to automate or script computer behavior.
The once-clear boundary between extensions, mini-apps, and AI-driven workflows is blurring: sometimes the artifact should be concrete and repeatable; sometimes open-endedness is a strength.
Designers at Raycast now code; creativity is accelerated by AI.
But: Discoverability and reliability still need work. Open-ended prompts are powerful, but for true productivity, repeatable tools are still king.

Closing Thoughts

Both Pierce and Mann agree the industry often races for an “end state” vision (fully autonomous computers, etc.), but there's value—and a real user need—in practical, incremental AI that augments, not replaces, human control. For now, the best AI-powered experiences will be context-aware, OS-level, user-driven, and above all, helpful for repetitive or complex-but-patterned digital tasks.

Key Timestamps

00:32–02:47: Why AI everywhere? What works and what’s silly.
05:58–07:42: Raycast’s roots and AI ambitions.
13:21–18:51: The challenge (and limits) of agentic AI.
18:51–22:44: Why we need AI to auto-route tasks to best models.
29:53–33:26: AI for renaming photos—how close are we?
43:02–50:35: Privacy, guardrails, and where Raycast draws the line.
53:11–62:10: Living “prompt first”; the blurring of apps and AI workflows.

Notable Quotes

“Raycast itself is like a massive search box … it felt natural to extend this and just put in natural language like a prompt and then get going.” — Thomas Paul Mann (06:44)
“We started basically now abstracting that away … the best experience is you sort of have an automatic mode which just does what you want.” — Thomas Paul Mann (20:14)
“We’ve all seen the shiny demos in launch videos and then they fall apart the moment you use it ... It's science fiction, right?” — David Pierce (26:07)
“If you purely think from a user's standpoint, AI should be on the operating system level. … From a user's point, the best thing is if you have a smart operating system that helps you to get your job done." — Thomas Paul Mann (61:22)

Summary by The Vergecast Summarizer — Listen to the full show for deeper insights on the future of AI-powered productivity and the real, messy ways it could change your digital life.

Loading summary

Transcript93 lines

[00:00]
Thomas Paul Mann
Adobe Acrobat Studio, your new foundation. Use PDF spaces to generate a presentation. Grab your docs, your permits, your moves AI levels up, your pitch gets it in a groove Choose a template with your timeless cool. Come on now, let's flex those tunes Draft, design, deliver, make it sing AI builds the deck so you can build that thing do that, do that, do
[00:24]
David Pierce
that with Acrobat learn more@adobe.com do that
[00:28]
Thomas Paul Mann
with Acrobat
[00:32]
David Pierce
welcome to the Vergecast, the flagship podcast of using AI models to rename all the files on your computer, for better and for worse. I'm your friend David Pearce, and this is the first in a two part series we're doing about AI and more specifically how people who are building AI tools are thinking about the AI tools that they're building. Basically, we're at this moment in time where everyone who makes any kind of app, any kind of software, any kind of hardware, for goodness sake, is trying to figure out the ways to put AI into this. And on some level I think that's silly, right? Like there's, there's a lot of stuff out there that actually does not benefit from having ChatGPT shoved into it in some ridiculous way. But on the flip side, there are actually a lot of things that become better and more useful and more functional with these kinds of tools. One thing I think about a lot is, is text transcription. It's a simple thing, but, right? OpenAI put out this whisper model that does really good, really fast transcription of audio that ends up being really powerful for lots of things. There's this feature in todoist, the app that I really like. It's a To do list app and they have this feature called Ramble. I think I've talked about this before, but you can just talk your to do list, all the things you're thinking about, all the things on your mind, all the things on your shopping list, you just sort of yell it into the app and then it will attempt to go through and structure it all and make sense of it. And there's, there's a couple of different layers of AI in there, right? But the first one is just take your voice and reliably successfully transcribe it. That's very powerful. Uh, there's also an app I use called MyMind that is using AI to do really great search, so that instead of having to like make a bunch of notes and then file them into folders or give them tags or do any kind of organizing, you just put it all in and trust that you'll be able to Sort and search and find things as you need to. This stuff can really work. So for the next two Sundays, what I'm going to do is I'm going to talk to two people who are making apps that I think are doing a smart job of AI. It's going to sound in both of these interviews like, I like the products and it, I do.
[02:48]
Thomas Paul Mann
It's.
[02:48]
David Pierce
That's why they're here, because I think they're thinking about AI not as just something to like shove into the app to charge you more money or juice their stock price or whatever, but because there's something it actually makes possible. And sometimes it makes those things possible in ways that are complicated and messy and privacy threatening and maybe even threaten to like ruin the vibe of the thing you're trying to build in the first place. But that also have upsides that make the things more useful and more fun and more discoverable. So we're going to talk about all of that. And my guest for this first one is Thomas Paul Mann, who is the founder and CEO of a company called Raycast. Raycast is initially was a Mac app. It's now on iOS and on Windows. The way I would describe it is it's sort of a launcher and then some, right? So you use it to replace Spotlight on your Mac and in it then will let you launch apps. You can use it to store like text expansion things. I have one set up so that when I type H O M E for home, it just immediately spits out my home address. That's a thing that lives in Raycast. You can also use it to like manage the windows on your computer and move stuff around. But increasingly one of the biggest things that it can do is access AI models and you can use it just to chat with ChatGPT inside of Raycast, but you can also use ChatGPT to use your apps. So I, I can go in and I can type, you know, at browser, download all of the tabs as a CSV and put it into a text file that I can then send to somebody. And that's like a thing it is in theory capable of doing. I can open it up, I can say at finder, show me all the files that I have created in the last 24 hours. And it's, it's actually an AI system that can use your other apps and even use your computer. We've talked a lot about AI browsers, we've talked a lot about these sort of tools that have lots of additional context. Raycast has more context than just about Any other app. I've been using this app for a long time. I really like it a lot. I have not made that much use of all of the AI stuff inside of it. So I wanted to have Thomas on to both talk me through how he thinks about putting AI into this product and also what it can do for you when it really starts to work. I really enjoyed talking to him. I think you'll enjoy hearing it. We're going to take a quick break and then we're going to get to my interview with Thomas. We'll be right back. Support for this show comes from Vanta. Vanta uses AI and automation to get you compliant fast, simplify your audit process and unblock deals so you can prove to customers that you take security seriously. You can think of Vanta as your always on AI powered security expert who who scales with you. That's why top startups like Cursor, Linear and Replit use Vanta to get and stay secure. Get started@vanta.com Vox that's V A N T A.com Vox vanta.com Vox Thomas Paul Mann, welcome to the Vergecast.
[05:56]
Thomas Paul Mann
Hey, thanks for having me.
[05:58]
David Pierce
You and I have talked many times, but we've never talked into a recorder like this. And I'm very excited to have you here. This is when we were like, we're doing this series about people who are sort of building and thinking about AI and what AI can do, which is a conversation you and I have had versions of many times. So now we're just going to do it again and I'm excited about it.
[06:19]
Thomas Paul Mann
Sounds good. Yeah, sounds like we have done it a few times already. So let's see.
[06:23]
David Pierce
Indeed. So first give me a sense of. I think you've been thinking about AI inside of Raycast for a while and I would say just sort of rewind to like the early days of when you started thinking about how AI models fit into what Raycast was doing a couple of years ago. Like, what were those first conversations you were having?
[06:44]
Thomas Paul Mann
Like, yeah, so yeah, Raycast is like sort of this global search bar on your Mac, right? And actually now also on Windows. But like basically what we realized when ChatGPT came around and suddenly everybody talked about a prompt and everybody was looking for like a text box to feed in a prompt that we were really well positioned for that because Raycos itself is like a massive search box and so you can open it anywhere and so you can just type something in and it just used to be just very static text. Like, oh, you're searching an app or like a command or you want to do something, but then it felt quite natural to extend this and just put in natural language like a prompt and then get going. So pretty much right after ChatGPT came out, we. Hey, wait a second, that suits us really well. And so I think the first model was GPT3 that we integrated, which was back like end of 2022. Okay.
[07:43]
David Pierce
So the thinking even then was like, we can solve some of the way people talk to their computers. Because I think to me, it's like the very first good thing that any of these models did was make it slightly easier to like, speak in English to your computer. Was that kind of your. Your thinking too, that, like, we can just make this make a little more sense?
[08:05]
Thomas Paul Mann
Yeah, I think, like the very first things that people did was just like asking questions. Right. And so it's kind of funny because all the way back when we started Raycast, we were like, oh, like we're programmers, we sometimes have questions, how do I do X with that? And then you used to go to something like Stack overflow and find those questions that somebody else asked. And then you go over it and read the answer yourself. And you kind of had to be very good at basically keywords to find a proper question that leads you to the answer. But now this whole thing got basically flipped upside down. And so the very first thing we saw, like, oh, people just ask questions and so get the answers and then carry on with whatever they do. And this can be little things, can be fun things, but it helped them to basically stay informed. And so one of the first big challenges to overcome was like, oh, but sometimes those models hallucinate and how do we get over that? And I think then the next sort of wave was very easy, like, oh, well, let the models do what we would do and search the web and then take that information and distill it down into the style and the tone of voice we wanted to have. So that was sort of the very first things that we've seen picking up. And since then, it became more and more advanced, Right. And then it was like, okay, maybe not just search the web, but search your calendar, search your files, read your files, do actions like organizing your folders and files on your Mac or other things. So I think as with every new technology, people kind of adapt to what's possible and then they take the next step and pushing the boundaries a bit more and more. And given we have quite advanced users, they're oftentimes on the forefront and so they pushing really hard to the extremes, and then we can kind of see what they wanted to do and then integrate that quite nicely and make it accessible to many more people.
[09:56]
David Pierce
So do you have to make a decision early on about how much of the stack of AI do we want to be part of? I assume there was never a question of should we train a Raycast LLM? But it does seem like you could build something that is essentially just a text box that replaces the ChatGPT text box, or you could build something on top of it. You could try to integrate with an API and be a sort of developer. There just seemed like a lot of different ways you could try to do that thing that you just described at vastly different levels of complexity. Was it obvious to you where to land early on?
[10:39]
Thomas Paul Mann
No. I mean, this thing came out overnight, right? Like suddenly this thing was there and like, oh, wow, what used to be sort of sci fi and the movie thing was suddenly like, somewhat possible, but like, you kind of need to get started somewhere. So early on we kind of were just playing with the APIs and integrated them initially just OpenAI because this was literally the only API that was there. There was nothing else that was even available. And so that took us several months until somebody else popped up. And then it became clear, like, that models are all a bit different, not even talking about which ones are smarter or faster, but just to have nuances and people prefer the one over the other for oftentimes personal reasons, like, oh, I like how this model talks to me. And so really quickly we said like, oh, it probably makes sense to integrate all the different models because people are going to have personal preferences and they're going to be better and worse models or better models for certain cases. Sometimes you just don't need the full intelligence. You just want to do something simple like, oh, summarize this blog post or rewrite this message. Those don't need to be the cutting edge models. You want to have them rather fast. And then sometimes you want to have a model that goes on for several minutes and does a bunch of research and then coming back with a big research paper for you. And for that you probably need to have a better model. And so early on we said, okay, let's integrate with as many models as possible because we have a quite technical audience. So that will help us also guide which models they prefer and what we now see. After a couple of years, whenever there is a new model dropping, everybody goes to the new model, tries out, and basically want to have the latest and greatest, and then they're using that for several months until the next model drops from a different company, most likely, and then they're going over to the next one. So the switching costs between those models are extremely low at the moment, at least for us and for our users. And then I think they're building up a bit of muscle memory or even learning how to get the most out of those models. And those things are sometimes a bit more tied to, let's say, a model family. But early on, we said we're not going to go and build our own models. What we did, we did some optimizations on the prompt level and also some fine tuning to make the models really good in our case. And so that is, for example, making a lot of agentic workflows. But nowadays a lot of the models are pretty good at that on a basic level already.
[13:21]
David Pierce
What were you doing in those early days? That was before everybody was talking about agentic stuff. But it also seems like kind of perfectly up the alley of Raycast to figure out, okay, how do we have this unique access to your device and your files and your data? How do we teach this model how to do stuff? Were you poking at that stuff even in these sort of early days, before everybody was talking about agentic AI?
[13:44]
Thomas Paul Mann
Yeah, so we had this pretty early where we said, this makes total sense for us. We have this extension platform. So there are over 2,000 extensions, they're publicly available. You can integrate Racos with Notion, linear, Google Docs, GitHub, you name it, anything you can really think of. And also on your local computer, it can see your files and your calendar, et cetera. And so we had all of those extensions lying around and we're thinking like, oh, my God. The obvious thing is, instead of you doing everything manual, you say what you want to do and the computer does it for you. Turns out it's a bit harder getting there, right? But the promised land is like, quite nice, right? So you flip it upside down, essentially, and kind of change how you do use computers in the first place. Because if I think about how I used to use a computer, it's like, okay, I have an idea what I want to do. So in my head, I'm kind of transpiling that into clicks and keystrokes and navigate around my computer. But now it's like instead of doing that, I just write down or even speak into my microphone for several seconds and then let the computer handle it for me. So we had this idea early on. Getting there was a bit harder because, one, we wanted to make sure that those things work really well. So that isn't that easy. You also had to figure out a bit the UX because it's still with a prompt where you can say anything is great, but. But also we used to have Create UI that guides us, right? We have buttons that we can click and help us basically navigating. And now suddenly the computer pretends that everything is possible. It's a bit of a lie because that's oftentimes not a truth. And so figuring out sort of the middle ground between when it makes sense to have UI and when it makes sense to just have an open prompt field, that was like a bit of a challenge.
[15:42]
David Pierce
Well, it's also kind of an essential Raycast problem, right? Like this is a thing you and I have talked about before. The how do you discover what this thing is able to problem because you open it up and it is just a text box. Like you have the exact same problem that ChatGPT was has, which is that you open it up and it, it makes clear that you can do things, but it's not super clear what things to do or how you teach it to do things. And again, this is where like all the agentic companies get really excited because they're like, you just say it and we'll figure it out for you. I'm extremely suspicious of that as a, as a concept. Uh, but it does seem like you're, you're sort of stacking discovery problems on discovery problems here. Uh, is there, is there a way to start to push through those things?
[16:26]
Thomas Paul Mann
I think so, yeah. We gotta learn how to use this new technology, right? And it changes in how IT behaviors. If I look for example for a moment at coders, right? So they probably a bit further ahead in this adoption curve and it's like they're very close to the technology. So I think that's why it also progresses there really quickly. But programming used to be like you write text and if you write something wrong that's bad. And then at some point somebody like a compiler tells it that's bad and then you correct it, right? And then we happen to know it's like oh, we can do some auto completions, so we kind of know what's possible so we can show you possibilities and then you pick them and that's great. And then the first LLM use case was like piggybacking those completions and say like, oh, maybe I can tell you a bit longer what you could write and kind of suggest you that and predict that for you. And then that worked really well. And then now it's like, well, I don't even write code anymore. I just write what I want to do and let the LLM do it for me. And so I think you see the pattern where I call this oftentimes prompt first. If you know what you do and you know the system, you get actually really good results. But you're right, there is a discoverability phase where you need to know what a system can do. Right. And I think we had this not that long ago when we had all the voice assistants, I'm not meaning the ones we have right now, but back in the day, the Alexas and all those things where suddenly voice interfaces were the hot thing. And then everybody was like, oh my God, this is amazing, I can order an Uber and God knows what. And then everybody got them and well, we all know how that one turned out, right? It wasn't that useful after all. So. So I think it's sort of the tech is obviously much better now, but people still need to learn how to use the tech and that just doesn't happen overnight. And so prompting is still like a skill thing. Oftentimes we get user feedback, oh, this didn't work. And then you look at it, it's like, well, I'm not surprised that it didn't work because it didn't really tell it. So. Yeah, discoverability is something that I think is extremely important. I think as those system become much more proactive, I think this will be better. Like when a system pushes to you like hey, how about that? Or you start typing and it suggests, oh, I know kind of what you want to do and I know what the system can do. So I can suggest intelligently what's possible and kind of guiding you in the right direction.
[18:51]
David Pierce
So one place I think that approach could be really useful and I'm sort of surprised to have not seen more people try to do it. Is what you were talking about earlier with the fact that there are lots of different models that all have lots of different skills. It seems to me what we need is not just a sort of model switcher. Right. And Raycast offers you access to lots of different models. There are lots of apps out there that are just like we have all the models in one place and that's something. But what I actually want is something that is like an intelligent router between the models that's like, okay, this is actually the one that is going to do better image generation. And oh, what you prompted me is actually a huge research product. Let me funnel it to this like I think the idea that we all have to understand which model is best for which thing is like ridiculous and just bad ui. And it seems like this is a thing that you're actually in theory in a pretty good position to orchestrate. Is, is this, is this like a possible thing to do? Like, why, why doesn't this exist yet?
[19:51]
Thomas Paul Mann
Yeah, in fact, we started doing that. Right. I think it's basically first you kind of need to understand what are the best models for what thing. Right. And some of them you can measure, but others is also like a personal choice. But we started doing this and I noticed like, oh, there are some models that are just better at like for what you said, like image generation or some are better at trending workflows where they use certain tools to get a job done and some are better at recognition of images and all this kind of stuff. So we started basically now abstracting that away and sort of we think about disclosing the complexity over time. So we think the best experience is you sort of have an automatic mode which just does what you want, right. Automatically and so you don't need to worry about it. If you ask a question where it needs deep research, it does it for you. If you want to generate an image, it picks the best model for that. But then also sometimes when you get more advanced, you maybe want to have the flexibility so you can go a level deeper. And we want to give you still the configurability where you say, hey, I kind of know what I'm doing, so I want to have that specific model doing that job. And so you kind of go up and down in the configuration and can pick what you want. And I think you see this like in the industry where you saw this on ChatGPT, they put out an auto mode and then everybody was freaking out that you can't select models anymore. And then they kind of had to broaden it up. And I think this is something which you will see more often because if I think about it, we have this massively smart systems and we still need to figure out which one we need to use. As you mentioned, it's a bit ridiculous. So over time I think this will just go away and it just like does what you want and figures it out. We're just not there yet.
[21:48]
David Pierce
In a way, do you think Raycast is in a uniquely good position to do that? Because it seems to me like again, when I think about Raycast, I think about it as like, it's just a box with access to all of my stuff.
[22:04]
Thomas Paul Mann
Right?
[22:04]
David Pierce
And I think on the one hand it has access to all the files on my computer, which I think is a thing that is sort of unique to Raycast. On the other hand, you have, like you said, all of these extensions and all of these apps that I'm plugging into. And I'm like, you know, typing my API keys for all of these apps into Raycast. So like, you, you have this access and then on the, on the third hand, you have access to all of these models. And so what it seems to me is, at least in theory, there's nothing. You don't have to just be able to orchestrate all of this stuff on my behalf. Like, is there some big blocker here that I'm missing or is it just a matter of figuring out how to make all of this stuff work?
[22:44]
Thomas Paul Mann
Yeah, it's model header. Just basically for us, as you're right, we're basically in the perfect spot, right? Just happen to be there at the right time at the right place. So yes, we have access to all of that on top. We also kind of see what you're doing, right. And throughout the day which actions you perform through Raycost. And so we had like the usage patterns and those things as well. So connecting those things together is, I think the magic sauce here is like what basically makes this really personal and unique and tied to you, right? Because we all somewhat unique in using our computers differently and we use different apps and we write differently and we're interested in different things. So you kind of need to have this personalization layer. But we're also spending hours a day in front of a computer and perform things right? And so collecting those and analyzing those and making sure that we basically can predict the next things you want to do and becoming smarter over time and having this sort of reinforcement learning for you personally, I think that's what really makes a difference. And we call this sort of the contextual AI. We talk a lot about context generally with AI, right. Like, I think when you talk to people, like prompting is providing context to a large language model so it can produce the best results. But sometimes it's really hard having the relevant context. But if you happen to be always around while somebody works on computer, you can collect a lot of that and basically become smart over time and can help the person steer in the right directions. And kind of ideally the computer knows the same as you do, right, from what you have read and what you have consumed and building that up over time.
[24:31]
David Pierce
So.
[24:31]
Thomas Paul Mann
So it can basically take the same resources and throw them back at you and form connected dots because it has read all the things and consume that and then also tie together and use the same apps and tools you use that are already connected to Raycoast.
[24:47]
David Pierce
How do you start on a project like this? I think a thing that I see a lot of AI developers and people building stuff with these tools do wrong is they try to do the like hundred percent version of the thing all at once, right? And like, yeah, I can say with great confidence most agentic workflows don't work. They just don't. And there, there are a lot of perfectly valid reasons for that, but there's also a lot that this stuff can do. And, and I think to me it seems like the struggle right now is, is figuring out how do we do, how do we sort of sequence this thing that eventually gets us to the place that we think and hope this technology is going, but that actually works today. And what I see is everybody either builds stuff that doesn't work or it ends up just being a ChatGPT text box and you just sort of offload the, the. Does it work or does it not? To chatgpt?
[25:44]
Thomas Paul Mann
Yeah.
[25:45]
David Pierce
Like have you, have you figured out how to sort of sequence your way to this magical thing that may someday be true, but clearly isn't yet?
[25:52]
Thomas Paul Mann
Yeah, I think that's the tricky part, right? Like we all seen the shiny demos in launch videos and then they fall apart the moment you use it, right? And it's kind of annoying and I mean sometimes those things don't even ship. Like they're just videos and they're never getting material.
[26:07]
David Pierce
It's science fiction, right? And it is like the dream. This is the interesting thing about this moment is like I think people mostly agree on what the dream is, but it is still a dream, right? Like it is, it is, it's, it's a plausible future, but it is still the future. And, and I think like everybody would do well to just remember that a little more often.
[26:29]
Thomas Paul Mann
Yeah. I think, I wonder like if it ends up being this like, you know, like self driving cars, like, oh, it's just like one more year and then it's going to be self driving. Right? And then this goes on for 10 years. Right. The progress that LLM has made shows a bit different. The trend, the trend is more like, oh wow. We like having a lot of progress in a very short time and it doesn't seem to stop anywhere close. But I think it boils down to sort of the usual things like do the simple things first, so try out because it's such a new technology, you kind of need to get an understanding what's possible and what is not. And so it's a lot of just prototyping and see kind of what sticks and what brings real value. And it's not just like science fiction, as you say, which maybe works 1 out of 10 times and then that's not going to be useful. Right. And so I think like finding that middle ground is extremely hard and you see like some of those things, they're happening, right? So where people see value, which may be not the sci fi things that we all dream up, but like say meeting recordings, like those happen now basically on a regular basis. Right. So there's some true value in here that was not possible before. Or like just like, yeah, the research cases, like just consuming and finding information about topics that you would otherwise not have looked up. So there's some of those very concrete examples and I think there's a lot more out there. Like coding is another one. Right. It makes just so much progress in such a short period of time. But it's not this like super general stuff, which I think for us is in a way like a challenge because Ray. Cause this is like everything app, right? You open it and then you can type something in. And so finding that middle ground is sometimes hard for us, but it's coming back to like, okay, let's see what people do very, very often every day. See how we can improve those workflows and then go sweat the details to the prototyping and see what actually makes a difference. And then when you find something like that, you kind of can bring it back and then bring it back to users. And that usually like resonates as well because that's what people then are. I used to. But yeah, it's like there's a lot of like exploration and at the end of the day, everybody cooks with the same water, right?
[28:45]
David Pierce
Yeah, yeah. Is there an example of that in the product now that you can think of that feels like that, that sort of medium measure that you either got right or kind of in the middle of getting right?
[28:57]
Thomas Paul Mann
Yeah, it's, it's weirdly the simple thing sometimes like, oh, you open it and like I use it for meetings, for example, all the time. There's like a word pops up, you don't know, you open your eyes, you get the answer done. Like it's those. We also optimize RACOS as a tool for something that you basically use like hundreds of thousands of times. Right. Like it does a lot of like little things that pile up. So it's for those short interactions. Things that we see people use all the time is like just like plain reformatting text and fixed spelling because, like, well, we're still typing all day long, Right. So making those things easier and faster to do. Um, and then people, when they get comfortable with it, they're getting a bit more adventurous. Right. And then it's like, oh, I just happen to download a bunch of files, I need to move them in a separate folder and also rename them so they make more sense. And so they then type in those prompts and see, like, oh, this works as well. And then I take the next step. Right? Yeah.
[29:53]
David Pierce
Okay. Renaming files is actually like a perfect example of the kind of thing I want to talk about about Raycast specifically.
[29:59]
Thomas Paul Mann
Yes.
[29:59]
David Pierce
Because we've been talking a lot on this show and elsewhere about the idea that Satya, Nadella and Microsoft have right now that before long you're going to barely use your computer and it will sort of use itself on your behalf. Just to put my own cards on the table, I think that is not correct, at least not in any sort of near future. But I do think that there is a lot of room for, like, doing computer tasks without having to do the tasks. Right. And I think about, like, all of the things that, you know, we've spent 20 years downloading little tiny utilities to do that that were these sort of like one off apps that are like, batch, resize a bunch of photos. Simple example. Or like, rename these photos with all the same name in sequential order based on when I took them. Like, these are the kinds of things that we do a lot on our computers that are not hard tasks and they're not particularly, like, mentally complex tasks, but it's like a constant part of computing life. It seems like you're in a position where I should just be able to say to Raycast, rename all of the photos on my desktop based on what they are and when I shot them and put them in an order that makes sense. Just clean up my desktop for me.
[31:16]
Thomas Paul Mann
Yeah.
[31:17]
David Pierce
Are we. Are we. Are we almost there? Are we there? Are we nowhere near there? Where are we?
[31:23]
Thomas Paul Mann
We're 90% there, I would say.
[31:26]
David Pierce
Really?
[31:27]
Thomas Paul Mann
In fact, you can do this today in Raycos, we have that, right? You can't do this. And then the 90%, I'd say, like, every now and then, it doesn't work. Right?
[31:35]
David Pierce
Thomas, I'm gonna try this right now while we sit here. And it's not gonna work. And I'M gonna be mad at you about it.
[31:39]
Thomas Paul Mann
I'm scared. But it's possible, right? And you mentioned this sort of super agenda oasis, I think what you mentioned. Right. So where the computer does everything for you. I mean, when we reach that state, we talk about AGI. Right. Then there is a question like, why should I even open a computer? What is a computer at that moment? Right, sure. I think how we think about it is more what's an intelligent os? What's sort of the aios. Right. So how our operating systems will change to adopt it as new feature where everything can be smart and it's not necessarily static. So you mentioned, like you maybe want to have a little app to do something. What if you could have this app just like by asking AI and it builds this little app for you and then you have it for yourself. Right. And then you use it for the job and then the job is done. And then it maybe gets this post and then it's like, that's fine. Right. And so it's like this one off software, this personal kind of software that is like personal to you, but maybe also to your team or your company that is like very tailored to the use case you want to. I think that's something which is quite fascinating as things get smarter and software maybe gets cheaper to build. I think there is something quite fascinating when your operating system becomes similar, right. So where you can just prompt things into existence for a short period of moment when you need them, and then when the job is done, you just don't need to use them anymore. And then tomorrow you have a different one. Or maybe at some point you have apps that are just appearing there as you, as you progress with your day. And it's like, oh, I saw David needs to like, do certain things. Hey, here's a little app for you that you probably can use.
[33:26]
David Pierce
All right, we got to take one more break and then we're going to go back and we're going to finish my conversation with Thomas Paul Mann.
[33:31]
Thomas Paul Mann
Be right back.
[33:34]
Indeed Advertiser
Support for this show comes from indeed. If you're looking to hire top tier talent with expertise in your field, Indeed says they can help. Indeed Sponsored Jobs gives your job the best chance at standing out and grants you access to quality candidates who can drive the results you need. Spend more time interviewing candidates who check all your boxes. Less stress, less time, more results. Now with Indeed Sponsored Jobs, and listeners of this show will get a $75 sponsored job credit to help get your job the premium status it deserves. Indeed.com Vox Business just go to Indeed.com Vox Business right now and support our show by saying you heard about Indeed on this podcast. Indeed.com Vox Business Terms and Conditions apply. Hiring do it the Right Way with Indeed.
[34:30]
David Pierce
All right, we're back. We're talking AI with Thomas Paul Mann. Let's get back into it. You bring up another thing that I've been wondering about, which is I think a thing that Raycast did really well early on was make it really easy to build raycast extensions. Like, it's just a little bit of fairly straightforward JavaScript and you can have something up and running pretty fast. And so you've built sort of an app store on top of Raycast in a way that seems to be working really well, and there's a lot of stuff and it's pretty easy to do. Does that all eventually go away if we get agentic AI that is good enough to just go do all this stuff on my behalf and I no longer need this sort of interim step of somebody built an extension that helps go do it, or is actually what we need lots and lots. Like, should I be using AI to build JavaScript extensions for Raycast, or should I be using Raycast to just completely obviate the JavaScript extensions?
[35:23]
Thomas Paul Mann
Yeah, I mean, fair point. So, yeah, extension was really what put us sort of on the map because we realized really quickly, okay, people just want to integrate and rake us with everything, basically, and there's no way we can build all of that. So we gave it out to community and then we made it super easy to build them. And that allowed us to have over 2000 extensions now in the store. And every day there is new contributions coming and so on and so forth. But if you take a step back, what we really wanted to do is build a productivity platform. That's sort of what we wanted to do. And extensions is almost like an implementation detail or JavaScript itself. But even extensions are an implementation detail, right? So imagine those wouldn't exist for a second, but services still exist, right? You still want to do something with Google Docs or Spotify or you name it, or your files for that. And so the idea was always like, how can you integrate with those things really easily so I can do the job for you. Like, this illusion that we did is like, oh, people can build extensions, you can use them, but you could even equally think about it like, oh, an AI can build something like that for you, so you can use it, and then your extension might be behaved differently. So the notion of extensions becomes almost a bit blurry. Right. It's just like that's evolving software in a way. And even for yourself, you're probably just downloading some extensions, but you haven't built them in the first place or somebody else built them. So it's not too far off for you prompting an AI to come back with a solution for you, but it's tailored towards you. The key thing I think is to make it all cohesive. If everything is different and you can't find yourself around, it becomes quite annoying and not useful. That's why people prefer apps in the first place. And why apps and mobile phones one, because they're optimized for the phone, they follow the same UI and UX patterns and people know how to use them. And so then the mobile app is also kind of like more and more catering towards that. And I think that's going to be similar to make it really useful. We want to integrate with everything around us and make it extremely easy for you to consume that information. And then also because software becomes like free in some way to create at least like little apps, you can transform that however you want to consume it. And I think that's super exciting because we all like slightly different and we have maybe different preferences. I maybe want to see a graph versus you will have a different representation and if you could just change that with just a little prompt and then you have it your way. I think that's super exciting where basically software becomes malleable and you can change it ad hoc and it becomes just what you want and becomes really this personal touch. And that's what I'm personally really excited about. And that's what I feel like Operating systems will evolve into something that is like a personal operating system to you and they're not looking all the same. And software is not all the same. They're like tailored really to the person that sits in front of the screen.
[38:28]
David Pierce
Yeah, it's funny, one of the things I talk about all the time with AIs stuff that I think is actually really powerful is just like simple CSS stuff for styling apps and web pages. Just the idea that all of a sudden what I now have is the power to tell this app that I want it to be blue. And it can be because that is, because like that's a, that's a thing that like Claude code can do, right, is, is change the CSS to make it blue. That that is a thing it is capable of doing. And, and then what you need on the other side is basically just the hooks that give me that tool to Do. And I think what it's been before is like, okay, you have to build a bunch of complicated things and you have to come up with a whole like, how do we display the color wheel? Do we do. And it's like, that's not like an impossible thing to do, but it is a thing to do. But if you just let people plug in that way, you give them all kinds of opportunities and options just by like opening it up to, we're going to let you build this however you want to build it.
[39:31]
Thomas Paul Mann
Yeah, I think that we have all the building blocks. Right, right.
[39:35]
David Pierce
But I think what I'm getting at with the extensions thing is like, as you're thinking about, and I guess just to go back to this, I want to rename a bunch of photos in a folder on my computer, which is a thing Raycast is very well set up to do. If I prompt Raycast kind of out of nowhere to just do that, you have a bunch of tools and you have a bunch of agentic systems that will go try and figure out how to do that for me. Or should I build the thing once, like vibe, code my way into a Raycast extension that, that renames files on my computer and then just use that over and over. Because now I've built a thing that is like reliable and robust and stable and it will do the same thing every time. And the problem with a lot of these AI systems is they don't do the same thing the same way every time. And sometimes that's exciting and interesting and leads you down different roads, but other times I just want it to rename the photos. Like I don't need new ideas about renaming photos, I need you to rename
[40:33]
Thomas Paul Mann
the photos and in the same way all the time.
[40:36]
David Pierce
Exactly. So I think, and especially as you're thinking about this stuff, you're like, okay, well do we want to use all of these AI models in a way to build rigid, structured things that you can then do on your computer over and over reliably? Or is the kind of open endedness of the system a feature, not a bug? And I just can't quite figure out where I land on that spectrum.
[41:02]
Thomas Paul Mann
Yeah, it's a tricky one, but I think like for tools having something unpredictable is like a no go. Right. Like you wouldn't use, let's say, I don't know, something complex like Photoshop. And half of the time the pixel turns red and half of the time it turns blue. Right. Like you couldn't work. Right, right. And so I think that's a strong argument for software, right? So let's say if you can generate the software once, you don't need any AI anymore. It just works and then it does the job perfectly all the time. I think it's like a feature, it's not a bug, it's great. So I think leaning much more towards that because that's kind of what the world runs on, right? It's software. It's like getting written once and then you use it and you can always adapt it, right? Tomorrow you say like, oh, rename the files like this way now and then you can use this. And I think that's like something which is quite nice when you get out an artifact that you can use and that's like what we have at the moment as extensions, right? You get this artifact out, you can use those extensions and use them over and over again. Where we sometimes struggle with is like, yeah, sometimes those non smart things, how you do them, they're like, just because they're so reliable and fast and become the muscle memory are somewhat better in a way. And so you kind of want to find a middle ground. And I think for tasks that are very concrete, you want to have what you mentioned, like just you have an app, an extension, whatever it is, but it does the job, it does it all the time the same way. Great. And there are some other tasks and I feel like they're oftentimes more open ended. They don't have a single solution, they have nuances to it. You don't even know exactly what you want. And those I feel like are the ones that are really good with AI where it just goes out and does something for you and you come back and say, oh, I haven't thought about that. That's cool, that's a nice solution. So yeah, I think there is something nice about the concreteness of software. You write it once and then it works the same way the whole time. Yeah, that makes sense.
[43:02]
David Pierce
Does your quality bar have to be higher than some others because you have this kind of access to all of the apps and even the system. If you wanted to break my computer or allow ChatGPT to break my computer, you could like it's, it's. You have an unusual level of access to my computer in that sense. Does that, do you have to treat this kind of nascent technology differently because of it?
[43:29]
Thomas Paul Mann
There is certainly a lot of like scrutiny there. And then when users come to us, they oftentimes ask us like, oh, is AI running in the background? Can it do something? And so we had Actually to put a lot of like, just like even UI and callout into the product to say like, hey, this is secure, this is not running. If you're not triggering it on, you're in control. So if there is a disruptive action, for example like deleting a file, you will be prompted and you can say yes or no to that. And I think that's definitely something that we need to maybe do more than others, which others can go a bit YOLO in a way. And because we have this like system that you mentioned, like we, we can access your system in a very deep fashion and so kind of need to build up this trust and that's also what people expect from us. Like they used it for years already and it becomes, well, it always works, right? It's this app that basically can never fail because it's like always there. And if you don't have it, people feel like they can't use their computer anymore. And so we put a lot of effort into making super stable. And so that's like in here the same way if you use that, it needs to work basically all the time, which as we discussed is really challenging. Right. And I think this is with machine learning and AI generally it will never be 100% right. This is just the technology doesn't get you there. So it's always like how far you can push it. That's why we have all these benchmarks where all the model providers try to climb them up and be on top of each other, but you will never be 100% correct. And for that it's even more important to have the guardrails right. So if something goes wrong, you can either recover or in idea world, it never goes off rails and you basically give the user the control, which is oftentimes described as like having the human in the loop. Even though that feels like again, a bit of a sci fi term. The human, I mean. Yeah, yeah.
[45:26]
David Pierce
So do you have to be extra careful about that stuff kind of at every turn? Like does it make building Raycast harder? Because you have built in this AI stuff that can do so much but is kind of unpredictable in that way.
[45:43]
Thomas Paul Mann
I wouldn't say necessarily harder, but it's something which we think about from the get go. We say like, hey, we want to build a private company, don't want to collect your data and this kind of stuff. So that is something that we build trust on. You just need to be smart to know what you build and maybe what you shouldn't build. And then when you build it Also in an elegant way and give the user basically the choice of like, do they want to use it? And then if they use it, give them control. You can also say like, hey, always delete my files. Don't ask me for confirmations. That's exactly user configuration. Right? But by default that's not turned on for reasons. And so giving flexibility. Exactly.
[46:27]
David Pierce
Yeah, yeah, yeah, full realism, just whatever. Delete anything you want, go math, see what happens.
[46:32]
Thomas Paul Mann
But then you also want to be smart, right? Like if it's like a rename that you could do, undo, you don't want to like prompt a user for that. So this is the complexity you may be referring to. You maybe need to think a bit more differently about certain things to make sure that the users build up confidence over time.
[46:49]
David Pierce
Okay, what's something you wouldn't build? Like you mentioned things, things you can and can't do because you have this kind of thing. Is there, is there something that feels obviously over the line to you on that front?
[47:01]
Thomas Paul Mann
I should watch out now what I say, obviously, but I think it goes to the privacy aspect. We had certain things, for example, give you a sense of what we felt quite cool. We have this feature called Focus. And the idea of it is basically you can block distractions like websites and other things, and then it basically plants them out. And if you go there, you see a warning and so on and so forth. And then initially we had ideas like, hey, wouldn't it be cool to make this smart so that you don't even need to configure what you want to plug? It just kind of like detects that this is probably a distraction. And then how you would do this is probably you do a screen recording all the time or some screenshots, and then you send them out and then you analyze them and then you come back. But at the end we felt like, oh yeah, this is maybe stretching it a bit too far on analyzing your screen all the time, which we don't really want to do. And what we realized, users probably would be very hesitant. And then we thought about using local LLMs for that. And then we said actually the person that sits in front of the computer kind of knows what the distractions or the better solution is probably just letting them define it. As boring as it sounds, I feel like sometimes that's the right thing, right? I mean we have still intelligence, we can think. So sometimes maybe we can also put in what we want. So that was just one of the things which came to mind, which we sort of first started. I was like, oh, let's make this super cool AI solution. And then you ask yourself like three times why? And then you end up as like, yeah, maybe a more traditional solution actually cuts it here.
[48:41]
David Pierce
That's such a good example because that is the sort of thing that at first glance you're like, yeah, it would be useful if, if Raycast or my system could understand the places that I'm wasting my time.
[48:53]
Thomas Paul Mann
Right?
[48:53]
David Pierce
Cause it's gonna be slightly different for everybody. I spend too much time on Reddit, you might spend too much time on Instagram. And if I could just be like, just delete all the places that I waste time and it could do that. There's something that is cool about that and there is something that is like, immediately horrifying and off putting about that.
[49:09]
Thomas Paul Mann
Exactly right.
[49:11]
David Pierce
What a lot of companies have said forever is just, we're going to push through that discomfort and trust that actually if people will eventually get used to it, we've made it so convenient that they're going to get past the ick factor of this. And I think a, this stuff just doesn't work reliably enough yet to do that in a really sort of predictable way. And the minute I go to like, my work email and my focus session is like, nuh, I'm like, I'm out. Right? Like, you've now broken the system. But, but also, I think frankly, every developer has some responsibility here to say it's actually okay that we're not comfortable with this. And maybe I shouldn't be pushing you to get comfortable with this. Maybe I should be asking you to make decisions because you're a person capable of making decisions not to get over the fact that I'm going to make them for you. And I think we're about to go through a million versions of that with all of this AI stuff. It's like, should we just, just bet on the tech getting good enough that everybody will get used to it or have to, or should we like, continue to make an effort to let people be in charge of their own existence? And like, this gets big and heady and existential really fast, but it does feel like we're encountering that question kind of a million times every day. And even, like, I just keep thinking back to this thing Satya Naedla said about like, we're not that far away from people mostly not using their computers and just directing their computers to use themselves. And I think philosophically there are ways in which that feels wrong to me.
[50:35]
Thomas Paul Mann
I feel like it's always sort of this value exchange, right? What do you get out, what do you put in and what do you get out, right? And so if it's super valuable, people are willing to put certain things in, right? I mean people upload hell stuff to chatbots nowadays and all this kind of stuff, but they're getting something out of it, right? So I think it's always the question like, what is the value exchange here? I think it's at this moment really hard predicting the future. If I would look back two years ago when we basically just started this whole AI wave, right, Would you think the world is as it is right now, where everything is AI? I don't know, it's really hard to predict. Would you think coding have changed that much? Would you think. Pick any topic really. It's really, really hard to predict. And I think it's the, the classic we overestimate the short term and underestimate the long term. In this case, I think it's really like that. I think no idea what's going to be happening in the next six to 12 months. I mean everything changed so rapidly. One thing is clear that those things are here to stay, you hear? Sometimes even if no models progress any further, we by no means have reached the limit of what you can do with even the state of the art. Right. And I think that's kind of like nice for everybody in the industry because I mean before AI, let's be honest, there was a bit of a try phase in tech, right, where everything was hyper optimized and nothing really radical changed, at least in the terms of software. And so now there's a lot of buzz and every week there is something new. And I think even if everything stagnates, we haven't reached sort of the limits what we can do with all the technologies that we invented in the last two years alone.
[52:20]
David Pierce
Yeah, I know. It is strange that it feels like everyone is so busy. I mean it's the self driving cars thing is a perfect example, right? Like every, everybody is so busy trying to invent the absolute end state of this where it's like, what if it reshaped society? It's like no, no, no, no. What if my car parked itself? That's awesome. Let's, let's do that. Like, let's figure out how my car can park itself and then how my car can like run more efficiently. And there are like a million things along the way that are cool and exciting and powerful that don't require like rethinking the way an economy works. And like, let's let's not skip all the steps because those are interesting things on the way to something potentially bigger.
[52:59]
Thomas Paul Mann
Yeah.
[53:00]
David Pierce
Before I let you go, let's just spend a couple of minutes talking about how you use AI in Raycast and in general. Like, where, where does this stuff fit into sort of your day to day life and workflows right now?
[53:12]
Thomas Paul Mann
Yeah. I think the biggest change for me is like, for me it's prompt first. Now basically everything I do, I start with a prompt. Like, well, we launched something. Okay, gotta write a blog post. Let me ramble for five minutes into my microphone and then that's my starting point and then iterate on that. That's one of the things. Oh, I need to answer emails, which I do a lot. Okay, I'm gonna do a lot with AI here writing code same way. One of the things that changed for me quite radically is that you can sort of do things in parallel in the background. Like I can just kick off a bunch of things. Oh, there is a feature request on Twitter. Okay, let me kick something off and address that right away. Oh, there is another one here. Let me do that as well. Oh, I have this idea. Let me kick off some deep research and figure out what's a good solution for that. And it's like, oh, I need to prepare for the board meeting. Oh, let me put a few things together. So I think my brain is completely rewired and it's like I'm prompt first by now and I basically just put things on the. On like start with a prompt and then see.
[54:21]
David Pierce
Do you then wait, I have a procedural question about that.
[54:24]
Thomas Paul Mann
Oh yeah, please.
[54:24]
David Pierce
Do you. If you start everything with a prompt, is the goal then to kind of filter everything out into somewhere or do you find yourself like living more and more of your life kind of inside the chats of these LLMs?
[54:38]
Thomas Paul Mann
Oh yeah. There are sometimes things that are just like inside of our AI chat in Raycost where like this never really sort of produces an output. Right. It's maybe me like chatting for a while through something. Oh, like I have this, like, pick any topic I have. Like, oh, I want to think about how we can land a deal. These are sort of the points we have. Like, what are elegant ways to like, maybe continue the conversation. So how could I, like, find. Find a solution to like reach our customers better and like sort of, it's almost. I think about it as like a thinking partner, like throwing things back and forth and talk to somebody for a bit and sharpen myself up in a quicker fashion. That's how I use it a whole lot. And so that's, I see also in our company it's just changing where more and more people just start with a prompt. A big change that we've seen in the company is all our designers decode now what used to be basically all static designs, then more and more become interactive prototypes directly in our product. So they can get something where you can feel it and see it and it works. And then oftentimes an engineer brushes it up. But all our designers are basically also halfway developers now, which is an incredible change. And I think that just is really nice for creative people as well. Because there was always this barrier of like, oh, you throw a few pixels and then somebody else needs to rebuild them to make it interactive. And so now we cross that bar. Essentially it's just like it's a plant. Like if you're a creative person, then you have the will, you can make things happen. Which I'm super, super happy to see that basically coding becomes more accessible to a way. LLMs are still failing in a lot of ways at that regard in programming, but I think that's like something that we've seen in our company happening really heavily, that designers become also developers. Okay. Yeah.
[56:40]
David Pierce
I think to me, part of the reason I ask is because one of the things that was most sort of unlocking in my brain was the thing in Raycast where you can like, you can basically ention one of your apps.
[56:51]
Thomas Paul Mann
Oh yes.
[56:52]
David Pierce
And then prompt it. And it's like that, that to me is like, okay, now we are, now we are getting to like the sequence of things that make sense together. Right where I, I don't now need, I don't know, I don't, I don't now need a bunch of different, very specific apps. I, I can just ask AI models to talk to the apps that they already have access to. It sometimes works, it sometimes doesn't. My whole clean up the desktop thing has not worked at all as we've been sitting here. Just, just nothing. It gave me a bunch of semi helpful information about the files that I
[57:26]
Thomas Paul Mann
have got to improve it. See, that's the way.
[57:28]
David Pierce
But I, I can do more prompting. I'll figure some stuff out. But I think like there's just, there's something that unlocks when you start to see, okay, here, here are kind of the things that are available to me and you've just seen more of those things than most people. So I was curious to know, like, are you just constantly doing computer activities through prompts now? Like you're, you're starting by. Everything starts with a prompt.
[57:54]
Thomas Paul Mann
Yeah, pretty much. Like, basically for me it's, it's a lot of like, I'm in a browser, I have a few tabs open, I pull them in with my ad browser. Essentially I get all the tabs in, then I start from there. Then I say like, oh, by the way, put this in a notion page. So then it ends up in a notion page and I can share it with my team. Then I iterate on the notion page. I do those things, like quite a lot. But also I let it write code for me to do certain tasks. Like I had recently. A bit of a silly example, but I had to do my text return. Well, I didn't do the text return with AI, right? But for that I needed to download all my payroll and all of them had a password. So I was just asking AI, like, hey, take those 10 PDFs and here's the password. Can you remove it so I can send it to my accountant? And it did it for me. It just wrote some code. I didn't really look at the code because I kind of know, okay, that's like what it would do. And then it like perfect. Otherwise I would have spent like, I don't know, five minutes going over each PDF, first of all, figuring out how to remove a password, which I have no idea. And so I think that's, I think the change, which I'm quite happy about. And it's like for programmers, this has kind of existed for a long time. We call these things scripts. It's like little things that a programmer, every programmer you ask, they have a script for various random stuff that I do multiple times a day. What is if this script is just natural language? Like what if you just say this and then to your point, if it solved the problem once, just reuse it so you can use it like many times. Right. That's, I think like those kind of little things that will make a big difference. And that's what we do with Raycast. We want to speed up every little thing. And you use Raycast hundreds of times a day. How can, what, what are the next hundred things you should do with Raycast? That's how we think about it. What are the problems we can solve that you use actually super often and not just like once a year or whatever. And that's like, I think the journey we on. That's pretty cool. Yeah.
[59:51]
David Pierce
And it's like once you have computer access, the number of things that can start to comprise becomes just Enormous, Yes. And you have access to the browser and it's like, again, this is why I think Raycast is so fascinating because you have, you, you can see the whole stack in a way that is very hard to do for almost any other app. It means the trust bar for you is very high. But it also means, like we talk a lot about, you know, these AI agents just can't see and do all the things that they need. Raycast kind of can.
[60:27]
Thomas Paul Mann
Yeah. I think like that's the nice position to be in, like being at this position to do all of this kind of stuff. But we still got to connect all the dots and build up the discoverability, as you mentioned, make sure that people get it and also make sure people get real value out of it. I've seen so many demos of cool stuff, but then you're never going to use this day to day or only so little that it doesn't really play well. And so that's like, for us, really the challenge, like natural language is great, but its coverability is hard. You don't know what's feasible and so on and so forth. But yeah, like, I'm excited about this helping. Basically making your computer smarter by using the same apps and tools you have, by having one AI that kind of follows you around across your journey on your computer and not having like an AI in every app and it's like everything is like isolated. We've been there with apps, right. It's kind of like annoying and we don't want to spread that again, that all our knowledge, memory, context lives in each and every app. And I get it. Like every company of those apps want to have this, right? They want to lock you in so you stay in that single app. It's like the financial things that they want to have, right. They don't want to give it away. But if you purely think from a user's standpoint, AI should be on the operating system level. It just makes so much more sense to be there instead of like in every app and every app needs to rebuild. Just happened to be this gold rush that everybody sees. But truly from a user's point, I feel like the best thing is if you have a smart operating system that helps you to get your job done. Yeah, I agree.
[62:11]
David Pierce
All right, Thomas, this has been very fun. Thank you so much for doing this with me.
[62:14]
Thomas Paul Mann
Well, thanks for having me. Long term listener and finally making our way here somewhere together.
[62:21]
David Pierce
We did it. All right, that's it for the show. Thank you to Thomas again for being here and thank you to all of you for watching and listening. As always, if you have questions, if you have raycast extensions you want to tell me about, if you have thoughts, concerns, feelings about any of this, I want to hear all of them. You can call the hotline 866 verge11. You can email vergecastheverge.com, i'm davidge.com, hit us up. I think this question of how AI belongs in our software is big and fascinating and messy and I want to know how you feel about it. So get at us. Ask us all your questions. We have another one of these coming up next week about a very different kind of app that I'm very excited to talk about. We'll get to that. But for now, the vergecast is a Verge production and part of the Vox Media Podcast Network. The show's produced by Eric Gomez, Brandon Keefer and Travis Larchuk. We'll be back on Tuesday and Friday with all of your usual good vergecast stuff. We'll see you then. Rock and roll.
[63:16]
Thomas Paul Mann
Fox Creative.
[63:19]
Hiro the Cat (IM's Pet Food Ad)
This is advertiser content from im's Pet Food. Hey humans. My name's Hiro. I'm a cat. I'm here to give you a crash course on how we went from fierce hunters to the floofy friends you can't live without. Although, let's be honest, we could probably live without you. Around 10,000 years ago, humans started farming, which accidentally created a rodent hunting bonanza that meant full time employment and activity for us. Humans were like, okay, these guys are chill, they can stay. So unlike dogs.
[63:59]
Thomas Paul Mann
Ew.
[64:00]
Hiro the Cat (IM's Pet Food Ad)
Sorry. We basically domesticated ourselves. We chose you. Fast forward to today. Some of us may not get as much hunting in and okay, I admit, maybe we can get a little chonky. But you, you can help keep us healthy and active with Eins Healthy Weight cat food now available in stores and online.