Loading summary
Kip
On today's show, we're going to show you how to save over an hour a day with an app that makes typing obsolete. You're never going to want to type again. I've got the world's leader with voice AI Alan Go from Willow Voice with us today. Let's get to today's show. All right. On today's show, we have a very much underutilized, under talked about aspect of AI. You know, you hear a lot about large language models and browsers and you hear some about audio, but you rarely hear about voice AI outside of things like Siri and Alexa. And today I'm with Alan Go, who is founder of Willow Voice, who is pioneering a lot of those kind of voice AI use cases. And so, Alan, welcome to marketing. It's the grain. Thanks for being here.
Alan Go
Hi, Kip. Yeah, thank you so much. I'm really excited to be on here.
Kip
Yeah. And so, Alan, one of the things that I think is interesting is when somebody decides to like dedicate all of their waking moments to something and you've decided to do this with voice AI and so I was wondering if you could just help people understand, like, why, why is voice AI important and how is it going to become just a part of how we all work and, and live on a day to day basis?
Alan Go
So my journey really all started when we were building in healthcare and so we were pivoting a lot in healthcare and every day we were interacting interfacing a lot with doctors. They were telling us that they were spending all of their time working on documentation for the patients. You know, they're spending their weekends, they're spending after clinic hours working on this documentation. So when AI scribes came along, they, they said this was the best invention they've ever seen since ChatGPT. They suddenly got all their weekends back. They were looking forward to clinic days again rather than being worried about them because they no longer had to spend all their time on this documentation. And when I saw this, it felt like the new way to work with voice. And also around this time I was visiting some family in China. I noticed that everybody that uses app called WeChat and they love to send voice memos. And now if, you know, sending a voice memo, it's really effortless to send these voice memos, but really painfully long to listen to.
Kip
It stinks to get a voice memo if there's no like transcription and you just get the raw voice file.
Alan Go
Exactly.
Kip
Brutal.
Alan Go
So it clicked to me that, you know, voice is the fastest way to communicate thoughts to paper but text is the fastest way to process information and that's why if we could just find a way to combine the two, which is now possible with voice AI and a speech could land as polished and ready to use text on your computer, then the keyboard itself could be unnecessary, which is what we unlocked at Willow. Willow is a voice dictation app that lets you write anywhere on your computer. You can think about it like Apple's built in dictation if it were actually fast, accurate and intelligent.
Kip
Good summary.
Alan Go
Exactly. Yeah. And people love using it for anywhere from like email, Slack messaging, prompting, AI and more.
Kip
Okay, so I'm a Willow user. It's been a game changer in my workflow. I sent you a prep email for the show today. Just using Willow took me like 30 seconds. It would have taken me 10 minutes before. Massive time saver. We're going to do some demos, going to talk through some of the use cases that you're going to want to think about adopting with Willow or any voice AI product for that matter. One of my thesis here, Alan, that I would want to talk with you about is that AI is going to make like content formats, like obsolete. Like there are better formats for certain things than others. And what I'm positing is that in a year or two we will just get the perfect format of information in whatever modality it needs to be text, audio, video, whatever. And it seems like voice AI and dictation is going to be like a big part of that. Do you buy that? Is that what's going to happen?
Alan Go
Yeah, 100%. And we're already seeing voice AI change so many modalities and just how people work in general. Because in the past or historically, voice AI has never been something that has worked. If you think about tools like as you're mentioning, Alexa or Siri, they've sucked because, because they were inaccurate, they've been clunky, they've been terrible interfaces. But of course, with AI that's made voice finely intelligent and context aware, we have infrastructure to support that at a low cost and real time. And we have transcription actually just to be accurate in general. It's actually opened up a whole new playing field of use cases and it's unlocked a whole new level of voice productivity for people in general.
Kip
Yeah, I think that's the thing. It's like what you're trying to tell people is like, okay, the technology's actually finally here. I know you tried to do this stuff over the last five to 10 years and it's been terrible and I will tell you, as somebody who's been using it, it is actually finally here. Not only is it accurate, but man, is it lightning fast. Now that's the thing that's blown me away was I was like, oh, I can see with these large language models that we can get to accuracy. But the raw speed is so impressive to me. Like, it's wild.
Alan Go
Yeah. And it's only going to get better over time. As you know, these things get instant, if not near instant.
Kip
It already feels that way, by the way, for somebody who started using this more and more. So if the technology's here now, walk us through, like, what are the core ways that people should be using it? I want to talk about that. I want you to show us some demos. I wanted to spend a few minutes helping people understand how they can transform how they're working. Maybe doing marketing, doing sales. Like if you're a sales rep, sales emails with voice dictation is like a game changer, for example.
Alan Go
Yeah, 100%. So I'd say the biggest use case that I've seen has been Async word communication. So we've seen people handle all of their word communication, that's slack email, personal messaging, all of those things that they typically in the past have to sit down, type on their keyboard, write out that message. They now just speak to a voice AI, like a dictation tool like Willow, and they speak out all their thoughts and that, that AI transcribes what they say, it formats, it understands the direction that they're trying to go, this tone, the style, everything, and it writes it out instantly for them in front of them. And so that's a massive productivity boost to tech leaders, for example, who have tons of work communication or busy professionals in general who have tons of communication to be doing, because voice is so much more natural and fluid and obviously it's faster than you would be typing. It just saves them so much time. The second big use case that I've seen is with AI prompting. Now, I know that you show a lot of workflows and prototypes built by AI tools like ChatGPT or Claude. Yeah. So I think this is a really neat productivity or life hack tip for the audience. Next time when you're prompting AI, don't even just type it, just use a dictation tool.
Kip
Can you pull up Willow and just show us how that works? This is my primary use case. It's why I'm being selfish. But you are probably better at the BE. It's like using the ChatGPT app and Willow, to just get stuff to an LLM in the most efficient way has been life changing for me.
Alan Go
Exactly. Exactly. So, for example, I can prompt Claude here. And this is already what engineers and power users are doing already. Because if you think about what prompting is, if you're trying to build an app, for example, you're explaining to a really smart friend like Claude what the technical specifics look like, what the ui, what the UX look like, and you're essentially just writing really long paragraphs to a smartphone like Claude to build it out for you. And obviously, would you rather be typing out those paragraphs to a smartphone or would you be rather speaking that. So for example, I can just press the function key on my keyboard and just be like, hey, I want to build a really cool reminders app that allows me to put all the things that I need to remind it of in my day to day. And whenever I finish one of my tasks or whenever I'm reminded of item, I want there to be like balloons and confetti showing everywhere. And I want the UI to be green looking with a green type of brand. Let's see how that turns out. And you can see in a matter of seconds.
Kip
It's so fast.
Alan Go
Exactly. Yeah. What would have taken me longer to type? I can just speak that out. I can brainstorm and brain dump and put that all in my.
Kip
Yeah, before you even like run that, what happens to me, and I think it happens to a lot of people, Alan, is like, we just kind of get sick of typing. Yeah, it's like, yes, I have more instructions I want to give, but I'm just kind of like sick of it. And because of that, I'm just going to stop here. And I think what I've found, and I think what you found is like when you're just talking, you can just like just sit there and talk for a minute or two and get much better details and prompt. And the hack here is do exactly what you just did and then have it turn it into the right prompt for you so you can give it all the context. Then it'll reformat it into a great prompt and you can run through and you can get to that spec and like basic version of the app in like minutes.
Alan Go
Exactly. And I really like how you said that. It removes the initial inertia of like, creativity because we've also seen a massive use case for content creators in general. So we all know that feeling when you're sitting in front of a blank doc and you're trying to come up with Ideas, creative juices to write, maybe website copy or social media posts or all different types of things. You know, typically you'd have that initial inertia of typing, but with voice AI, you can just simply speak what's on your mind. You can brainstorm out loud and that AI will, you know, clean it up, format it, rewrite it a bit better for you. And so we've seen tons of people write everything from like LinkedIn posts to website copy with Voice AI, and they love using it to brainstorm.
Kip
Yeah. There's a songwriter, Jason Isbell, who I love, and his thing about creativity and writing is that you get it all out and then you edit it later. You know, you whittle it down once all the information's out. And most of us don't write like that. Right. We're in our head. We write, we are editing in our head before we get it out. But we talk like that. Right. And so it might sound strange, but just expressing the ideas in a different actual way can get you better ideas.
Alan Go
Exactly. Let me show you another use case with email as well. This is what, you know, tons of busy professionals and tech leaders use every single day and sort of the moment that makes it magical for many people. So, for example, if I want to send out an email, I can just press a hotkey on my keyboard and just. Hi, John. I recently discovered a product called Willow and I think it's super cool. There's so many different things I could do. One, it can automatically format my email. Two, it can automatically correct my words. Three, it even cuts out my filler words. I think it's super cool. Best, Alan. And in a matter of seconds it comes out and there's a couple of things to note. It automatically formatted my greeting. It automatically formatted my signature. It even understand that I was trying to create a list and it breaks that down in the perfect structure. Another cool thing to note is that it knows that I like to use an exclamation mark at the end of my emails and understood that I have that personal style choice and it put that in.
Kip
And Alan, it knew because you were in Gmail that you were doing an email and it needed to be formatted in this way. Is that right? I just want to make sure I'm understanding. Everybody watching is understanding it, right?
Alan Go
Yes. So, exactly. So it's context aware. So it's able to understand that if for an email, it should be styled like an email. If I were in Slack, for example, it'll understand my context and write it as If I were writing a slack message and it'll be that style that's super cool.
Kip
You must fly through email, man.
Alan Go
Exactly. And you know what? If this style didn't work for me, I could just highlight this text and I can say, hey, Willow, can you make this a lot more casual? And you can see now that it just changed what I wrote into a more casual language, which is way better.
Kip
Than going in and just editing it. As a Willow user, I did not know I could do that. I'm going to do that in the future. That's pretty cool.
Alan Go
Yeah. And so a lot of people, for example, will go back into ChatGPT or Claude to maybe rewrite an email more formally or more casually. Yeah, it replaces a lot of that context switching between apps. And you can all do it all in one place with voice. I can just generate the entire email myself. For example, if I wanted to say like, hey, Willow, can you write an email to John in a pretty casual tone and tell him I'm going to be late for the meeting? And you can see just like that. Willow can just generate that email for me and it understands my writing style already, so it can just do it.
Kip
That's awesome. Email is something we all have to do and that's a way better way to do it. And I've been starting to do it, but, you know, you just gave me a couple pro tips that I did not know that I could do. So I'm pumped for that.
Alan Go
Yeah.
Kip
AI agents are transforming marketing. They're changing it as we know it. And the old ways of marketing, they're gone. The new ways are really agent first ways. And Kieran just wrote this amazing blog post about how to think about marketing and do marketing in an agent first world. In a world where agents might be buying from agents or agents are facilitating ways people can buy. And there are these three very specific changes that he outlined and that if you're in marketing today and you are not clear on what these changes are, you're missing the boat. You're gonna get left behind. And we don't want that. We wanna help you stay ahead of the pack. And so you wanna read Kieran's post right now, you can click the link in the description below. That is gonna give you the blueprint you need to do marketing in an AI first world. Okay, so what we wanna do now is actually show you how quickly you can iterate through, through like a marketing idea, marketing concept. Alan, I know you and the team at Willow Voice just launched personalized dictations where you can kind of personalize how the dictation style is going to be. Could you maybe just show us how you would go and make a product page for that feature using, like Willow and Claude and show how quickly you can iterate through, like in this marketing use case.
Alan Go
Yeah, of course. So opening up Claude here, let's just see how fast we can be with this. Right? I can just start dictating and say, hey, so pretend that you're like a really smart and expert, big expert in writing website copy. I'm creating this new product page for our landing page that's going to be highlighting our personalization feature for context. Willow is a voice dictation app that lets you write anywhere on your computer. You can think about it like an apple built in dictation, but smarter, faster, and more accurate and intelligent. And so some of the things I want to highlight on this page are that personalization, it allows the product dedication tool to match your style and tone and writing over time so that you don't have to go back to your keyboard and have to make those edits. Let's see that with an initial prompt and see what Claude can give us here.
Kip
So that was literally like less than a minute to get a pretty good creative direction. Like, this is what you would tell a marketer if you were hiring somebody to do it, right?
Alan Go
Yeah. So I see now that it's. What is it creating?
Kip
It's creating an artifact around the page looks like right now. So what I do love about Claude, and just a quick reminder for everybody is Claude is really good at the writing and creative use cases. And I do think it does a great job of not just coming up with good quality, but also packaging it in these artifacts in a way that's like, easy to kind of review and understand. And also, Claude loves to code. So it's writing a bunch of code and everything for us. Because what you said, you said a landing page. You didn't even say landing page copy. So it's taking you at like, oh, he needs like a whole landing page, not just some text, right?
Alan Go
Yeah. Oh, there you go. And we got the landing page and we got like the structure of it down.
Kip
It's even in your brand colors. Close or close, right?
Alan Go
Yeah. Let's see.
Kip
The design fidelity is not great, but we got essentially a like V1 of a landing page and about, you know, two minutes.
Alan Go
Yeah. And you know, of course, maybe if I want to change different things. Yeah. I can be like, hey, so I actually want to add a couple more things to highlight on this product page. You know, I want to emphasize that the product is private and secure so that people can use it without worrying. You know, I actually also wonder how we can make this page a bit more modern and sleek. I think it's. It doesn't look that great yet. Can you make some changes there?
Kip
What I love about that is that that is like what every founder ever would say about something. It's like you're mastering founder feedback already. It's like super marketing critical. As somebody who's done marketing for founders, it's like, it's a tough game, but now it just all goes to Claude and it's great. So I think for folks who may have never seen, like, a voice AI app, you have on your machine running a standalone app, that's Willow Voice. And then if you are in Claude or you're in Slack or you're in imessage, whatever you're actually working in, you're just holding down a function key and talking. And it's basically doing the dictation in the Willow Voice app, but also putting the text in the pane that you're working in directly. Right. Cause that was, like, kind of counterintuitive for me, like, the first few times I did it is why I'm bringing it up. Yeah.
Alan Go
So exactly. Willow is an app that you install on your computer. So it's at the system level where it understands, like, your accessibility. It can paste text anywhere on your computer and result it works anywhere on your computer.
Kip
Yeah. So you gave your creative direction on the landing page. I gotta say, it's a lot better.
Alan Go
It is a lot better.
Kip
V2 landing page was like, way better.
Alan Go
Yeah. The first part's pretty sleek, actually.
Kip
Yeah, it's pretty sleek. It hit your security concerns. It even decided to make, like, a creative choice to make them green and, like, and call them out, which I thought, I think is interesting. It's like the subliminal, like, green color tones.
Alan Go
Yeah. And again, what really mattered to me in the beginning was just the website copy, which is great here.
Kip
Yeah, exactly. And so what's really cool is that you can get here quick. With AI, myself and Kieran, we've done tons of shows on creating landing page and copy and everything. With AI, we got here faster. With voice AI, though, we got here in like five minutes versus, like 10, 15, 20, 30 minutes, which has been my experience previously, because you can just get through the iteration cycles faster. Because, like, what happens when I'm not using voice AI is I'm reviewing a page like this, and I'm thinking about it, and then I'm thinking a little bit more about it than I'm typing. And literally all you did was just provide the voiceover to your review in real time. And that allowed us to save a ton of time.
Alan Go
Yeah, it's just like how I'd speak to a really smart friend. It saves so much time to just speak to them with my words rather than having to type long paragraphs to them telling them what I want to do.
Kip
It's not like the 1800s where we're writing long letters to each other, right?
Alan Go
Yeah, exactly.
Kip
Okay, now let's look at the Willow app because I want people to understand, like, what's actually happening. Because I imagine most people watching the show today has probably never used any type of voice AI app.
Alan Go
Yeah, let me show you. So you can see here, I've dictated a lot of words.
Kip
Yeah, you're almost at 100,000 words dictated. I'm nowhere close to you. I think I'm at like 5,000 or something.
Alan Go
Yeah. And the thing is, I also have multiple accounts, so I think my total is actually like closer to 500k.
Kip
I bet that's probably true.
Alan Go
But you can see here that the Will app is actually made to be super simple and easy to navigate. Because the thing is, with most dictation tools in the past, they've always been clunky and hard to use. Experiences that have been hard to navigate and hard to figure out how to use. I can remember Dragon dictation specifically. It's like you have to watch hour long tutorials to be able even to start using dictation. It's crazy. And they sell these DVDs or tutorials on how to use it. So with Willow, we've really tried to change that experience to be really simple and easy to understand from the beginning. So you can see here there's only really three sections you have to care about. The home section, the dictionary section for adding specific niche terms that Willow might not be spelling correctly. And then of course, the personalization page. And that gathers your insights to eventually use them. When you're, for example, writing in Superhuman, you're writing email. Willow now knows to add a period to separate NDAs rather than EM dashes because it knows I don't like to really use EM dashes or knows how to spell a specific name correctly. These are insights that I gained just by me dictating and me making those edits to my keyboard. If I don't like it, it learns over time so that it doesn't make that mistake again. But you can see here it's really easy to navigate where the user doesn't really have to customize much. You know, they don't have to customize prompts to make it write better. They don't have to go in their settings and change different models or anything. It's just the most premium and simple to use product right now to create that magical experience of dictation and write anywhere on your computer.
Kip
Got it. And if you go back to home, I didn't want this to like go past people. All of the stuff we've done on today's show, for example, that went directly into Claude, which is some of the use cases we've been seeing. It's also here in this app. What's happening is everything's always going to be in Willow Voice, but whatever other app you're working in while you're holding that function key and dictating it, it's going to auto put the text there instead of you having to copy and paste every time.
Alan Go
Right, Exactly.
Kip
Which I did not understand the first couple of times I was like, wait, it could do this? That's magical.
Alan Go
Yeah, it saves the transcripts local in the computer. So if you're ever dictating something and you lose it, for example, you can always go back to your history and copy and paste it. Put it right back. You can also retranscribe it or look back at it for history. Anyways, that's pretty cool.
Kip
I mean, I would also say you could actually keep an amazing personal diary just summarizing your day, one minute every day, and just having it auto save into a Google Doc or anywhere would be incredible. Just a random thought that I just had. Okay, so if you think about some other use cases, I'll give you some other use cases I've had. It's like, if you're a marketer, what do marketers have to do a lot. They got to do presentations a lot. You can actually just record your rough draft of your talk track and then use that to basically generate slides with an AI slide builder like genspark or Gamma, or to create an outline. And like, that would have just been impossible before. You would have done a zoom recording. You would have gotten the recording, you would have done the transcript, you would have then put that into ChatGPT instead, make an outline and instead you can just do that way faster. Which is really cool.
Alan Go
Exactly.
Kip
Alan, talk to me about note takers. ChatGPT recording. There's a lot of voice stuff out There, what's best for what? When do you use a dictation thing like Willow or one of the others that are out there versus like ChatGPT recording or a note taker, that type of thing.
Alan Go
So I'd bucket ChatGPT's record feature into the same category as like an Apple built in dictation or something like that, where they focus more heavily on transcribing that text rather than understanding what you're trying to say. And that's the key part which makes Willow so great. It's because it has this extra AI piece that interprets what you say. So if you really want to be able to use voice to unlock productivity in so many different use cases like your email or Slack messaging or prompting AI or just communicating or writing things, you really want that AI to be able to understand how you style your text, how you format things, how you write your text in the direction where you're trying to go. And that's where Willow really shines. And even for a use case like AI prompting, where the style or formatting doesn't really matter, Willow's still better just because of the pure speed. I mean, it's three times faster than built in dictation tools or ChatGPT record features. And of course having it on your keyboard as a hotkey and that speed to combine with it, it really keeps you in that flow state when you're trying to come up with new ideas or even prompting an AI tool.
Kip
Yeah, I agree. And note takers are much more like group conversation stuff. Is that how you think about them? That if you're going to have a big meeting or you're in person with multiple people, then you need like a note taker kind of AI application versus voice dictation because you got a lot of voices. What matters then is the context of the conversation and who is who and who's saying what. Right?
Alan Go
Yeah. I'd separate like a note taker, like granola versus a dictation tool. Because a note taker, it sort of listens and transcribes what you say and then after the end of the meeting it turns out a summary. Whereas dictation tool, it's more like real time. It does like the use cases where you'd be maybe writing an email or sending a Slack message. That requires, you know, faster latency and formatting and understanding how you're writing.
Kip
What I have found with AI tools in general is that the more focused they are, the better they are. What you have is a lot of people using note takers for a lot of different things, for example, and it's really good for what you just said. And AI voice dictation apps like Willow and others are like, it's really good one on one. If it's you and you're trying to get your ideas out, communicate, get through the path of work you're trying to do, that is far and away the best, most efficient path. Right. Versus like recording zooms or using note takers or any of the other kind of voice AI products that are out there.
Alan Go
Yeah, 100%.
Kip
Okay. So what we've talked about today is like using voice AI to save a bunch of time. How much time do you think it actually saves? I imagine you guys have done a bunch of research, like what does it get you back in doing this?
Alan Go
Yeah. So if you just looked at my dashboard, it says that my words per minute when using dictation is 180 and most people are actually way faster. I think on average when people are using dictation it's 180 plus 200 words per minute and more. And I imagine it's a lot faster when you're prompting AI and you have a lot of ideas in your head. Maybe you're an engineer, brainstorming and brain dumping. So when you compare that to the average typing speed of people in the world, it's 40 words per minute. So that's literally a four to five times increase in speed and productivity and time saved just by using dictation rather than having to type things out.
Kip
Yeah. If you are somebody who does a lot of email, slack messaging, writing, like it's going to save you like an hour plus a day. Right?
Alan Go
100%.
Kip
Just the raw math of that backs into that.
Alan Go
Exactly. Yeah.
Kip
We talk a lot about AI productivity, but it's like kind of hidden productivity. Like this is actually real productivity because you can actually measure it because it's just making the stuff you already have to do in your day to day job much, much faster. Which may seem a little less flashy to some people, but is actually just way more important.
Alan Go
Yeah. And not only does it save time, it makes you feel better doing your job as well. Because imagine going through 100 emails where you can just speak through them rather than having to type all those out. So there's not only this actual time saved, but it actually just makes you feel better in your mood. Better.
Kip
Yes. Like the whole aspect of doing it is better. You know, like I almost make it a kin of like when you're driving, you're singing along With a song, like, you just feel better because it's like, I'm communicating, I'm getting some stuff out there. It's like a little bit better. As somebody who's been using it, the thing that surprised me, it's like, oh, it's kind of nice to just not sit by myself, quiet, not saying a word for an hour at this desk, and instead, like, feel at least like I'm almost talking to the person I'm sending a slack to or an email to or whatever. Right?
Alan Go
Yeah. Yeah.
Kip
Okay. So we went through some of the big use cases. We're going to save a bunch of time. You can use it to get ideas into LLMs, to review marketing materials with LLMs, much faster to do kind of any of the async messaging you do, Email, slack, those types of things. Any other advice for folks when it comes to voice AI anytime they should not use it? Like, what would be kind of your closing advice here right now?
Alan Go
I say voice AI, it works better for very repetitive workflows like, you know, messaging, email prompting, AI. In the past, I would say that it doesn't work as well for things like personal messaging because there's so much, so much more style needed. But we just, you know, obviously released a new personalization feature that actually makes that better.
Kip
It is a lot better now.
Alan Go
So you can, like, text messaging things and still sound like you, you know, if you don't use periods or different types of formatting and text messages. You know, I actually really just want to say that voice AI can replace everything on your entire keyboard and your typing workflow. And that's sort of the vision that.
Kip
We'Re headed towards, which is wild to think about. How much less do you think you type now? Are you typing, like 100% less, 50% less, 20% less? Like, where do you think you're at?
Alan Go
I'd say I'm typing 90% less.
Kip
Wow. Okay.
Alan Go
So actually, still, right now, us, the will of founders, Us too. We still handle, you know, all of our customer support cases and we get, you know, 50 emails every single day with different questions or maybe they have a problem. And I'd say the biggest life hack for us has just been using our own product, Willow, to be responding to those emails. And we're able to do that so quickly and still do it ourselves and still have time because we're using dictation.
Kip
Got it. As somebody who doesn't love typing, I have found the cultural change of getting used to dictation is real. It has taken me a while to adopt and really get in the habit of it. And I know there's probably some hesitancy out there on terms of like, is this technology actually ready? I think we've shown today that it is. And I will tell you in my uses, at least with Willow, and I know there's Whisper, there's a bunch of other products out there too, that people are using. It's like they generally have gotten good. And I think to your point, the biggest challenge is in understanding the very personal context that you might have with one person versus a different person. Right. And I think you have gotten down that path of personalization and that'll, it seems like, keep getting better and better. The last topic here is like, what's next? What are the things that need to happen for dictation voice AI to really be amazing, be mainstream?
Alan Go
You think so? When we're just thinking about the technology itself, it's obviously going to get better. AI is going to get better, and that's going to allow for infrastructure that's going to support instant, if not near instant, latency. We're going to see bigger context windows that's going to allow for voice AI tools to have deeper integration into your apps and workflows to be able to better understand you and better interface with your entire life. And obviously that's the direction where Willow is headed towards as well. But I think the most exciting part for us as a team is just thinking about the user experience layer of voice as a dominant interface in the future. Because when you think about every new, you know, change in human computer interface, whether that's the mouse and the keyboard, and then it was touch, every single change has opened up a whole new design language for how we use computers. When you think about touch, we had like scroll gestures, we had pinch in, pinch out to zoom in, zoom out. We have even these, like, three lined hamburger menu items. None of these existed before and it wouldn't have made sense at all even for a mouse and keyboard. So when we think about voice as the next primary interface, we think about what could that entire user experience be? And it's gonna be entirely new. I mean, a couple questions which we have in our head are like, are abbreviations gonna exist when you type those out to people? When it's just as easy just to say that are emojis gonna exist when voice AI is gonna be so good that it interprets your tone and what you're trying to say before you even sort of send that out?
Kip
Right, right.
Alan Go
And what is the output of that going to look like when you can actually understand the tone of what you're saying. And so we think about all these different things that are going to change as voice becomes a dominant interface. It's going to be a completely new user experience and we're just excited for the future, to be honest.
Kip
Yeah. And so if I'm a marketer out there or I'm a founder and I'm listening to this, it's like, oh, I can save some time with voice AI. That's my immediate here and now. Right. I think what we're saying is in the future the modality of how people communicate is going to change significantly and the user experience around that's going to change significantly. I think especially like if you think about it, since the dawn of websites, they've just been hands on keyboards. Right. Like, we don't really talk to websites. We will start talking to websites. Yeah, right, yeah. And you're seeing that start to happen a little with a few of the AI sites, but I think in a couple years that's just going to be commonplace. There will be some use cases of the web browsing experience that will be voice LED versus, you know, hand led or typing or clicking. Right. That's pretty wild. And so I think if you're a marketer, you're a business owner, you want to keep an eye out for the trends and the evolutions of user experience that's going to happen as this and the other AI technologies get better and kind of shift that user experience. I think that's the right call out to make for everybody and.
Alan Go
Exactly. Because typing was honestly just, you know, a temporary solution for humans to be able to interact with their computers. But as we think about the development of AI and these speech systems, we're going towards a direction where a computer will have the same fluency, a human fluency, and understand it just as you would be when you're talking to another person. So when you think about that, it's going to completely change the interface of computing as a whole.
Kip
It's mind blowing to think about. I suspect that this is the first of many shows we'll have over the next year or two on the topic of voice. And Alan, I really appreciate you coming on, giving all of our viewers a breakdown of what the heck it is, some things they can do now and some things to look out for in the future. Thank you so much today for joining us on Marketing Instagram.
Alan Go
Thanks so much, kid.
Kip
Awesome. We'll see everybody real soon on our next episode. Sa.
Podcast Summary: Marketing Against The Grain – Episode: "Everyone’s Using AI Wrong – This Is the Real Unlock"
Released on August 7, 2025, by Hubspot Media
In this insightful episode of Marketing Against The Grain, host Kip Bodnar engages in a deep conversation with Alan Go, the founder of Willow Voice, a pioneering voice AI dictation app. The discussion delves into the transformative potential of voice AI, its current applications, and its future in reshaping how we interact with technology and perform daily tasks.
[00:00] Kip Bodnar opens the episode by highlighting the underutilization of voice AI beyond mainstream applications like Siri and Alexa. He introduces Alan Go, emphasizing Willow Voice's role in revolutionizing voice-based productivity.
Alan Go shares his journey into voice AI, initially rooted in the healthcare sector. He explains how doctors benefitted from AI scribes by reducing the time spent on documentation, thereby reclaiming their personal time and improving their focus on patient care.
Alan Go [02:22]: "Voice is the fastest way to communicate thoughts to paper, but text is the fastest way to process information."
Willow Voice is presented as a sophisticated voice dictation tool that surpasses built-in options like Apple's dictation by being faster, more accurate, and contextually intelligent. This allows users to write seamlessly across various platforms, including email, Slack, and AI prompting.
Kip and Alan discuss the historical shortcomings of voice AI, citing examples like Siri and Alexa, which were often inaccurate and cumbersome. However, recent advancements have made voice AI more reliable and efficient.
Alan Go [04:32]: "With AI that's made voice finely intelligent and context-aware, we have infrastructure to support that at a low cost and real-time."
They emphasize that the technology is now mature enough to replace traditional typing in many workflows, significantly boosting productivity and enhancing user experience.
Alan demonstrates how Willow Voice integrates with AI tools like Claude, showcasing real-time dictation and content generation. For instance, Kip illustrates how he used Willow to compose a prep email in 30 seconds, a task that previously took him 10 minutes.
Key Use Cases Discussed:
Async Word Communication: Streamlining email, Slack messages, and personal messaging by speaking instead of typing.
Alan Go [05:30]: "Voice is so much more natural and fluid and obviously it's faster than you would be typing. It just saves them so much time."
AI Prompting: Enhancing interactions with large language models by dictating prompts instead of typing them, allowing for more detailed and nuanced instructions.
Content Creation for Marketers: Facilitating brainstorming sessions by speaking ideas out loud, which the AI then transcribes and refines, overcoming the initial inertia of typing.
Email Composition: Automated formatting, correction, and personalization of emails, maintaining the user's unique writing style.
[10:54] Alan showcases how Willow intelligently formats an email, adjusting greetings, signatures, and even stylistic choices like the use of exclamation marks based on the context (e.g., Gmail vs. Slack).
The conversation contrasts Willow Voice with other dictation and note-taking tools. Alan categorizes tools like ChatGPT's record feature and Apple’s built-in dictation as primarily transcriptive, lacking the contextual understanding that Willow provides. This contextual intelligence allows Willow to not only transcribe but also format and style communications appropriately.
Alan Go [22:18]: "The key part which makes Willow so great is that it has this extra AI piece that interprets what you say."
They also differentiate between real-time dictation tools and note-takers designed for group conversations, highlighting Willow's superiority in one-on-one or personal productivity scenarios.
Alan presents compelling statistics demonstrating the efficiency gains from using Willow Voice:
Typing Speed vs. Dictation Speed: Average typing speed is around 40 words per minute, while dictation can reach 180-200 words per minute or more.
Alan Go [24:54]: "That's literally a four to five times increase in speed and productivity and time saved just by using dictation rather than having to type things out."
Time Savings: For frequent communicators, such as those handling numerous emails or Slack messages, the time saved can accumulate to over an hour daily.
Kip underscores that this productivity is hidden yet immensely valuable, as it accelerates routine tasks without the need for additional resources or flashy advancements.
Looking ahead, Alan envisions voice AI evolving into the primary human-computer interface, much like the transition from mice and keyboards to touch interfaces. He speculates on future developments, such as:
Enhanced Personalization: AI interpreting tone and context to incorporate emojis or adjust communication styles seamlessly.
New Design Languages: Just as touch introduced gestures like pinch-to-zoom, voice AI will necessitate new interaction paradigms.
Comprehensive Integration: Voice AI interacting deeply with various applications and workflows, creating a unified and intuitive user experience.
Alan Go [29:16]: "When we think about voice as the next primary interface, we think about what could that entire user experience be? And it's going to be entirely new."
Kip emphasizes the importance for marketers and business owners to stay abreast of these trends, as the modality of communication is poised for significant transformation.
Alan acknowledges the cultural shift required to adopt voice AI, noting that while the technology has advanced, user habits take time to change. He highlights Willow's focus on simplicity and user-friendly design to facilitate this transition.
Alan Go [27:12]: "Voice AI can replace everything on your entire keyboard and your typing workflow. And that's sort of the vision that we're headed towards."
Kip concludes by reflecting on the transformative potential of voice AI, ensuring listeners understand both the immediate benefits and the future possibilities. He encourages audience members to experiment with tools like Willow Voice to enhance their productivity and prepare for the evolving landscape of human-computer interaction.
Key Takeaways:
Voice AI is Mature and Reliable: Recent advancements have addressed past limitations, making voice AI a viable replacement for typing in many scenarios.
Significant Productivity Gains: Users can save substantial time by adopting voice AI for routine communications and content creation.
Contextual Intelligence is Crucial: Advanced voice AI tools like Willow Voice understand context, formatting, and personal styles, enhancing the quality of transcribed content.
Future Transformation: Voice AI is set to redefine user interfaces and communication modalities, necessitating adaptation and forward-thinking strategies from marketers and business professionals.
Notable Quotes:
Kip Bodnar [00:00]: "You're never going to want to type again."
Alan Go [02:22]: "Voice is the fastest way to communicate thoughts to paper, but text is the fastest way to process information."
Alan Go [04:32]: "With AI that's made voice finely intelligent and context-aware, we have infrastructure to support that at a low cost and real-time."
Kip Bodnar [12:11]: "Email is something we all have to do and that's a way better way to do it."
Alan Go [29:16]: "When we think about voice as the next primary interface, we think about what could that entire user experience be? And it's going to be entirely new."
This episode serves as a comprehensive guide for marketers, business owners, and professionals eager to harness the power of voice AI to streamline their workflows and stay ahead in the rapidly evolving technological landscape.