
Loading summary
Joe McCormick
Right before I started college, I ended up losing most of my central vision due to a rare genetic disorder called Lieber's hereditary optic neuropathy. I was talking with someone who was losing their sight recently from the same disease and they were asking about different things and I was like, oh, you can just do all of that now with Gemini or ChatGPT, the world is a whole lot easier.
Claire Vo
So you're going to show us some of the things that you've built for yourself.
Joe McCormick
So when someone sends me an image, I use this tool to be able to get the gist of an image without needing to ask somebody to explain it to me. If I hit continue control shift D on any message, it's going to pop up and go off and describe that image for me. And the cool thing is I can go ask some follow ups what age child is this for? And it will head off to ChatGPT and get the response for this as well.
Claire Vo
I'm curious for you, what are you most excited about in the multimodal world of AI?
Joe McCormick
One thing that I was always afraid of can I read stories? I can memorize stories, I can tell stories. But you're sudden being like, I want to read this book. And you having to be like, sorry, I can't. And now that sorry, I can't becomes sorry, I can with the assistance of sort of so many different tools now.
Claire Vo
Welcome back to How I AI. I'm Claire Vo, product leader and AI obsessive, here on a mission to help you build better with these new tools. Today we have Joe McCormick, principal software engineer at Babylist, who has a vision impairment and he's going to show us how he uses AI to build micro Chrome apps to make his everyday life and work a lot more accessible. You're going to learn how to use Claude code to write Chrome apps and you're going to be inspired at the little things you can do to make your own slack a little bit more efficient. Let's get to it. This episode is brought to you by Tynes. The intelligent workflow platform powering the world's most important work Business moves faster than the systems meant to support it. Teams are stuck with repetitive tasks, scattered tools and hard to reach data. AI has huge promise but struggles when everything underneath is fragmented tines fixes that. It unifies your tools, data and processes in one secure, flexible platform. Blending a jet tech AI automation and human LED intervention, teams get their time back, workflows run smarter and AI actually delivers real value. Customers now automate over 1.5 billion billion actions every week. Tines is trusted by companies like Canva, Coinbase, Databricks, GitLab, Mars, and Reddit. Try tines@tynes.com howiai Joe, thanks for joining how I AI and I want you to spend a little bit of time introducing yourself and your story and how AI has impacted your ability to do work and build interesting things and engage in lots of awesome projects and what's different about your Life now with AI versus before?
Joe McCormick
So yeah, my name is Joe McCormick, I'm a principal software engineer at Babylist and I think I took maybe a little more interesting journey than most into the computer science world. So right before I started college I ended up losing most of my central vision due to a rare genetic disorder called Lieber's hereditary Optic Neuropathy. And so before starting at Harvard I was more interested into the mechanical world and kind of robotics and everything in that space and then found that that was a lot harder and doing things with my hands was becoming a lot harder month after month. And so I took the Intro to Computer Science course at Harvard and immediately fell in love and found that I got the same feeling of creativity and being able to come up with the idea and make it happen. But now I was on maybe on a full equal plane to my competitors at that time or my other students. But then obviously as AI took off became even more equivalent and the gap between I think software engineer for a sighted person and a visually impaired person is closing day by day. And also in my personal life I think it's even been extra impactful. I was talking with someone who was losing their sight recently from the same disease and they were asking about different things and I was like, oh, you can just do all of that now with sharing your screen with Gemini or ChatGPT. Whereas when I was first losing my sight it was using different magnification tools or even glasses and things and it's like now the world is a whole lot easier. I'm an avid Meta Glass user and different things make my personal life a lot easier as well. But yeah, I do lots of AI product engineering now and I at Beta List lead the AI enablement and trying to make sure all of our software engineers can build AI as correctly as possible at all different parts of the software development lifecycle.
Claire Vo
So you figured out a way one to adjust your interest in engineering to something that's a little bit more accessible for you and then two lean into how these AI tools can really increase the accessibility and user experience of supportive technology that you've maybe used in the past, but that you've been able to make better yourself. And what I love about this personal software moment that we're in right now, which is unfortunately, accessibility software and custom software that meets the needs of a lot of people is simply in some instances not an economically viable business, for example, to build. And so in the kind of broader economic world, there's not a lot of incentive to build a full set of robust tools that can meet the needs of everybody who deserves to have their needs meeting and needs met. And what I love about what we're able to do now with AI is not only are more interesting sort of accessibility tools and, and platforms be able to be built, but people can build these solutions for themselves and they can be very customized to your experience, your needs, your strengths. And I think that's a really underappreciated benefit of AI. And so you're going to show us some of the things that you've built for yourself and you're actually going to walk us through your coding flow, which I think is really awesome on how to build one of these tools. So we can follow us step by step for sure.
Joe McCormick
So yeah, we can jump right in. I'll show off two that I built myself ahead of time and we're going to do one on the fly. And I do think personal software is going to be huge. One reason why I like building some of these. So I'm going to show off a couple chrome extensions that I've worked on. And one thing I like about building some of these as compared to maybe some of the offerings we have today from the AI native browsers is ama. Browsers are great. Like I do use Comet, but it's using the Swiss Army Knife and it does everything. But in order to do that, some of its processes are definitely slower, which there are certain things that's totally fine for, but there's other steps where you want it to be really quick and you want to use the drill instead of the little tiny screw driver that came with this was Army Knife. And so I'll show off a couple that I've already built and then we'll jump into one that's not necessarily even accessibility, only that I think everyone could benefit from. So I'm going to hop into Slack now. Babylist, where I work is a big Slack company. So lots of my stuff is very Slack based because it's where I spend a lot of my non coding time. So I'm going to hop in here and I'll share and show a couple ones I've built so far. This is a little temporary Slack channel that we have for this with some example messages that were actual messages sent by my colleagues. I just had them resend it into here first Demo is a image description tool. When someone sends me an image, I use a screen magnifier. So I typically am looking at my screen at about 10x zoom, but it's not the easiest to do and I prefer to not have to always be paying attention to that if possible. So I use this tool about to show off to be able to get the gist of an image without needing to ask somebody to explain it to me or me have to actually use my eyes to do it. So I have a shortcut in Slack. If I hit I'm on Windows so Control Shift D on any message it's going to pop up and go off and describe that image for me and tell me a little description of it so I can see, hey, it shows a modern infant baby stroller with a car seat, it's got a canopy, it's got all details about this. And the cool thing is I can go ask some follow ups so I can say let agechild is this for and it will head off to ChatGPT and get the response for this as well. And so we can get our answers there. And it's just a nice way for me to not have to necessarily push back and work with people and get some answers to my questions as I go on this too.
Claire Vo
And I think this is something that folks don't really appreciate, which is for you, you know, you have the ability to zoom in, look at this. But it's just from a time perspective, probably a lot more tedious for you to do. And so folks have thought a lot about image to description in terms of generating metadata for E commerce sites, which I'm sure you all think about a lot. Or we had an episode with a documentary producer that highlighted using image to text descriptions plus metadata to organize archival footage a lot easier. But this is a great example of image to text being just a much more efficient information transfer method for someone like you who might need to parse this this information differently. And then what I love about this is you could have just done the image description, right? Just what is this image? And tell me, but you actually were able to go that next step and say great, if I need to query more information about this to understand more context, you make that really, really easy. So I love this example. And you built this all yourself.
Joe McCormick
Yeah, this was probably 25 minutes of a cloud code session. It was pretty straightforward.
Claire Vo
Awesome.
Joe McCormick
A flavor of this one I'm working on right now is a version that works in Figma directly as well, that, given any Figma node, will explain it to me with a much different prompt. Right. In the Figma case. I want to hear about the colors of the CTAs. I want to hear about all this stuff because I am a full stack engineer and so you can get all that out of figma. But it's lots of clicks and lots of different steps and being able to just hit one keyboard shortcut and find out what this design is really accomplishing is going to be a nice easy win for me. And that one is just about done as well. It's got a little bit of bug, so it's not ready to fully demo, but that's one that I'm excited about for as well.
Claire Vo
So before we go into building one of these, are there a couple other extensions that you've built just as inspiration for folks watching or listening that you think are really interesting to show?
Joe McCormick
Yeah, another one that's not necessarily accessibility focused, but I think it's a cool one. So I am not the best typer in the world. I don't even think that's my vision's fault. I'd like to think I'm a touch typer, but I think my brain goes faster than my finger sometimes. So I have one that I built that is just like a really easy spell checker. There's lots of tools that do this Grammarly and all this, but simply they're not all screen reader accessible. They're multiple clicks away sometimes. And so I built one out that works in any input field on the web. I'll demo it here. I'm going to say test testing typos in the mass and then if I hit Control Shift s here on this one, this is going to go off. Send that off to OpenAI and come back with that. And while it was doing that for me on my screen reader, it said like processing spellcheck, spellcheck complete. And so I know when I'm writing a message I don't need to necessarily worry about all the polish on it. I can just do that. I hear spellcheck complete, I'm ready. I go off, hit that and send it off to people. And I have a prompt there that's basically like, do not change any of the words, just fix typos, like really hyper focus to make sure that it's the content that I wrote. But just with the typos corrected it.
Claire Vo
So I'm leaning in and smiling a lot. One, because if you've been watching how I AI, you have heard about my fancy nails. I'm a fancy nail gal these days, and with these fancy nails, I cannot type anything. It is all typos. And so this is such a great little workflow that you built for yourself that I'm going to steal. The two things I want to call out for people who are watching or not watching the details is you are actually running Slack right now in Chrome. So at first I was like, wait, how are all these apps interacting with the Chrome desktop app? But you're running Slack in Chrome, which means that these are all extensions that are available to you to interact with content in Slack and make modifications via a Chrome extension. So I think that's a really interesting hack for folks that are like, okay, I can't hack my way into the desktop app, but I can load Slack in the browser and then on top of that add a browser extension that can do these interesting things for me. The second thing I love is how you're using so many keyboard shortcuts to trigger these micro apps. And again, this is about efficiency. I always say in these AI products, that latency is the killer feature. And so anything you can do from a UX perspective or from a performance perspective to make these little apps more efficient, the better they're going to feel. And so I love that you type a couple keys and you get a fully corrected sentence here right in your browser. It's a great idea.
Joe McCormick
Yeah. The first version I had of this was just using the ChatGPT alt space shortcut that would open up that little mini ChatGPT window. And I had a saved just custom GPT to do it. And then I was like, well, why am I jumping out of. Where am I actually working? I can save two steps in three clicks here. And so I think it's almost like piloting it first without doing this and then realizing, oh yeah, there's a better way. And I think one thing I've heard from software is the return on investment became so much faster. Like before, you'd have an idea like that, you'd be like, this is going to save me like three minutes a day, but it's going to take me three days to build, so the payback period is just not totally there. And now it's like, it saved me three minutes a day and it takes me 30 minutes to build. Like, the payback period has just become insane for a lot of this tooling.
Claire Vo
I love it. So let's build one. I want to see what your flow is for actually building one of these things.
Joe McCormick
Yeah. So before we build it, I'm going to talk about what I want to build. So one thing that comes up a lot in the baby with Slack world and probably in many other companies slacks is people send links all the time. And for me, I often just hit like the save for later button and then maybe at the end of the week I decide if I want to read them. But I've realized that maybe it's do the thing that takes you one minute in the moment instead of keep deferring it. I think it would be great if there was an easy shortcut where I could have an AI go off and fetch this article, give me the key takeaways, and then I'll decide do I want to actually do a full read and save it for later or do I just skip it in the moment and have that all work in under five to 10 seconds? I think it's much more powerful than deferring and having this big to do list at the end of the week when I want to catch up on all these messages.
Claire Vo
So you're going to show us how, how you built this. And what I have to say is all of us are so overwhelmed with so much context and links and docs and yeah, you know, I would see something like this, whether it's a partner or a competitor or just something somebody found that was interesting. And you want to go, oh yeah, I should definitely read that. But should you? Should you definitely read it. So this quick summarization is a great idea. And so you're going to walk us through how to actually code this up using cloud code, I believe.
Joe McCormick
Yeah, and I'll jump in along the way with a couple like Claud code tweaks that I've made or at least lean into to try and make it a little more screen reader accessible. But again, I similarly think that lots of things that are good screen readers probably are going to be good tools for everybody across the board.
Claire Vo
Great.
Joe McCormick
So let's jump right in. So I'm going to switch over to my terminal for just a second so I can initialize our project. So I'm going to run a make DIR command to get our repo set up so we have our Slack summary extension. I'm making that directory and open up Claude code quick on this. Or should I open up VS code on this? So VS code opening and initializing here. I'm going to open it up as big as I can and we are going to jump in as this finishes loading. So I'm going to start here by making a PRD like every good how AI podcast goes.
Claire Vo
That's exactly right.
Joe McCormick
And I'm going to do this with audio. So sometimes I'll use Whisper Flow for what I'm doing, but in this case I actually find the VS code copilot audio to be pretty good. And so if I'm just doing something kind of quick like this, I'll just end up using the copilot integration. So you'll see I can do Control I and then when I hit Control I again it's going to dictate. So I'm not going to. I'm going to pause my talking and switch in that mode and I'm just going to dictate out for us a little bit of this PRD and then see what it comes up with for us. We want to build a simple PRD for a locally run Chrome extension whose job is it to exist in Slack alone. And when focused on a Slack message you can hit the keyboard shortcut Control Shift 1 and it will search that message to find any external links. If there are external links found, it should open them up in hidden tabs, extract their content and send it off to OpenAI. To summarize and we'll see here, I just finished that. It's going to go off and quickly generate a small PRD for that. This doesn't take very long at all. I'm just going to accept all the changes because I find reading in that diff view to be particularly painful with the screen reader. It's not like terrible, but it's much easier just to read it in the document. And since it's a new document being made, I'll now look at it here and we'll see how this looks. So we have our goals. We want privacy, security makes sense. Some user stories. Here's some function requirements parse. It's all making sense so far. Some non functional and some out of scope images. Yep, I already demoed my image processing and so open questions where and how should we summarize? Makes sense. We don't need success metrics for this. It's internal. So let's answer some of these questions. So let's just select it all and add it. I'm going to dictate again so I can hit Ctrl I and I'll start. We want to build a very simple PRD here for a locally run Chrome extension. We're going to where the job is when in Slack, in Chrome and hovering over a message that has focus, you can run the keyboard shortcut control shift 1 and that will look for any external links in the message. If any are found, open them up in hidden tabs and extract that content and send it over to OpenAI. When in OpenAI should summarize them and extract three to five key takeaways from the article and return those to the user in a fully screen reader accessible modal which includes the article's title and a link out in a new tab to view the article. Okay, we now have this. Generating the PRD so we can see it talks about our keyboard shortcut. It's got our goals, minimize user effort. Extreme reaccessibility is key for this. Some basic user stories, some function requirements. Cool, this looks good.
Claire Vo
And one thing I have to call out here is you're an engineer and that was a pretty good product description and that resulted in a pretty good prd. So one of the things I like about AI is as I say, there are no lanes. If you are an engineer and you have an idea, you can write a very good PRD using a little bit of AI assistance. If you need a tip, I know one or two tools that can help you with it.
Joe McCormick
Yeah, and this is no custom purity prompts or anything like that. This is just mostly the foundational models work here.
Claire Vo
Yep.
Joe McCormick
Cool. So now we're going to hop into a new tab, spin up Claude code in here. And ahead of this, as I mentioned, I built a couple of these Chrome extensions. So because I've done a few of them, I end up building out a Claude skill to help me build more Chrome extensions. So after I built the first two, I had Claude look at both and figure out what was the patterns that were common across them and work on a skill. So I could just build the third, the fourth, the fifth in a much simpler version and extract out that common piece. I have found Claude's skills to be a mixed bag in terms of actually being picked up automatically. So I am going to explicitly say use the skill, but technically you're not supposed to do that. But I definitely found that that's buried in its approach. So I'm going to make a prompt here. It's going to more request the skill.
Claire Vo
And for folks that are wondering how to set up their own skills, we do have a how I a episode. It is Introduction to Claude Skills where I explain that Claude skills are files in a in a folder. Sometimes they're zipped files in a folder. So if they seem mysterious to you, go check out that mini episode from, I think it was October or November, and learn how to make your own Claude skills. They're pretty useful.
Joe McCormick
Yeah. So in here the prompt's just going to be first mention the prd. So we have that and we'll say use the Claude skill for creating Chrome extensions to build out this prd. And one thing, Claude code has added this feature where you can edit a prompt instead of just in the terminal in a code file. And so in cloud code, if you hit control +G, it will open that prompt in a text editor. So especially for me, where navigating that terminal is not super screen reader friendly, I now am navigating it in the same place that I write code on a day to day basis, which is very screen reader accessible. And so again, other people may find this to be useful. You can craft deeper prompts. You can like Control F in here, you can do whatever. It's just a file. And so I think it's a really useful tool. They added a few versions ago to make it a little bit easier to work with. And I'm just going to put a note here. Use my OpenAI key from my shared Chrome extension config. So I've had some. I don't want to keep pasting my OpenAI keys and stuff like that. So I end up pulling out some shared config to share across all my Chrome extensions. That way I don't need to rinse or repeat that step over and over again. So I saved this file now and now whenever I close this, it's going to replace my prompt in Claude code with that completed prompt.
Claire Vo
Oh, interesting.
Joe McCormick
Yeah. So it's super effective, especially as you want to do a deeper prompt to use this and not have to worry about the whole terminal side of things. Cool. Now we're going to kick this off. And so I do have this Claude code session right now in planning mode. You'll see it's requesting you with the skill, which is great. We want to use our skill. I'm going to shrink this terminal so we can see more clouds. I don't really need this as much.
Claire Vo
Okay.
Joe McCormick
It's going to run some commands that actually pull in that cloud, that Chrome extension config that I talked about. Another thing I have to make Claude code a little more accessible is again, I'm not necessarily seeing everything that pops in as it's going. And so I set up a Claude hook that whenever Claude needs user input, it will basically like ding a bell on my computer so I could hear a sound that is like, oh, Joe, you need to do something right now to work on this. I actually don't want Claude code to read this file, so I'm going to say no because this is some secret configuration. So I'm going to say no, don't read that file. Now that has a API key in here. Don't read it. If you need, you then use JQ to extract the keys. Luckily I am engineer, so it's not like fully vibe coding this from scratch. I know that there's a utility that will extract just the keys out of a JSON object and CLAUDE won't be able to see the values which is the actual secrets there. So now here we are, it's just going to make a symlink to this extension for us, so we don't need to worry about the configuration. And if I ever do change it, it will automatically update in all my extensions. So instead of having each one of their own API key and my key becomes invalidated for some reason having to update all of them, I use this concept called a symbolic link. So just link the same config file into all my extensions. So one change fixes everywhere.
Claire Vo
Yeah, and this is one of those things that's just easy to do when you're running these things locally or just building stuff for yourself, is you just make the maintenance and the maintenance and deploy of these really easy for yourself and make it as simple as possible for you to repeat building things and using the same, for example, API keys. And you know, when you want to share all these publicly and publish them to the Chrome extension marketplace, we can, we can do a little cleanup.
Joe McCormick
Exactly. So here's our plan. And just like before, Control G also works to edit plans in the editor. So similarly, this is going to be a pain to read in this terminal for screen reader. But also if I want to make a tiny tweak to one thing here, I don't need to worry about telling Claude to update it or write it to a file. I just hit Control G, opens it up in this file, and now we have our full plan here. And you can just tweak different parts of it if you want and modify it. So it's another great usage of the Control G shortcut.
Claire Vo
Yeah, I want to call this out for people because so many folks would get something like this and then if it was wrong, kind of say, no, this is wrong. Please update XYZ or ABC and you're calling out. Not only is that a pretty inefficient way for you to interact with this file in terms of accessibility and your need for a screen reader. But it's also just not the fastest way to give it feedback. And so your ability to just take this, move it into code, use this control G, I believe, edit it, close it out, run it is just again, a lot more efficient.
Joe McCormick
Yeah. So we'll do a quick run through of this plan. I'm using again some more keyboard shortcuts just to break it down and fold the markdown headings. From my perspective, it's hard for me to visually or I don't visually scan the page. So reading through a big file, I typically encode or markdown, rely on folding so I can collapse different sections and read them and then expand only the sections I care about. So I don't care about certain aspects here, but like maybe I want to get deep into the error handling piece. So I'll expand that section and just read this part. So we have some logging, some key patterns, but this plan generally looks good to me. So now I will just save this and again close the file and that is what is over here in the prompt. Because I didn't modify it, it didn't take any time, it was just ready. But if I modified it, it would take a split second while it loads that new plan in and then it moves forward.
Claire Vo
Yeah. And I just have to call this again out again for people who are maybe listening to the podcast or again are not paying attention to what it means to use a screen reader here, which is you got your little headphone plugged in right now, and I am so impressed that you're using the screen reader while walking us through this demo. And what I think is so fascinating about watching your workflow here is it's super efficient and very fast, even if you don't take into account you're using a screen reader. So the fact that you've been able to build these shortcuts, these tools use Claude code in a more effective way. And then you add on this layer of and it makes using this kind of screen reader a lot more accessible to you is just very impressive. And I don't want people to miss that there's this invisible layer that we don't get to see or hear right now that you're also putting in between this, which adds a little bit of micro friction.
Joe McCormick
Yeah. And I think one thing that's great about cloud code, so like right now, visually, one of the options is selected in blue. I don't necessarily know which one it Is and using the arrow keys. It does not tell me with the screen reader what's selected. But CLAUDE Code has done a great job of standardizing where 1 means yes, 2 means often yes, but manually with a variation and then three is like no or type something extra. And so I can basically, instead of using the arrow keys and enter, I can just be like, yeah, I want to just move forward here. I'm going to hit the number one. And so they've done a good job of using, I think, lots of different inputs and lots of different ways to make this a little more accessible as it goes as well.
Claire Vo
Yep. And so that consistency, again, maybe this is for folks that are building AI products and trying to reinforce workflows, especially for folks that are building maybe these terminal UIs that I think are really lovely and interesting to build is you want, you know, people love the terminal because it's so fast and you want it to both be performance fast, but you also want it to be UI UX fast, which is if your user always knows 1 is X, 2 is yeah, 3 is Z, then they can consistently use these keyboard patterns of one key or two keys to efficiently get through your ui. And I think taking that mental friction, that cognitive friction off a user by driving consistency and patterns that people can either explicitly or implicitly learn is a really useful tool when you're using UI that is more constrained because again, a terminal UI is naturally constrained to basically text.
Joe McCormick
Yeah. And CLAUDE Code has been working on and has released a VS code extension that is more of a gui. I've just found that it's a little bit lacking behind some of the latest and greatest features. And I'm like, I want everything immediately, something a little spoiled. But I think that's going to catch up as we go to. And maybe a potentially more screen reader accessible option for some of these things too.
Claire Vo
Yeah. And again, we love a beep boop at the end of an agent completion. So I love the cursor sound. I love that you're using one here for CLAUDE code. Because again, I'm presuming with your screen reader you're not going to read this whole stream of what, too much going on? No, that's too much going on.
Joe McCormick
I think it's a big difference too between vibe coding things and production quality code. The final output here is just for me, it's going to run in my Chrome. I don't really care what the code looks like at all versus when I am building software for my full time day job and actually building stuff that's going to be in the hands of millions of users and many developers. I do do things a lot differently. The plan I'm going to read very detailed what it's going to actually do. The code I'm going to be reading, I'm going to be doing smaller commits and reviewing it kind of chunk by chunk and getting much more detailed. In this case, when the final output is just a user of one, the code quality is a lot less important.
Claire Vo
Yeah. What matters is does it work?
Joe McCormick
Yes, exactly.
Claire Vo
And it doesn't work in a little bit of like, did you leak your API key? Those are the two things we want to, we want to worry about. But other than that, we are, we are on our, on our way. And you know, the, the other thing that I think is really fun here is because you've built, I think the idea that you pick a platform or a framework for a set of your personal software and then establish best practices through a skill and then just rinse and repeat for other use cases is a really good way to get super fluent with some of these AI tools. And so I've seen a lot of people say all I do is build markdown based repos for my documentation and everything else. Everything I build is just a markdown based repo. And then I've gotten really good at using cursor for this. And then, you know, you have this example of every, you know, not everything, I'm sure, but like a lot of what I build are going to be chrome extensions. So I'm going to make, you know, this framework, get it going and then I can get really good at Claude code because I'm not relearning learning a technology and on top of relearning a tool. And so I do think it's, you get compounding effects by staying in the same technical space when you're trying to learn these Vicod tools. Because you're not trying to learn, you're not trying to learn on two fronts. You're trying to learn on just the tool, the tooling front. And some of the technical pieces have already been established for sure.
Joe McCormick
And I think as we talked about before when we were starting, like return on investment gets better and better because building this one as my third takes half the time as the first one. And when I build the fifth, I think it's going to take a fifth of the time of the first one. I am getting better the Claude skill each time I build one. I'm just going to feed it back in and be like, what was the skill missing make it better. And so I think it makes the return investment may turn into be two days of payback period or something crazy. So yeah, it does feel like it's cool to I think spread out and try many different things. But it does also just feel great to be like I have an idea, it is in my hands in under 30 minutes. Like it's, it's just very cool.
Claire Vo
Yeah. And again, this is one of those things where I tell people to work on their anti to do list. And when you have a recurring task or a recurring point of friction where you're constantly like opening links from Slack in a new tab and then trying to come back to them later and read them or you know, you would be constantly doing this like let's zoom in on this image and figure out what it is and is it something I need to worry about? When you have those recurring tasks, it's a hundred percent worth it. Instead of spending the time on the task itself to spend the time never having to do that task again. And I like this idea of the payback period of personal software basically collapsing to zero because it really just illustrates where we are in terms of the efficiency and value out of AI, which is it is much more important to learn to build some of these tools than to do the task right now. Like the payoff is so much higher. To learn how to automate the task versus doing the task. And if you can just do the change your muscle memory to every time you do the task, pause yourself and say actually I'm going to learn how to how to automate the task. You can really, really create a lot of leverage in, in your day to day life, even in your personal life.
Joe McCormick
Yeah, for sure. And we are just about done here. So I was checking back in on our little to do list which it just finished. So it's doing some final steps here. But we're just about ready to actually load this in right now. It's just kind of analyzing to see. Yeah, perfect. So it's running this one last step which is going to be. It's basically telling us hey, go load this in. So once we are done with this, we need to actually load it into Chrome. Chrome has a mode for extensions called developer mode. You'll see I have that toggle on at the top and it basically means that you can install extensions not from the Chrome web Store. You can install extensions from your local computer. You don't want to generally have that on because somebody could have side loaded in some, some credit card skimmer that you've imported or something. But if you know what you're doing and you know you just built a thing, you can go in here and turn this on. And then this looks a little bit different once you have this on compared to probably what you guys who are not having this on look in your Chrome extensions world. Because we have these options on the side here to load an unpacked extension. So this means an extension that's not like fully deployed in the App Store. So let's hit this and this is going to open up our little file browser here. So let's just pop back and we had called this Slack Summary extension. Okay. So that is now loaded in the Bone of Truth with the two of the software is you can only test it or easily test it in Chrome. So we're going to actually try it out. Whenever you download a new extension, you do need to refresh your tab so it picks up that extension. So right now I tried to use it. It's still working with the extensions that I had at the time I loaded it. So I'm going to refresh this slack so I can pick up our new extension here. And our moment of truth is going to be we're going to focus on this message and I'm going to hit my shortcut of Control Shift one and we'll see. Did we nail it?
Claire Vo
Oh look, it even has a black color, right?
Joe McCormick
Yeah, it's kind of interesting. So it's processing our link. We'll see. Did it work? It kind of work, but we have JSON here, right? It's not perfect. So let's work on one level of refinement here. So I'm going to take a quick snippet shortcut of this. So here I take a screenshot of this and we're going to send this back to Claude code and say almost again, it'll be cool if we one shot it. That'd been a really cool demo, but it's not a perfect one shot. We'll make one slight tweak here and I'm actually use a custom slash command I wrote to deal with screenshots. So because I'm developing on Windows, but I actually run Claude code in this thing called the Windows subsystem for Linux. It doesn't have access to my Windows clipboard, so I can't do what everyone else can do, which is just like hit Control V here and paste it. So I added a slash command here called Paste Image that uses some PowerShell shortcuts to pull images out of my clipboard and share them with Claude and so I can take that snippet and share it. And again this is similarly I would copy a file and I'd like save it in Windows, then move it to Linux and then import it with an app mention. And I was like, there's gotta be a better way. And I use the slash command now all the time for building stuff out.
Claire Vo
This is extreme software engineer stuff where you're like, okay, I run on this os, but I run my terminal on this. And now I can't access my clipboard, but I still like it that way. So I'm going to write a little script to give myself a two word shortcut to make this happen.
Joe McCormick
It ported this and it drops screenshots in our MITMP directory. So I just have to say yep, please read it. So again it's saving those two minutes every day adds up fast. And so it's just going to fix this little JSON piece here. And again it was kind of interesting, it was close, it got the right content, just didn't display it right. And so now it's going to go off and work on this for another second here and we should hopefully have a quick update. And the nice thing is with Chrome extensions in the developer mode, there's just an easy one click button where you will update and grab the latest copy of all of your extensions. So it's actually working multiple at a time or whatever. You can just hit that button and it's going to update for us. So we'll see. It's finishing up here and one of.
Claire Vo
The things that it's doing just I'm calling this out for people who are writing queries to OpenAI. It's moving the JSON out of the prompt which is saying please return JSON. I think it's actually just returning JSON as a string and text and it actually moved that to change the response object to being JSON so then it can actually be read by the. By the Chrome extension in a. In a more structured way.
Joe McCormick
Control shift 1C and it works.
Claire Vo
Beautiful.
Joe McCormick
We got it. Nailed it. So again we've got these takeaways we can now action on this and I can decide do I actually care that iterable added mcp? I do. Spoiler alert. But yeah, again we did it I think under 25 minutes here.
Claire Vo
And is it working in your screen reader? Did the accessibility Fully accessible.
Joe McCormick
So this modals in general can be sometimes problematic because a screen reader will sometimes read behind the modal. But surprisingly, although a Bunch of the web is not accessible. If you tell some of the foundational models like please make this accessible. The accessibility standards are actually incredibly well documented, so they do actually a great job of this. So they use the right it's called ARIA A RIA roles and make this modal have the right focus, not let you read behind it. So out of the box they're not going to make everything streaming accessible. But you say, hey, go do this. It'll gladly go follow the spec and make it accessible.
Claire Vo
So this is a meta question and maybe before, before I get into the meta question, let's just recap for folks what we saw. So you built a Chrome extension that's focused to the web version of Slack, that that Chrome extension that's running locally because you've toggled on developer mode in your Chrome settings, we'll take a focused link that's shared by a colleague or somebody in Slack. It will go out, it will parse that link. We'll see if there are any links in it. It'll parse it. It'll take some key takeaways. The way you built this is you bopped into VS code, you dictated a short prd, you let AI kind of build that out. You made minor tweaks to it, but basically shipped it. You used Claude code, including some custom slash commands and a Claude skill specifically around building Chrome extensions to then scaffold out that Chrome extension. You showed us control G in CLAUDE code where you can actually modify prompts and inputs as code, which is much more efficient, both from an accessibility perspective and just general user experience perspective. And you I showed us a custom screenshot. So you're very special, as I say, you know, unique snowflake software engineer environment can operate as if you as you want, even if there are some technical hurdles. And then now we have this great little extension that I want running on my app. So this is great. Joe, I love this. I want to hop into some lightning round questions and this has given me an idea of one that I really want to ask you, which is about mcps. So one of the things that I think is so interesting about MCPS is it allows you to bypass all UI and just get to the bones of what a SaaS product does. And I can imagine that while there are lots of okay, accessible enterprise software products, not all of them are building for maximum accessibility either in their design or in their kind of underlying way they're implemented. Have you found MCPS? And just that interface into some of these SaaS tools has improved accessibility for you has not. What are your thoughts there?
Joe McCormick
I think the altered goal of mine is I would love to do everything in one place and not have to switch tools like whether it's a context switch cost or just a switch cost. So having MCPS has been great for that. Luckily I actually think lots of enterprise software surprisingly is being built pretty accessibly. Like I want to really give strong kudos to Google Docs. Google Docs for what it does is so crazy accessible and the work that people don't know that goes in to make it that every single thing that is being done is being communicated to the screen reader basically letter by letter via the secret system that people don't know about called aria. Live announcements is kind of crazy, but I do find hey, I need to get something from three sites that's kind of painful. Can I just use the notion MCP and cool docs? MCP or glean in there is great notion is one that is a little bit harder. I think they do their best from accessibility standpoint in some ways, but there's a lot going on in a notion post a notion article. So I think that's another example where it's like, yeah, I can just pull this down and work in the markdown version. That's going to be a lot easier. And again, I've got it's way easier to navigate with these keyboard shortcuts and the folding features. So I will pull down some notion posts and just be like dump this into a markdown file for me and then use my little code shortcuts to help navigate some of those pieces.
Claire Vo
Yeah, I love that. And so my second question again is around personal software and the ability to translate sort of as you said, you can take a pretty complex notion page, turn it into markdown that allows you to, you know, read and parse it in a much more efficient way. And so I think this ability to translate files or formats is a really exciting part of AI. And we've hinted at a couple things we've seen in this episode. A little bit of like image to text, a little bit of voice to text. But I'm curious for you, what are you most excited about in sort of like the multimodal world of AI and what recently has come out that's caught your eye and made you excited? Or what are you hoping to see in the next couple months or years that you think could really open up stuff either for you personally or just as a product builder?
Joe McCormick
Yeah, I'll talk in the personal space. So I have two kids. I Have a five year old and a three year old and reading books to them is a challenge. I don't no braille I've been trying to learn, but it's a hard skill to pick up at 33. And so I've memorized a handful of books and I'll read those, but it's not really reading, it's fake reading. But a big shout out to the Gemini app and its live share features. I can now read any book. It sounds like me reading it, but me and my 3 year old Cole will sit on the couch, he'll bring a book over and I'll be like, hey, I can read this one. Or no, Gemini can read this one for us. And we'll turn the pages and say, gemini, next page. And Gemini will read that page and then we'll turn to the next one and it'll read the content of that. And so I think just equitable access to everything is great. And that piece is one thing that I was always afraid of. Can I read stories? I can memorize stories, I can tell stories. But there is something to just your son being like, I want to read this book. And you having to be like, sorry, I can't. And now that sorry, I can't becomes like, sorry, I can with the assistance of so many different tools now. But I think the Gemini one is particularly useful and I found that one to be the strongest for just easy sharing. Just saying next page. It knows all the context. It immediately starts reading it. I have meta glasses, I have the ChatGPT Pro. I've got all these things. But I think Gemini is doing the best job of it right now. And this is before me trying any of the Gemini Pro 3 that just came out to see if any of that makes it even better.
Claire Vo
Well, that is a very, very sweet story. And yes, I was just thinking I wish the meta glasses, which I love and, and use every day would, would. Would also help do a better job there. But it's awesome to hear that Gemini can add to that that special time with kids which as you and I were talking before the show, you know, we both are boys, parents of boys, is just is. Is such a special time. So my last question is when AI is not listening and I'm curious if you type or if you speak this, your prompting techniques, have you ever whisper flow yelled at your AI or do you have any tricks for us when AI or Claude gets really, really stuck?
Joe McCormick
It's kind of a nerdy answer, which makes sense for me, but my typical mode is basically clear the context and start fresh as much as possible. I think a lot of people will try and keep massaging it and being like if I just send this one extra prompt in this conversation, it'll figure it out. It's like, no, you just have to start from scratch and take the learnings that you have from the last time. And so sometimes it'll be like this hasn't been going great. What did you learn about this? Take that and feed that into the next prompt. But most of the time I used to be like let's start from scratch. Something clearly got poisoned in this context. And when you start from scratch I feel like everything just feels smoother.
Claire Vo
I love it. Well Joe, thank you so much for showing this. I think it's just one of those workflows that we haven't seen before. Everybody can find a use case. I am thinking of all sorts of little microfibers, frictions in my own life where a keyboard shortcut or two could really make things a little bit better for me. So where can we find you and how can we helpful to you?
Joe McCormick
Yeah, so I mentioned at Babylist Babylist is very actively hiring and if you are somebody who likes using AI in your day to day building of software and you're a software engineer, we are a Ruby on Rails and React shop but hiring across the board all different levels. So check us out on babylon.com, and personally I'm on LinkedIn as well so feel free especially if you have any accessibility questions or any questions on some of the Chrome extension piece. I'm always happy to answer on LinkedIn.
Claire Vo
Well thanks for joining How I AI.
Joe McCormick
Thank you.
Claire Vo
Thanks so much for watching. If you enjoyed this show, please like and subscribe here on YouTube or even better leave us a comment with your thoughts. You can also find this podcast on Apple Podcasts, Spotify or your favorite podcast app. Please consider leaving us a rating and review which will help others find the show. You can see all our episodes and learn more about the show@howiaipod.com See you next time.
Podcast: How I AI
Host: Claire Vo
Guest: Joe McCormick, Principal Software Engineer at Babylist
Episode: How this visually impaired engineer uses Claude Code to make his life more accessible
Date: February 16, 2026
This episode of "How I AI" delves into practical, personal software creation, focusing on accessibility. Joe McCormick, who is visually impaired due to Lieber's hereditary optic neuropathy, shares how he leverages AI—especially Claude Code—to create Chrome extensions that make both his work and life more accessible. Joe demonstrates his workflow, showcases custom-built tools, and discusses how the ability to rapidly prototype personal software is transforming accessibility—not just for himself, but for anyone willing to build.
"I got the same feeling of creativity and being able to come up with the idea and make it happen. But now I was on maybe on a full equal plane to my competitors [...] as AI took off became even more equivalent." — Joe McCormick [03:31]
"What I love about what we're able to do now with AI is not only are more interesting accessibility tools and platforms able to be built, but people can build these solutions for themselves." — Claire Vo [05:33]
(Demonstrated at 06:22 – 09:59)
"I use this tool... to be able to get the gist of an image without needing to ask somebody to explain it to me." — Joe McCormick [02:22, repeated at 06:22]
(10:50 – 13:46)
"They're not all screen reader accessible. They're multiple clicks away sometimes. So I built one out that works in any input field on the web." — Joe McCormick [10:52]
(14:37 – 41:00)
"If you tell some of the foundational models like 'please make this accessible,' ... they do actually a great job of this." — Joe McCormick [40:21]
"The gap between... a sighted person and a visually impaired person is closing day by day [...] it's even been extra impactful." — Joe McCormick [03:31]
"The payback period has just become insane for a lot of this tooling." — Joe McCormick [13:46]
"Things that are good screen readers probably are going to be good tools for everybody across the board." — Joe McCormick [15:58]
"You get compounding effects by staying in the same technical space when you're trying to learn these Vicod tools. Because you're not trying to learn on two fronts." — Claire Vo [31:30]
"A big shout out to the Gemini app and its live share features. I can now read any book. It sounds like me reading it, but me and my 3 year old Cole will sit on the couch, he'll bring a book over and I'll be like, hey, I can read this one. Or no, Gemini can read this one for us." — Joe McCormick [45:08] "Now that 'sorry I can’t' becomes like 'sorry, I can' with the assistance of so many different tools now." — Joe McCormick [45:45]
"My typical mode is basically clear the context and start fresh as much as possible. … Something clearly got poisoned in this context. And when you start from scratch, I feel like everything just feels smoother." — Joe McCormick [47:18]
“I added a slash command for Paste Image... again this is similarly I would copy a file and I'd like save it in Windows, then move it to Linux... and I was like, there's gotta be a better way.” [37:35]
Joe McCormick demonstrates how AI—specifically, tools like Claude Code—enables visually impaired engineers (and everyone) to rapidly prototype highly personal, efficient accessibility tools that transform daily work and life. By walking through his real-world workflow, Joe illustrates not only how to make personal software with modern AI assistants, but how building such tools is now practical and offers a staggering ROI. The episode is a masterclass in blending empathy, technical skill, and the very latest AI to create a more accessible world, one keyboard shortcut at a time.