Loading summary
A
Andrej Karpathy, I mean, one of the godfathers AI has just launched something called Auto Research. And Auto Research is a huge deal and it's going viral on Twitter. And I just wanted to do an episode where I can explain to you in the clearest way possible what it is, what are the use cases, how to make money from it, how to be more productive with it, how to create impact with it. And by the end of this episode, I'm going to give you a bunch of different ideas, use cases for how to use Auto Research. I'm going to explain it to you in the most clear way possible, and at the end, I'm going to tell you how you can actually get started with it. So let's go right into it. So what is Auto Research? Well, it's like having a super nerd robot intern that runs science experiments on AI models for you on all night without you doing the boring stuff. I mean, sounds intriguing, right? So how do you actually, you know, program it or get started with it? Well, the first thing is you got to give it a goal. So you can say something like, make this small AI model smarter. That's the goal. And then an AI agent will actually plan what to do, like different settings, code changes, edits the Python code for you, runs a short training experiment on a GPU for about five minutes, it reads the results and then it decides what to change next and to repeat the loop. So in some ways, if you've seen my video on the Ralph loop, where it basically would do engineering 247 and you'd wake up to new stuff happening, in simplest terms, that's what Auto Research is helping it do. You give it a goal. The AI agent does a thing. You know, you tell the AI what better means. Cheaper leads, more clicks, higher sales, better model school. And then the AI keeps changing things, testing them, and it only saves the changes that improve. So what's really cool about it is you wake up, you grab the best version, and then hopefully you turn it into something you charge for or, you know, you give it away. I saw this tweet by Toby, who's the CEO and co founder of Shopify, on Auto Research works even better for optimizing any piece of software. Make an auto folder, add a program md, that's just a markdown file, which is really the foundation of how you're going to be using Auto Research and a bench script. Make a branch and let it rip. So that's why I started paying attention to Auto Research, right when Andrej Karpathy Legend and Toby and more people start playing with it. I'm like, okay, I gotta pay attention. So I created this little visual for. For how to think about what Auto Research is. So you set the goal. The AI plans an experiment. It edits and trains the code and settings. It runs a short training on a gpu. By the way, this is an important. I should mention that you need an Nvidia chip to actually run Auto Research or you can do it in the cloud. I'll talk about this at the end of the episode. But you do need that. You can't just run it on. Let's say you have a MacBook M1 or something like that. It reads metrics. It says, is it a better result if it's not, it's going to log the attempt and it's going to discard the config. If it's yes, it saves it to the config and then just plans a different experiment. And it just hopefully gets better on your goal, whatever it is. So let's get into. We're going to get into some of the ideas, business ideas around it. But right before that, I just want to say here's a simple mental model for how I'm thinking about Auto Auto Research. So imagine you have a research boss. You can boss around. Number one, you write a clear task. So for code experiments, maybe it's improve this model test score for business, figure out the top five competitors for product XYZ and make a short report. Step two is you give the bot access to the code, a GPU for ML experiments. You obviously need to give it access to the Internet and documents. If you're doing reading tasks, the bot then runs a loop. So it plans, it acts, meaning it might run code or search, it reads results, it updates the plan and then you just come back later. It could be 12 hours, 20 hours, six hours. And you see if it's logged everything, charts and metrics, and then it gives you a written summary in normal language. So, you know, think of Auto Research as a research bot that runs experiments for you while you sleep, tries lots of ideas fast and keeps the winners. Quick break to invite you to something. Now, this isn't an ad. I just want to invite you to a free event because I think that you're going to get a lot out of it. I wanted to take one hour of time where we just talk about building businesses in the age of AI. People say SaaS is dying. I actually believe the quite opposite. I think that SaaS is just evolving. I think right now is an incredible time to be building something software startups that help you craft your dream life. And for all those reasons, I said let's just book one hour of time. It's going to be 11am March 12th. That's a Thursday where we can go and lock in and just talk about building businesses in the Ajai. I'll include a link in the description in the show notes to join and I can't wait to see you there. Okay, how do we use it? Here's some ideas for you. So the first idea for you I have is a niche agent in a box, you know, products. This can be multiple products. And by the way, I put out these ideas. I want you to do these ideas. I think that, you know, even if they don't turn into businesses, you will learn about these tools and that is going to help you outperform 99.9% of people on this planet. So you package tiny auto research loops tuned for one painful niche. So the example I think of is an Amazon listing experimenter, an email sequence tuner for realtors, a pricing optimizer for SaaS. Those are, you know, auto research loops and ideally in a niche that you understand well and then you charge a monthly fee. So the value prop is this thing runs experiments for you 247 and just shows you the winner to click accept. How valuable is that and how many different niches are there that you know, this plays into. The hard part is figuring out what the, what's the pain points and then, and then obviously you know, you want to be quick, quick to market, right? So here's a visual of it. Pick the painful niche, design the tiny auto research loop, run experiments automatically, see which setup works best, turn best, set up to a simple agent product and then you charge that monthly subscription number two. You're going to want to, you know, here's an idea. Print money using an AB testing for marketing. So this is, it's, it's very similar but instead of, you know, you know, instead of, you know, doing it for realtors or whatever, you're doing it for ads and landing page experiments. So landing pages. So the agent writes variants of headlines, layouts and offers, pushing them to traffic measures which one converts better and keeps iterating. So this is like conversion rate optimization around landing pages. You know, the old think of, you know, tools like optimizely. That's a SaaS tool that, you know, when I first moved to San Francisco I remember how big they were and everyone was talking about Optimizely and a b Testing and it's like, well this is the future of that auto research for different landing pages. You can also use auto research for something like ads which auto test creatives. It auto tests angles and audiences and then it keeps the combos that lower CAC or raise roas. So you know, you profit by running this for your own products. Like if you, if you want to build your own products and just use this internally, that works. Or, or you know, offering an always on experiment engine to clients as a retainer service for 5k a month. I'm going to give you the best landing pages every single month and it's just going to come to your inbox, that sort of thing. Visual of it business goals. You know, the goal that you're giving the auto research is more sales. It's generating things like pages and advertions, sending traffic to the versions, measuring conversion and revenue. Does any version beat the current best? If it doesn't, then you're going to keep the current control. But if it does, you're promoting the winner to a new control and you're asking the AI for new ideas. All right, hope your creative juices are starting to get flowing. You're starting to understand a little bit more about how it's working, how you think about goals, how you can think about agents and how you can set up these loops. Number three, research as a service. So auto research's recipe is basically a loop for doing research, right? Because you're searching, reading, summarizing and you're comparing and then you're repeating. So how do you point that at money problems like market and competitor research for startups. So constantly updated reports on who's doing what, pricing features and gaps. Super valuable investor and M and A decks, fast technical and market due diligence summaries. Super valuable compliance and regulation tracking for niches. I don't know, crypto, healthcare, finance. Super valuable. So you can charge per report like a one off or you can set up like a monthly subscription for always fresh dashboards. So visual define client research question. Auto research searches and reads, summarize and compare findings, creates reports and dashboards. Deliver insight to client and the client pays per report or monthly, whatever you decide. Number four, power tool inside your own product. So if you already have built a SaaS or workflow, embed an auto research style agent so your users can press optimize just like a big, I envision like a big button that just says optimize and the system runs a mini research loop for them. So for example tune prompts, pick Best pricing, rank suppliers. Then you can charge higher tiers for this feature or you can use it as a wedge to upsell pro and enterprise brands. So maybe that's a part of pro and enterprise. Maybe it's something that you just send an email to your entire list and you're like, hey, we have this really powerful tool. Imagine you press this button. It's like bending spoons, right? It's like bending spoons. Like, how is this bending spoons? Not the private equity group I'm talking about. I'm like the idea of you can bend a spoon, right? It's incredible that you'd be able to optimize, press a button and this would happen. So visual over here. Have an existing SaaS add an optimize button. Users run many research loops. Tool suggests better settings or prices. Users see better results, offer higher price propens and enterprise plans. Number five. This is a saucy episode, by the way. This is saucy, all right. Agency that sells. We run more tests than anyone else. Because auto research lets you run hundreds of experiments instead of a few, you have a simple pitch. We do 100 times more testing than other shops for the same or lower fee. A niche example, a Shopify store conversion lab, B2B SaaS pricing experiment service, email, subject line and sequence optimizer. You charge per month and a bonus if you hit specific KPI lifts. Rev share performance fee. People love that. You know, of course they're going to be, you know, interested in. Yeah, if you can do, if you can lift this KPI, we'll give you some bonus. So here's the visual Start an optimization agency. Use auto research to run many tests, improve stores, pricing emails and funnels, show clients more experiments and wins. Charge monthly retainer and performance fee. Number six. And we've got about 10. Yeah. So we're almost, almost done. And then after we're going to talk about just some cool, interesting stories around auto research and then I'll end with how you can set this up very briefly. So auto quant for trading ideas. So you can use auto research to run small, fast backtests of many simple trading rules. So LLM based factor screen sentiment filters on one GPU overnight. So you can keep the few strategies that look promising, then either trade on your own account or sell signals and strategy reports. So depends if you're a trader, maybe you're doing yourself or. Yeah, you can just, you know, sell this as a digital product or. Yeah, yeah, yeah, basically a digital product. So you define the simple trading rules you run Many back tests overnight. You review the strategy performance, you keep only promising strategies. Trade your own capital or you can sell the signals. I think finance is changing a lot and I think with things like auto research you know it just, it's, it's going to be an unfair advantage for a lot of people. So I think you're going to see a lot more digital products that people sell and also you know just using their own money, trading themselves instead of giving 1% or whatever to a financial advisor. I'm sure also by the way a lot of people are going to get burned by this too. Like they not they're just going to blindly just trust an auto research. You need to have a human in the loop and you need to manage that obviously accordingly. But yeah, you can just see yeah there's definitely going to be some people are going to get burnt. You just give the entire. They're just going to like give a bank account and just let auto research just trade for it. I mean would be interesting, it would be an interesting test, that's for sure. Number seven always on lead qualification and follow up point an auto research style agent at your CRM. So like a salesforce or something like that. And inbound leads let it test rules and messages to see which leads are most likely to buy right. It auto grades the leads, suggest next actions and drafts follow up. So salespeople only focus on high value deals so it's more revenue per hour spent. Visual over here for you connect to CRM. You know auto research test the leads, rank leads by likelihood to buy, draft follow up messages, sales focus on best leads, revenue per sale increases. 8 Finance Ops autopilot for businesses use the loop to grind through invoice matching, expense report generation and exception detection with continuous small improvements to rule and prompts. You can sell this as we cut your AP expense time in half either as software or as an OP service with a small team and agent. By the way I can totally see someone like someone starting this and this gets acquired by one of the large fintech companies or one of the large banks. So visual here ingest invoices and expenses. The auto research improves rules and prompts, matches invoice and detects exceptions. It generates clean expense reports, reduces manual finance work and then you can sell it as a software or op service or you start, maybe you start as op service and then you kind of evolve into the software. Two more for you. Number nine an internal productivity lab for your own org. I thought this was interesting. So treat your company like Carpathy's GPU lab define KPIs so like response time, close rate ticket resolution and let agents iterate on workflows and templates and routing rules. So you just get fewer meetings, less manual grunt work and then you personally touch only the high impact decisions when everyone else rides the improved process. So the goal here is defining the key metrics. Auto research is testing the new workflows, it's improving templates and writing rules. You're cutting meetings and manual tasks. That's good team focuses on high impact work and then higher productivity and ideally higher profit. Last idea for you, done for you. Research or due diligence shop. So you use the research loop to chew through docs, filings, product pages and reviews and keep an evolving living memo for clients like investors, acquirers, execs. You make money by selling fast well structured briefs and a monthly update packs instead of one off manual research logs so you know the goal, get investor or acquire a question. This happens all the time. Auto research reads through docs and filings, it summarizes that product, market and risk and maintains a living memo for the client. It delivers a brief and updates packs and the client pays for reports and ongoing access. I would pay for something like this so hopefully someone builds it. All right, so those are a bunch of ideas for you. I also saw a couple interesting things this morning. My good friend Morgan Linton who's been on the pod before, he says I woke up this morning and all I can think about is auto research. So so many idea ideas swirling around in my head. Not sure 99% of the world realized the incredible breakthroughs Carpathy is making and just sharing casually on X. Right now where my mind is going is medicine. It feels like in many ways clinical trial design is itself kind of like a hyper parameter search. I know right now trials cost tens of millions of dollars minimum. It feels like an agent swarm could optimize treatment protocols on small proxy experiments, promote the most promising candidates and then move to humans to review. So humans still very much in the loop but later on and experimentation going much deeper, happening faster and for far less money. I think for me, while I'm not a doctor, he's an engineer. What I'm the most excited about when it comes to AI is the impact it will have on human health and critical critical areas like disease treatment might be a crazy idea so a real doctor can jump in the comments and slap me on the wrist here. I looked at the replies. I didn't see you know any, any doctors come in, but I don't Know, I just can't stop thinking about how what Carpathy has discovered here could have some pretty profound implications. So only halfway through my coffee though, but woke up this morning and this is what I'm thinking about. So thought I'd share. I agree. I think there's a lot of really interesting, not just like business profit ideas, but also just like medicine science research. So I'm excited for people to, to take this and to continue with it. I also saw this tweet here. What's after Auto Research? It's Karpathy's new open source project, Agent Hub. So Carpathi also launched Agent Hub. What is agent hub? It's GitHub for humans. Sorry, GitHub is for humans. Agent Hub is for agents. So it's basically a GitHub for. For agents, an agent swarm collaboration platform. A very promising direction. I'm watching him speedrun, a one man billion dollar company. If you look at the GitHub for agent hub, it says first use cases for auto research. But it's a lot more general than that exploratory project. He says agent first collaboration platform A bear get Repo. A message board designed for a swarm of agents working on the same codebat code base. Think of it like a stripped down GitHub where there's no main branches, no main branch, no PRs, no merges, a sprawling dag of commits in every direction with a message board for agents Coordinate. I think this is really interesting and just like whenever Carpathy is up to something, I'm always paying attention, so I had to put that one in there as well. So maybe you've gotten to the end of the, this episode and you're kind of like, okay, I kind of. I think I understand what Auto Research is. I think I know what you know. Carpathy is a G. Toby's a G. Like all these smart people are playing with it. How do I get started? Well, to get started, I'd recommend just tell Claude Code to get you started. So you know, I went ahead and I basically was like, uh, I. I gave Claude code the li. The this, this GitHub repo, the GitHub, the auto research GitHub repo. And wow, 25,000 stars already. So this is crazy. It's really growing, growing quick. So I just gave it, I gave, gave it the link. And I was just like, I need help installing Auto Research by Carpathy. And it says, here's how to install it and set up Auto Research by Carpathy. You need an Nvidia gpu. So I talked about that. In the beginning it was tested on a H100. But other Nvidia GPUs should work and you need a UV package manager. So you have to install uv, you clone the repo, you install the dependencies, you prepare the data and run a training experiment. In my case, I don't have an Nvidia GPU. I'm actually using a MacBook and an M1 Pro. I know I need a upgrade to a new Mac. So I was like, so wait, I need an Nvidia GPU to do this. But there's a few options. Cloud gpu. So you can rent an Nvidia GPU from a service like Lambda Labs, Vast AI Run Pod or Google Cloud. Some offer free tier with GPUs. This is the most straightforward path. So that's, that's the answer to people who don't have an Nvidia chip. Just rent it on one of these services. I personally use Google Collab. Why? I just know Google the best and trust Google the best. You know, it also says you can try it, you know, via Apple Silicon, via an NPS backend. I'm like, no, I'm not going to do that. So that's what route I did. I went on Google Collab. The easiest way to get started, you go to collab.google.com you create a new notebook, you change the runtime to change runtime T4 GPU and you run a bunch of commands that might be like complicated. Sound complicated? This is what colab looks like. You literally just tell, you know, you listen to what cloud code tells you to do and you just paste it in and you can get started. So, you know, if, if people are interested, I can spend, you know, more time with this, with auto research, as I'm learning, sharing more about it. But I just wanted to do give you a quick primer on what it is, why it's important, what are some ideas on how you can actually use this thing and then how are people installing it? They're just, you know, you can use Claude Code as your helper to get it installed, installed and you're going to want to rent a GPU in the cloud at least to start. So hope this has been helpful. This is another solo podcast that I'm doing on the Startup Ideas podcast. The last time I did this last week, I had a lot of comments that said, yeah, Greg, I actually really like when you just come in solo and just start like telling us what's on your mind and stuff like that in real time. So I'm here, I read every single comment. So, you know, keep commenting, keep liking, keep subscribing and I'll keep, you know, putting this out there for you for free. Yeah, I'm excited to see what you end up using this for. Of course it's early, right? Like this is, this is brand new. People are still trying to figure out what are the use cases. But I always find that, you know, in the, in the fog. In the fog, people don't really understand where the opportunity is, is when there's sometimes an opportunity. So one thing I've just learned in my career is just like when I see people like Carpathy doing things like this, you want to pay attention, you want to tinker with it, you want to have some fun with it and you want to see what it's all about. So thanks again for giving me your time. Hope this has been clear. Share this with a friend who you think would see it valuable. And if you need, if you need any ideas, more ideas on startups to build with AI, ideabrowser.com, definitely your place to go. And I'll see you in the comment section and I'll see you next time. Have a creative day.
Host: Greg Isenberg (CEO, Late Checkout)
Air Date: March 11, 2026
In this episode, Greg Isenberg demystifies Auto Research—Andrej Karpathy's latest open-source project that's rapidly going viral in the AI community. Greg breaks down what Auto Research is, why it’s a game-changer for startups and productivity, and most importantly, how listeners can use or build on it to create new businesses and tools. He offers concrete startup ideas, technical context, and signals why now’s a unique time to experiment with novel AI workflows.
Greg likens Auto Research to having a tireless “super nerd robot intern” who will run AI model experiments for you all night, tuning code and configurations autonomously.
“It’s like having a super nerd robot intern that runs science experiments on AI models for you all night without you doing the boring stuff.”
— Greg Isenberg, (00:26)
“Think of Auto Research as a research bot that runs experiments for you while you sleep, tries lots of ideas fast and keeps the winners.”
— Greg Isenberg, (07:51)
(A series of practical business angles and examples, with “visual” step-by-steps outlined verbally.)
“You package tiny auto research loops tuned for one painful niche... and then you charge a monthly fee.”
— Greg Isenberg, (09:22)
“Print money using AB testing for marketing... offer an always-on experiment engine to clients as a retainer service for 5k a month.”
— Greg Isenberg, (11:11)
“Constantly updated reports on who's doing what, pricing, features and gaps, super valuable.”
— Greg Isenberg, (13:41)
“You make money by selling fast, well-structured briefs and monthly update packs instead of one-off manual research logs.”
— Greg Isenberg, (24:23)
“…right when Andrej Karpathy legend and Toby and more people start playing with it, I’m like, okay, I gotta pay attention.”
— Greg Isenberg, (02:49)
“Even if [these ideas] don’t turn into businesses, you will learn about these tools and that is going to help you outperform 99.9% of people on this planet.”
— Greg Isenberg, (09:45)
“I’m sure also, by the way, a lot of people are going to get burned by this too... you need to have a human in the loop and you need to manage that obviously accordingly.”
— Greg Isenberg, (19:57)
“Clinical trial design is itself kind of like a hyperparameter search... it feels like an agent swarm could optimize treatment protocols...”
— Morgan Linton via tweet, read by Greg
“Agent Hub is for agents... a sprawling DAG of commits in every direction with a message board for agents to coordinate.”
— Greg Isenberg, (28:04)
Nvidia GPU Required:
Can’t run on Apple Silicon Macs; need Nvidia GPU locally or access to a cloud GPU.
Easiest Setup:
Quick Steps:
“If people are interested, I can spend more time with this, with auto research, as I’m learning, sharing more about it.”
— Greg Isenberg, (30:48)
“When I see people like Carpathy doing things like this, you want to pay attention, you want to tinker with it, you want to have some fun with it and you want to see what it’s all about.”
— Greg Isenberg, (33:26)
Auto Research represents a paradigm shift: codifying “always-on” AI experimentation so that anyone can accelerate research, especially entrepreneurs and operators. The future will belong to those who tinker and figure out real-world applications—so grab a cloud GPU, try it out, and be on the lookout for both business and societal breakthroughs.
“Hope this has been helpful. Share this with a friend who you think would see it valuable… I’m excited to see what you end up using this for.”
— Greg Isenberg, (34:11)
For more startup ideas and actionable prompts, visit https://gregisenberg.com/30startupideas.