
Loading summary
Jonathan Siddharth
I think the era of data labeling companies is over and it's now the era of research accelerators. All knowledge work is going to be automated. It's only a matter of time. I don't see an AI bubble. These models are incredibly powerful today. SaaS as we know it I think is over. I think it's completely over.
Harry Stebbings
This is 20 VC with me, Harry Stebbings and I'm so excited for the show today. So I'm fascinated by the data labeling market. McCaw Surge, Turing Invisible There are seven players that I know of that do over $100 million in annual recurring revenue. All of them have more than 50% of their revenue from two customers. The way they use the term revenue is also sometimes super unclear versus gmv. There are bluntly just a lot of questions today. I do not pull any punches with the man leading one of these companies, Jonathan Siddharth, founder and CEO of Turing and a company that he has scaled to over 350 million in annual recurring revenue, raising $225 million in the process and now being a profitable company. This is an amazing discussion and as I said, the tough questions get answered. But before we dive into the show today, are you drowning in AI tools, Chat, GPT for writing, notion for docs, Gmail for email, Slack for comms and you're constantly copy pasting between them all, losing context and losing time. This is the AI productiv tax and it's killing your output. At 20 VC we're all about speed of execution and Superhuman is the AI productivity suite that gives you superpowers everywhere you work. With the intelligence of Grammarly, mail and coda built in, you can get things done faster and collaborate seamlessly. Finally, AI that works where you work, however you work. Superhuman gets you from day one with zero learning curve and it's personalized to sound like you at your best, not like everyone else.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Using genetics, generic AI.
Harry Stebbings
Get AI that works where you work. Unlock your superhuman potential. Learn more@superhuman.com podcast that's superhuman.com podcast and just like Superhuman gives you superpowers in your inbox, Fanta gives your company superpowers.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
In security and compliance.
Harry Stebbings
Customer trust can make or break your business and the more your business grows, the more complex your security and compliance tools get. It can turn into chaos. And chaos isn't a security strategy. That's where Vanta comes in. Think of Vanta as your always on AI powered security expert who scales with you. Vanta automates compliance, continuously monitors your controls and gives you a single source of truth for compliance and risk. So whether you're a fast growing startup like Cursor or an enterprise like Snowflake, Vanta fits easily into your existing workflows. So you can keep growing a company your customers can trust. My listeners can get $1,000 off Vanta by going to vanta.com 20v V-A-N-T a.com 20vc 20vc for $1,000 off Vanta. And like Vanta builds trust with automated security. Angellist drives momentum by helping you find and fund breakout startups. If you're listening to 20 VC, you know we have a really freaking high bar. Well, Angellist is the modern platform used by the best in class venture funds where over 40% of top endowments and banks are LPs. Their customers include a top five venture firm. 20 VC now have, check this out, $171 billion of assets on the platform. They combine an all in one software platform with a dedicated service team that moves as fast as you do. One manager said this awesome quote, angellist feels like an extension of my fund. Another said Angellist gives me total peace of mind. The attention to detail, lightning fast response time and just real sense of ownership from the team are exactly what I need to stop worrying about back office ops. So if you're starting a new fund, don't be a moron, just use Angellist. They're incredible. Head over to angellist.com 20vc to learn more.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
You have now arrived at your destination.
Harry Stebbings
Jonathan, I've been so looking forward to this. Thank you so much for joining me in person. It's such a treat to do it.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
In person while you're in London.
Jonathan Siddharth
Thank you for having me Harry.
Harry Stebbings
Now I want to start with a little bit of definitions because everyone thinks.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
That talent marketplaces and then everyone pushes back on talent marketplaces. How do you describe it and why are we not dealing with talent marketplaces anymore?
Jonathan Siddharth
I think of a talent marketplace as something that's basically matching talent to something. Maybe it's an opportunity. So Turing is not a talent marketplace. What we do at Turing is we are training superintelligence. We work with seven out of the eight Frontier Labs. To get to superintelligence you need research, compute and data. Research the labs do in house with OpenAI, Anthropic, DeepMind, etc. For compute, we have Jensen to thank and maybe Nvidia as well. On the data side, Turing Power is the data pillar. On the data side, there's been a significant shift in the last couple of years. So a few years back the Models weren't quite smart enough. And as the models have gotten increasingly smarter, the data needed to improve them has become harder to generate.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
And this is because it's more sophisticated data that's required to improve the models. It's like vertically specific people in task and workflows that isn't so obvious, like cat pictures.
Jonathan Siddharth
That's correct. And there's a shift in the data going from simple to complex. I mean, let's take coding for example. A few years ago, the kind of data set a contractor could generate might look like, hey, write a Python program to sort some numbers. Today the data that's generated might be write a B2B marketplace app that connects doctors with patients and write it on for Android with Kotlin, Java, Write it for iOS with Swift and write it on the web like with Next JS or something. That's the complexity. So there's a shift in going from simple to complex. So it's no longer the kind of data that low skilled, medium skilled contractors can generate. You need expert humans in every domain. The second shift is we've gone from teaching AI to take tests and pass tests to teaching AI to do real work. It's less about having AI pass the bar, it's more about can AI do the job of a lawyer, can it do the job of a privacy lawyer, a compliance lawyer, a paralegal? Having AI be good at doing economically valuable work, that's a shift. The third shift is we've gone from chatbots to agents. We started off with ChatGPT, where you're asking questions, getting answers, which is great. But now it's about the models becoming agentic, where they can execute complex multi step workflows in a real world business setting. And the type of data you need for that is totally different.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
How is that different? That's so interesting. So in the transition from chatbots to agents, how does the data required change with that transition?
Jonathan Siddharth
When you are training a chatbot, you'd usually do a lot of SFT and RLHF. SFT where you're giving the model input prompts and output completions. You kind of teach the model to imitate experts. With RLHF, you're basically teaching the model to produce responses that a human would tend to prefer. RLHF is used to train what's called a reward model and then the model is trying to produce completions that give it a high reward. With agents and let's define an agent. I mean different people define agents in different ways. I would define an agent as Something that's capable of taking action in the real world or in the physical world. Something that's executing a multi step workflow, calling different functions. The agent could be operating a computer or making backend API calls to actually do stuff. You might have an agent to file your taxes, you might have an agent to prepare your monthly financials. So to train an agent, you would also want to teach the model how to do tool use. So you teach the model how to call other functions, how to use other applications to be more leveraged. Today the dominant paradigm is reinforcement learning. And oftentimes these agents are trained through reinforcement learning where you'd build what's called an RL environment, which is like a mini world model for business. And in that RL environment you have input prompts, output verifiers, and you'd have the full system state tracked along with the data model. Let me give you an example. Imagine a workflow for a salesperson that an SDR would go through. Where before a sales call, let's say, the salesperson has to research the prospect. Look up Salesforce to see whether somebody from the team has already spoken with this human. And maybe if needed, looking up this person's contact information, maybe using Zoom info or something like that to reach out to them. This required this human to use three different tools, LinkedIn, Salesforce and ZoomInfo. In an RL Environment setup, you'd create a mini world model with clones of these applications that are created with a fake database with like synthetic data. And the prompt might be prepare for a call with this person. And then after the call is done, update Salesforce. Right? Like let's say that's the prompt and you have what's called a verifier to check whether the agent completed the task. And in this case, and this is where I think AI is kind of beautiful and somewhat magical. You set up the agents in this environment and the agent is going to try different trajectories, try different tool calls to try to complete the task. And you would set this up so that the curriculum is optimally defined. The curriculum is the set of tasks that you have this agent do. If it's too easy and the agent completes everything, the model doesn't learn much. If it's too difficult, the model doesn't learn much. You'd ideally want the right mix where the model is getting positive and negative feedback. It's very similar to the technique that AlphaZero used in mastering Go when the model played against itself. It's kind of like a form of Synthetic data because the agent is trying different approaches by itself. But it's humans. I mean, in this case at Turing, we created these RL environments at massive scale for every workflow you can think of across every function, across every industry.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
And so you create the RL environments to create the data that then allow the models to train to have further use cases like that.
Jonathan Siddharth
Correct. And we create RL environments for every industry you can think of. Retail, healthcare, life sciences. Imagine like this four dimensional matrix where the first dimension is every industry. Like financial services, retail, healthcare, podcasting. Maybe it's one of the dimensions. The second dimension could be every function, software, engineering, marketing, sales, finance, et cetera. The third dimension could be every role in that org chart. Let's say in sales there was SDR as like a role. The fourth dimension is a workflow that a human goes through in that role. You can think of every role a human has as like a composite of workflows. And we are creating RL environments for every workflow, for every role in every function in every industry. That's like $30 trillion of knowledge work.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Is that possible to have that breadth and quality?
Jonathan Siddharth
Yes.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
How?
Jonathan Siddharth
Time and lots of money.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Because I was speaking to candidly, one of your competitors, board members the other day in prep for this and he said the big thing that we all got wrong was we are so in innings one of the acquisition of verticalized data. There is so much room to run in the data acquisition for dental, for SDRs, for product managers. You name whatever function you want. Do you see us very much in innings one of the data acquisition for these very specific vertically focused workflows?
Jonathan Siddharth
Absolutely. It's innings one and I believe in slow takeoff. I'm sorry to pour cold water on all the AI doomers that might be listening to this, but we are not in a rapid takeoff scenario. I believe in slow steady takeoff for AGI and eventually superintelligence. So we're still in innings one. It's going to take a while before we get all of this data into the models.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
When we think about then the breadth that we go after and us specializing in RL environments. Just so I understand, like the marketplace that we sit in, because there is your macaws as your surges. How do you differ for those for people who are wondering, hang on a minute, I thought they were all one.
Jonathan Siddharth
Turing is a fundamentally different animal. So what we do is we are training superintelligence for all these frontier labs to get to superintelligence requires research, compute and data. The data needs have Significantly changed. It's more complex data rather than simple data. It's more real world data. Data that touches how real humans do knowledge work. And you need data to train these agentic systems. What the labs need in a partner in this new world is somebody that has research DNA that could be a proactive research partner for them because these paradigms keep changing. Last year at this time we were not talking about reinforcement learning at all. But then two things happened. O dropped in December, Deep Seq launched in Jan, and now it's all about RL environments. It's not just imitation learning, it's also reinforcement learning. So the labs need a data partner that's more research oriented. Second, the labs need a data partner that also touches the real world. So at Turing, we don't just generate data to train the models. For the Frontier labs. We also work with enterprises. We work with Disney, Pepsi, Blackrock, Fiserv, Johnson and Johnson to build fine tuned custom models to solve real world enterprise problems for those enterprises.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
So this is like FTEs that you send in to go and build custom models.
Jonathan Siddharth
Correct. So we touch reality, we know where the models break in the real world.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
How much of business is that FTEs deploying custom models versus more horizontal?
Jonathan Siddharth
So the horizontal business is bigger, but this business is also fast growing. And third, you need a platform with the world's smartest humans on it, as well as experts in different domains so that you can build these RL environments. And you need that platform to be really good at sourcing talent, vetting talent, matching talent and generating data. I think the era of data labeling companies is over. Turing is a research accelerator and it's now the era of research accelerators. The labs want to work with a proactive partner that can think about what types of data are likely to be helpful for these models and can make recommendations to them.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Why would you need a custom model? When you look at a lot of the customers that you mentioned there, for the ones where you have FDs who go in and build custom models, what is the reasoning around that? And is that a temporary moment in time or is that a permanent requirement from them for a certain reason?
Jonathan Siddharth
I think it's a permanent requirement. I'll give you an example. Let's pick an insurance company. For insurance companies, two really important problems they have to solve are underwriting and claims processing. Let's pick underwriting, for example. So with underwriting, the problem statement is you might get multiple types of unstructured medical data. It could be somebody taking a picture of their medical history on their Smartphone or it could be some OCR data from somebody's medical history or data in PDFs, etc. And a human has to look at that person's medical information and then decide is this person high risk, medium risk or low risk? What medical conditions do they have? Do they have cardiovascular, do they have renal? And how do you price insurance for somebody like this? Do you even take them on as a client? If you're an insurance company now this is a problem that an LLM can solve really well with a human in the loop system. You may not need a trillion parameter world model to do a task like this. In fact, there's lots of research that shows a smaller language model will actually be faster and more accurate at a task like this than a giant world model. And the insurance company also may not want their data to go back to like a Frontier model. So oftentimes in these cases what we would do is we would work with a Frontier lab, take one of their smaller models, maybe something in the half a billion parameters to 10 billion parameter regime, have an AI system built that's on prem with the customer, that's fine tuned on that enterprise's proprietary data. Because this insurance company might have data from like the last decade of humans making judgments, right? You want to make use of that data, but you don't want to help other competing insurance companies with your own data. So normally for these cases you would have a smaller fine tuned model that's fine tuned on your proprietary data, distilling your proprietary human knowledge into the models. That human underwriter that's been doing this job like has a lot of institutional knowledge in their brains. You'd want to distill that into the LLMs. And this human that I mentioned who's doing this job of underwriting, they might be using other internal tools inside this insurance company. You might want to automate those tool calls in the agent. So you'd build basically a version almost like a ChatGPT agent that's smaller and fine tuned for that specific workflow and use case. I do think this will become more popular across the board and it'll be a big market for the Frontier labs. If you want like a general purpose assistant, I think you need a trillion parameter model, right? Like that can answer anything to be a universal assistant.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Is that a good business for you? When you think about taking someone else's model, retrofitting it to a business, doing a lot of custom work with your own engineering teams in those businesses, is that a good business?
Jonathan Siddharth
It would be a good Business like we are still early, but it's growing pretty fast. We have this unique vantage point where because we are generating data for all the frontier labs, we get to see a glimpse of the future before it arrives. And the glimpse of the future that I see is that all knowledge work is going to be automated. If a human's job involves looking at a computer, analyzing what's on the screen, using different tools, using a keyboard and a mouse, it's going to be automated. It's only a matter of time. I mean these computer use agents are going to keep improving over the next decade. That's $30 trillion of digital knowledge work.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
My question to you is I spend a lot of time with very large companies, mostly where I speak to them. The thing that I'm astounded by is we hugely underestimate the pace of AI progression in terms of the technological capabilities. But consistently what I see is the laughable state of internal data, internal processes.
Harry Stebbings
I mean Jonathan, these guys are so.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Far off adopting Slack and notion, let alone building custom models and embracing the latest AI tools.
Harry Stebbings
I respectfully push back on the all.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Knowledge were Automated maybe in 20 years, but not in a 10 year time frame. Am I wrong?
Jonathan Siddharth
What do you think is the biggest constraint or obstacle?
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
The inability for them to try and implement new tooling.
Jonathan Siddharth
But what if the cost is too high? If they didn't do that? If the hypothetical insurance company that I told you about, if there was a competitor of theirs that could operate with 100th the headcount while delivering a better experience to their customers by pricing insurance deals better, making more money from insurance premiums, less claims payouts, they'll get their lunch eaten.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
I think they will. I think you'll see this transfer of value from old incumbent who can't adopt new tools to startup company, hence why we invest. Who is eating their lunch. Absolutely.
Jonathan Siddharth
Ah, I see. So your theory is that the incumbents won't adapt and it'll just be a forest fire and you'll just.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
100%. We will be on a 10 to 20 year decline of incumbents who are unable and unwilling to adopt new tools because of data, because of permissioning, because of internal buying processes. I mean I go to a European bank. You will be astounded by how bad it is internally, the poor quality of the processes of buying technology. It's abhorrent.
Jonathan Siddharth
I have a hypothesis. My hypothesis is that companies will be very slow with back office automation. But in the front office, for example, I speak with financial services clients in New York, like some of the biggest, biggest companies. I speak with the C suite of these companies and they are extremely interested in applying AI to help them make better investment decisions because it directly translates into helping them make more money. It's a lot easier to like convince people to use a piece of technology to make more money than to save money. And in financial services, it's pretty brutal, right? Like it's kind of an efficient market if there is alpha to be found in like how you can allocate capital, better make investment decisions, better figure out what opportunities to invest in price deals better. You'll get killed if you're not at the bleeding edge. And I've heard Mark Chen, the head of research at OpenAI, say this about how financial services is usually at the bleeding edge among all the other industries in the S&P 500, but even they are like about two years behind, usually relative to the state of the art. So I agree with you that in back office automation it'll probably be very slow and it'll probably be the upstarts that'll do things. Well, I think the change management will be too slow. But I'm optimistic about front office, especially in financial services, life sciences, pharma, where if you can accelerate the time to discover a drug or to get to a molecule, if you can help somebody win in the main thing that they care about in their industry, I think they'll adopt it faster.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
I'm always told by a dear friend, Rory o', Driscoll, at scale. I don't know if you know Rory, but fantastic investor. And he always says to me, listen, value generation from AI is fundamentally dependent on one simple question. Will we see the transfer of budget from human labor to AI technology? And if we see that transfer of budget, oh my God, the $30 trillion that you said it does. And if we don't, we operate in maybe a slightly larger software technology budget world, but by no means a world that we can have the valuations and the money that we have going in. When you look at that, are there any areas truly today where you're like, we have seen the full transition from human labor budgets to AI technology budgets.
Jonathan Siddharth
I think the transfer is pretty high in areas like customer support, copywriting, SEO, like some of these marketing related areas. As you would expect, the transfer is faster in these low risk to fail areas, like when it's relatively easy. But I encourage your listeners to look up GDP, Val, which is this paper by OpenAI, where they measured the impact of today's AI models in automating all types of economically Valuable work. It's a lovely piece of research. I encourage everybody to read it. Where they did this study, where they looked at, I think like 9 verticals and 44 occupations, they took like a very diverse sampling of different types of knowledge work. Everything from financial services to real estate to healthcare to law. And they took very specific occupations. And in those specific occupations, they took real tasks where, like, a real deliverable has to be produced. Imagine an engineer, like a civil engineer, creating a blueprint for a building they're about to build, or somebody who's at the set of a movie studio coming up with a schedule for how you'd organize your cruise. Like real work. Right. And for coding, you can imagine, like a real world software engineering project. And they saw that today's models were quite good at achieving parity with the best human experts in that field. We were roughly at, like I say, we. I don't know. Which side am I on? Am I on the side of the AIs or the humans?
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
I think you're on the side of the AIs. Oh, really? I mean, from that positioning, that would infer so.
Jonathan Siddharth
Yes, yes. But Turing is like a blurry line, right? Like, as the CEO of Turing, like passing the Turing test is about not being able to tell the difference. So what I noticed was about 50% of the time in GDP Val the best models were producing work that was indistinguishable from a human expert, which is remarkable. And kudos to OpenAI, where they also flagged that the number one model was Claude 4 Opus, although GPT5 was quite good also. And this was for relatively simple tasks, like a task requiring one single step. Whereas in the real world, if I give you a task to do a certain project, you won't just go off and do it. You might ask for clarifying information. You might do other things to acquire more context. You might go brainstorm with other humans to do that task. And you would do it in a sequence of steps so there's more room to go. But I think we are well on our way to AI eventually automating all types of knowledge work.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
What happens in that world? If AI automates all types of knowledge work, what happens then?
Jonathan Siddharth
Three things that will happen. First, we'll all have the potential to be 100x more productive. Today I'm able to run one company. Elon can run maybe like five companies. But in a world where I'm 100x more productive, maybe I'm able to run 100 companies. Elon maybe runs 600 companies every human will just be so much more leveraged. The nature of a job itself could change. Today we are accustomed to the idea of one person doing one job, but people could be doing multiple jobs at the same time. People could be running different companies at the same time. The second implication I think, is it's going to be wonderful for entrepreneurship. So you, Harry, I think, are going to be very happy because today, for a lot of ideas, founders are intelligence constrained. I think of being capital constrained as a form of being intelligence constrained. For example, if you pick a therapist who wants to start a mental health startup today, that founder would have to raise at least a few hundred K, if not a few million, to recruit some software engineers, maybe a marketing person or a growth person, maybe a product manager. But in a future where AGI exists, this person will recruit a marketing GPT, a software engineer GPT, a pmgpt, and get off the ground for a lot less capital. A million flowers will bloom. Lots and lots of non technical founders will start companies. We'll see a broader distribution of founders than just those who live in London or Palo Alto, who are kind of connected to these pools of capital, who might start companies, which I think is wonderful for the world.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you think we will? What I mean by that is there's six and a half million people today in the UK of the working population who actively do not work because of an inability to work. I'm not going to get into the analysis around that because I'll get in trouble for it. I think we grossly overestimate the intelligence of the general population and I know that sounds incredibly arrogant, but most people say, actually a lot of people just don't want to work and are not at the level of recruiting GPTs, assistants. Do you not worry that it will widen the chasm between those that have and those that haven't?
Jonathan Siddharth
I'm an optimist and I think the opposite will happen. Because what we're really doing when we are training superintelligence is you're basically training intelligence as an API. And what's the alternative to that? It's like hiring a human to provide you with that intelligence. That human is quite expensive and if anything, that creates an even broader gap between the haves and the have nots. Whereas for $20 a month, if you had access to the smartest experts in coding, in stem, in sales, in marketing, I feel like more people will be able to start companies and produce active, valuable work. I believe that when we have access to superintelligence, I firmly believe this. We are not all going to chill out on a beach somewhere and contemplate what do we do next? We humans, like we are tool builders. We are problem solvers. We'll solve problems at higher and higher levels of abstraction. In a world where we have AGI, we'll just solve much more exciting problems. Maybe we'll cure diseases, reverse aging, maybe go to the stars. I don't think we'll be bored.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
I'm glad. I often hear the UBI and we're going to sit and write poetry and I'm like, I think that might be a little bit challenging. When technology is not the moat, what is the moat? I had the founder of base44on and he said 99% of code in the next year will be written by AI. Technology is no longer the moat. What is the moat in that world?
Jonathan Siddharth
I think one moat will be data driven feedback loops. For example. One reason Google had such a great lead in search for a while was these data driven feedback loops that come from people using your product and generating data that gives you, the algorithm developer, a high quality gradient for which direction to step in. So the importance of PageRank was known. The recipe for ranking search results was well known among Google, Yahoo, Microsoft and a few others. Obviously people move around these companies all the time. But the advantage Google had was because everybody preferred Google and liked that search engine, you saw a much more representative set of queries. You had data from clickstream, from the clickstream of what results people were clicking on. That helps your algorithms improve at a much faster rate. I think data driven feedback loops will be key for all types of enterprise applications. Also today OpenAI and ChatGPT has a good data driven feedback loop in enterprises. Again, I think it's wide open. Whoever is deploying the right custom fine tuned models and agents for specific workflows or roles or functions or companies. If you get in first and solve a customer's problem really well, you start getting that flywheel going where you will discover first where the models don't work well. And you will use that data to work with a company like Turing to generate additional data to plug that gap. So you will improve. This is what I mean by it's important for the models to touch reality. I feel like the models have touched reality in consumer, we haven't yet touched reality in enterprise. And the only way we'll improve is by deployment.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
And that deployment is fundamentally predicated on handholding, correct?
Jonathan Siddharth
Yes. Hand holding. And I feel like there's still a lot of first mile schlep and last Mile schlep that needs to be handled.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
What does that mean? First mile schlep and last mile schlep.
Jonathan Siddharth
When I say first mile schlep, I mean for that underwriting copilot example that I gave you for that insurance company, I painted a pretty rosy picture of like how you take this model, you fine tune it on your proprietary underwriting data. In the real world, it doesn't work that way. First mile schlepp is, let's say I'm talking to the CEO of this insurance company or the CTO of this insurance company. They'll say our data is a mess. It's in silos, it's super fragmented. Some of the data is in spreadsheets, some of the data is in a file that Bob has. And Bob doesn't work here anymore. Right. The data kind of is all over the place. You first have to acquire the data, convert the unstructured data into structured data into a format that to fine tune LLMs, you might want to set up good infrastructure for evals. You'd want to create good evals for the models or agents. You might want to build a workflow designed for partial autonomy. So this human underwriter that's about to use this model to evaluate these medical histories, you might want to build a cursor like interface for them so that they can work alongside the AI to do their job. Also training the humans in these new workflows, you want to make sure you're collecting data the right way. The way, for example, we do deployments is like a tandem system where you'd have a human and AI doing the same job for a period of time where a manager can see the output of both. If the agent is right and the human is wrong, you train the human. If the human is right and the agent is wrong, you've created a data point to fine tune the next iteration of the agent. So the agent is steadily improving over time.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
If the agent is right and the human is wrong, why don't you just fire the human?
Jonathan Siddharth
I mean, it depends on at what frequency you track things like precision and recall. You'd want to analyze this over a period of time. You wouldn't fire them over a single mistake.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Bit harsh.
Jonathan Siddharth
Yes.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
What's the margin on that business?
Jonathan Siddharth
It varies. We're also in the early innings of figuring out how to price that. Today we do it in a relatively simple way where we're just billing these things for time. I don't think that's the right way to do it. We'll switch to A more value oriented pricing model at some point. Right now we are just laser focused on the Frontier 8 labs. So enterprises for us is a longer term play.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
When you look at revenue numbers in this space, a lot of people shout back, they're not revenue numbers, they're gmv. Given our understanding now that Buntney, there's no talent acquisition from your business. It's all an RL environment creation business. When you look at the other announcements of alternative providers, can you help me understand are they revenue or are they gmv? And is there mislabeling being done here?
Jonathan Siddharth
So I don't want to comment on other companies, but we think about revenues differently. Like the way we think about revenues is more in terms of GAAP revenues.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
It's like traditional revenue numbers. I'm an investor, Jonathan. Essentially I'm trying to understand and learn from you of how I should weight revenue in today's AI world versus the previous historical world. Should I be impressed by these revenue numbers or should I not?
Jonathan Siddharth
I think it depends on the type of revenue. Obviously These are not SaaS ARR numbers. These are not those types of revenues. This is a different beast. I think this requires thinking from first principles. The revenue here is reoccurring in the sense that oftentimes when you're working with a lab on helping the models improve in some area and I'll speak to Turing, I don't want to speak to other companies when we are helping a lab, let's say, improve their models for coding or multimodality or tool use, or are working on RL environments for automating all types of professional knowledge work, it's usually a reoccurring project. Projects will start, projects will end. And as long as you're doing a good job, there is lots and lots of demand, but you have to consistently keep doing a good job. It's also important to be a trustworthy partner to the labs. Like, we take secrecy very seriously. We make sure that our projects are all firewalled between labs, oftentimes even with teams within the labs. Like sometimes that's the level of secrecy that you'd need. I'm reminded a little bit about how I've been told Foxconn operates. I mean, I don't know anything about that, but I've been told that they have different floors where maybe on one floor the iPhone is getting made and on another floor maybe a Pixel phone is getting made. And obviously you have to firewall all of that.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Of the eight largest providers, do they not spend with all of you they.
Jonathan Siddharth
Spend with a handful of companies. They do that to have some level of resilience. And I imagine there's some price benefits to having more than one person that they could work with. But I think the resilience piece is important. I mean, we know what happened with the, you know, when the scale investment happened. The labs did benefit from having other partners that they could work with. I would say it's a small handful. It's a small handful that are trusted. And of course, there's probably a giant pool of smaller startups, but it's a small handful of big companies in the space.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Which one do you worry about most?
Jonathan Siddharth
So this is just a big, big market that's growing super fast. I'm excited for all the companies in the space. I feel like different companies come into this world with a different DNA.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Which leaded you most respect?
Jonathan Siddharth
Sam Altman, Elon Musk of the data providers. Jonathan of the data providers.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
I'm pushing you, dude. This one, I'm going to get a name.
Jonathan Siddharth
I mean, I have a lot of respect for Alex Wang from Scale AI. I feel like Alex and Scale were prescient in seeing the importance of data, and I admired how having started in autonomous labeling, like how they kind of navigated the ups and downs. Yeah, I really like the way he operates as well. I feel like there are certain elements of leadership that I think I share with him and I think he did a great job for scale.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
How did Scale being acquired impact Turing's business?
Jonathan Siddharth
We just got flooded with a lot of demand, and we've also amped up pretty significantly in multimodality. Multimodality was something I think Scale was quite strong in. Multimodality is teaching the models to operate well with not just text, but audio, video, image, et cetera. Outside in. I've heard that because of their roots in autonomous labeling, like, they were quite good in multimodal stuff. It was good primarily from just increasing demand. I feel like they were the company that had been working in this space the longest.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do they have a business left? Again, I mentioned Rory o'. Driscoll. I think he said in a show with me recently that there's this kind of carcass or husk left behind. If everyone benefited from their being bought, they can't be doing that well.
Jonathan Siddharth
I don't know enough about their business in terms of.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you pay attention to competitors?
Jonathan Siddharth
I pay attention to competitors in terms of things that they do well and when there are any significant learning opportunities from them that could help us serve our customers. Better.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you worry about revenue concentration? You said about kind of eight of the biggest labs. Say if you look at like in OpenAI, they have, I don't know, whatever it is, 100 million, you will know these numbers much better than me. But say 100 million paying customers, I'm just taking a 10% on a billion people, but give or take 100 million, whatever. And then you look at say a business like ours here, where there's like seven core customers. How do we feel about revenue concentration?
Jonathan Siddharth
The last time I checked, I was told that for Nvidia, 39% of their revenue comes from two clients and roughly 50% was like four clients. So in this market, I don't worry that much about revenue concentration. I think we are in the same boat as Nvidia in that there will be lots and lots of spend from these big eight companies. Look at the scale of the spend, right? Like Stargate is like 100 billion a year investment on computer, there's going to be significant amounts of spend on compute, energy and data. It's a little weird to have this level of concentration, but things could change. I think it's also possible that governments spend even more. I think it would make sense for governments to build their own internal versions of some of these models, which would require proprietary data again to be collected.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you not think they'll have to? And what I mean by that is we'll see sovereignty of models. I do not think there's any way you'll have the German healthcare system working with American model providers.
Jonathan Siddharth
I think you're right. I think it'll be necessary in that world.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you provide that FTE service to governments?
Jonathan Siddharth
Yes. The work that we are doing in not just training superintelligence, but in deploying superintelligence is with that goal in mind for these governments. I imagine we'll help them not just with fine tuning their models with data and evals, but also helping them with the first mile schlep and the last mile SHLP to make these systems actually work. And you might want to have full control over what type of human data is going into these models. Like if you're the German government, presumably you want German nationals to be the ones that are contributing data, whether it's for SFT or reinforcement learning.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
So I'm worried that we are potentially not going to see AI deliver the immediate revenues that we've promised. And we're going to go through a kind of cooling period which everyone kind of suggests and thinks that we're going to go through. In the next six to 18 months, which is, as I said, it doesn't hit the revenues that we said it would. And kind of the AI bubble deflates slowly. To what extent do you think that's possible? Or we'll see this continuing gradual increase as we kind of touched on there.
Jonathan Siddharth
I don't see an AI bubble. These models are incredibly powerful today. GPT5 is like fucking awesome. I don't know what people were talking about when they're talking about, you know, I know there was some chatter. I think we've just gotten used to magic. I feel like a. These models are incredibly powerful today and they're the worst they'll ever be. They're only going to keep improving. I say that about the Gemini Pro models, the GROK models, the Claude models. These models are amazing. And there is a very significant model capability overhang. By that, what I mean is the models are capable of X, but what we are getting out of the models is X minus delta. With the right agentic scaffold around these models in terms of the right system prompts, the right user prompts, giving the models access to the right context, teaching the models how to acquire additional context, teaching the models how to use the right internal tools, there is significant amount of capability that can be unlocked with today's models. For example, you, Harry. I imagine when you do an interview with somebody, one of the things you probably do is you apply your secret sauce to pull out the right clips from the interviews. Like what to highlight, what are the catchphrases, what will drive more engagement that can be done by a model with the right agentic scaffold that's fine tuned on all the work that you've done in the past.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
I wish every weekend I go through every single show and I pick out 15 to 20 clips per show and then I make notes on each one.
Jonathan Siddharth
Yeah. Are you saying, Harry, that you want to use Turing?
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
If you could fucking make it work, dude, I'd pay you a lot of money.
Jonathan Siddharth
Yeah, maybe we should partner.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
That'd be great. Seriously, every weekend I spend probably three hours per show, definitely 12 hours a weekend doing that.
Jonathan Siddharth
I think there is this model capability overhang where the full potential of the model has not been unlocked by humans yet. And no, I don't think there's an AI bubble. I think there are some growing pains.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
What are the growing pains?
Jonathan Siddharth
I think everybody keeps citing that MIT report about how 95% of pilots fail. But because we are in the business of deploying AGI in enterprises, I can tell you Why I think that happens, which is one of the growing pains. Step one is that most enterprises need to do some work to structure their data in the right way. Again, that first, mild schlep has to be done. Second, you should surround the model with the right agentic scaffold that I just described. The right prompting, the right context engineering, the right internal tool calls that you should teach the model to call. All of those have to be distilled into the models. You need really good evals. You also need a workflow designed for partial autonomy. Andre Karpathy like articulated this first when talking about why cursor works so well. Because it's not designed for full autonomy. It's designed today for partial autonomy for humans to collaborate with the AI to do that specific task. So that cursor for X needs to be built for every role, for every workflow to help humans work more easily with the models.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Does every role need to go through that pathway of cursor for X before it goes to full autonomy? Or are there some roles like customer service where it just goes to full.
Jonathan Siddharth
Autonomy, some roles where you can see that the models are quite good at matching humans? I think we don't need that intermediate step. There are certain roles where by virtue of how the models are trained, where they're pre trained with tokens on the Internet, and then of course with talent from research accelerators like Turing, that's fine tuning the models. But there are certain types of roles where the tokens from the Internet give it sufficient intelligence to do the job well. Customer support is an example. But if you pick other roles, like for example, if you picked the role of, let's say an AI researcher or you pick the role of a lawyer specializing in venture capital financing, it's possible there's not enough of those tokens on the Internet so the models will be relatively weak. They're out of the box. And also the way Wilson Sonsini does financing might look different from the way a coolie does it. Maybe they have their own way. So you might want to fine tune them on your own proprietary data, distill the proprietary intelligence of humans working there. So for those things like you may need to do some fine tuning, the models may not work very well out of the box.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
A lot of people suggest the circular deals between some of the large providers suggest the strains in the ecosystem or the bubble like tendencies. Do you think that's fair or not?
Jonathan Siddharth
I sort of categorized the world into two classes. Class one is those that believe in AGI. Let's call it The AGI pilled group that believe that we are on the path to getting to AGI. And let's define AGI as an AI system capable of at least matching humans in almost all types of intellectual knowledge work. Then there is a world, maybe another category of people that don't believe this will happen. We'll hit a wall. And in the past there have been other AI paradigms where we did hit a wall for the camp that believes in AGI, and I believe in AGI, unsurprisingly, but I love AI and it's been my passion for the last 20 years. I really believe that we will get there. If you believe that the grand prize is so amazing, right? Like if you've solved intelligence, you've solved all of humanity's grandest problems, from curing diseases to potentially pausing aging, to interstellar travel to energy. All of our problems are intelligence constrained. So the prize is so large. I mean, whoever wins the superintelligence race will probably win search will probably win, consumer devices will probably win, operating systems will probably win cloud, like business productivity software. It's like the prize is so massive that it's worth placing big forward bets in these areas because the cost of not winning is too high. Whoever wins AGI would also probably win social networking. So you can see why the big eight are excited about it because you're playing for everything. It's like whoever wins this could be responsible for that $30 trillion of knowledge, work, automation.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
If you're Zuck, you spend $100 billion on it. If you lose or fail, likely everyone else will fail, in which case you're behind just like they are. And you've lost $100 billion, which isn't a huge amount of your free cash flow. Maybe 12 to 18 months of free cash flow. If you don't spend that $100 billion and someone else is does and wins, you lose 2 trillion, $3 trillion of market cap.
Jonathan Siddharth
Correct.
Harry Stebbings
You have to play.
Jonathan Siddharth
You have to play. Imagine if somebody built a more engaging social network and social networks have just one unit, which is attention. I mean, we all only have maybe four to five hours a day to spend on an app. If there was a more engaging app, then, yeah, those are high stakes.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you agree then with the notion that if you believe in AGI, you cannot be investing in SaaS apps?
Jonathan Siddharth
Yes, I absolutely believe that SaaS as we know it, I think is over. I think it's completely over. I feel like Quite a few SaaS apps were built at a time when software was relatively hard to build and complex to build. Imagine if you were building some customer support software, some customer support bot. To build a company like that, you would have had to hire some Stanford PhDs in NLP. You'll collect data for six months. You'll use like a support vector machine or a neural network. That'll kind of sort of work and then you'll deploy it and you'll grind away for a while. There is a significant amount of capital that needs to be invested to get an app like that to work well. So it made sense for many companies to not bother doing that if that's not their core business. Let me just use like some third party SaaS app. Now many of these AI applications are incredibly easy to build on top of these LLMs. So I feel like most companies will start building custom software super easily. I mean we help companies build some of these custom apps. The bar to create many of these apps significantly comes down. So that's one risk. Risk number one, companies do it themselves. Risk number two is you get sonic boomed by the foundation model.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Companies sonic boom. Meaning they move into the apps layer and just create it themselves.
Jonathan Siddharth
Yes, it could happen, right? The models are becoming agentic. I mean you've seen many of these agents, right? It's fundamentally, it's about computer use agents. If the models get better and better there, it's possible. Like the model is all you need. Imagine if you wanted the model to. Let's say I'm doing some HR thing. Hey, update my medical benefits information to add. We've just had a new daughter, we want to update my medical information. If the model is agentic and it is sufficiently integrated into the database of the organization, you don't need anything else in the middle. So that's the second risk. The model is becoming more agentic. The third. And I worry about this a lot. I feel like a lot of our software was designed to be used by humans. Humans navigating a GUI and clicking around and doing things, I think that's going to go away. I think of four pillars to superintelligence. It's multimodality, reasoning tool use and coding. Multimodality is important because we humans interact in natural language. We talk, there's video. The future might look like some type of ambient AI that you talk to that will just go and do things and maybe it'll use the GUI of the current SaaS application as like an intermediate step or it'll use MCP and use tool calls and get what it needs. The GUI was like designed for a world where humans were using a keyboard and a mouse and clicking around and doing things. I think humans can do better things with their time than click around.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
And actually one big change that I have is I never actually type emails anymore. I use whisper flow and it's so good in transcription that I don't ever type emails ever. Now the only trouble is everyone knows what I'm saying in my emails.
Jonathan Siddharth
The whisper founders interned at Turing back in the day.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
No way.
Harry Stebbings
So how do I feel about it?
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
No, I don't agree.
Jonathan Siddharth
Why?
Harry Stebbings
Because the average company today has between.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
80 and 100 different SaaS products that they engage with to one, just the multitude of how many they'd have to create, number one, number two at maintaining them.
Harry Stebbings
You think they're going to maintain 80 to 100?
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Oh my God.
Harry Stebbings
You're going to have like just teams.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
And teams of people doing maintenance and updates and debugging. I don't think so either. And that is for the technology savvy.
Harry Stebbings
Let's talk about every plumbing provider, law firm, accounting firm, restaurant that can barely.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Use WIX and Squarespace, let alone build out their own CRM system and POS system. Not a fricking chance.
Harry Stebbings
And then we move to foundation model companies moving into very vertically specific elements.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
We're in a business that does AI for patent creation, updates and collaboration. Sam is not going there. Sam has health solving, cancer, energy utilization. I don't think Sam's touching patent creation and updating. And so I think for the more verticalize you go, the more defensibility you have. And so I think for those reasons SaaS has life. Mine is a very biased perspective because it's my job.
Harry Stebbings
Is that all wrong?
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Again, you're the master here, Jonathan. So VCs are literally middleman, I'd say.
Jonathan Siddharth
Harry, you have an interesting data set because you invest in a ton of startups. So I would be curious looking at your sample of startups that you've invested in and just tally how many SaaS apps they use today at every stage and see if that has changed. Like post ChatGPT. My hypothesis is that today's companies use fewer SaaS apps and have fewer people.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you think we have more or less software engineers in 10 years?
Jonathan Siddharth
More, I think the definition of a software engineer will change. A Stanford doctor who's in oncology who has an idea for some cancer detection type app. Now that person will be able to create a very simple version of an app that somebody could check by themselves and do a home diagnosis. I think there'll be More software engineers. Because if you define a software engineer as somebody who's capable of building a software product to solve a real problem, that pool of builders is going to expand way beyond people who've graduated with like a four year computer science degree.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
So we have more software engineers creating more software and the problem then becomes discovery. How do we solve the discovery problem in a world of infinite software?
Jonathan Siddharth
You might have an agent for yourself that's talking to other agents on the Internet. You might have an agent of yourself that's discovering startups to invest in. And while you and I are chatting, I don't know, you might be having a million conversations with entrepreneurs from all over the world who today you're kind of constrained by space and time, but in the future you'll only be compute and data constrained.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do we lose the phone as the interface to this world? We obviously see Sam and Johnny. There's rumors of pendants and some hardware devices. I'm not asking you to comment on that. I'm just saying does the foam still remain the primary interface and design device?
Jonathan Siddharth
I think we'll have some type of a device that we'll carry that's always on and processing multimodal tokens. For example, as I'm talking to you, if I were to envision my perfect device, it would be something that has. It should have cameras. So maybe it's a wearable as a glass or something that I'm having on me that's processing visual input because I want to be able to read your body language. I might have like an airpod like thing in my ear that's whispering to me that maybe, says Jonathan, as you were talking about multimodality, Harry seemed less interested. His body cues suggest that he was losing interest. But when we were talking about ar, he perked up. So those types of feedback and cues I think would be good. So I envision a device that I think of it in terms of sensors and effectors. In terms of sensors, like obviously it has to be listening to stuff, it has to be seeing stuff. But in terms of effectors, it'll probably also be speaking in my ear. Ideally it should be something that you can talk to and have it do things later. For example, I might say, remind me to follow up with Harry on that idea for using Turing to automate clip generation. So it has to remember that and come back later. So I do think there'll be all sorts of new devices and glasses hearing like these airpod type devices seem obvious. There could be like, do you remember this device that was called the Meeting Owl. No, like during, like the COVID era. One of the tools that sort of spiked was like this. It was basically like a speakerphone for, like having better distributed team zoom meetings. When somebody's talking, it would focus on them with a camera. It was also a decent speaker. So I can imagine devices like that. So it's hard to predict. But the thing that I almost feel confident about is that the phone will look so different. I mean, when we think of our smartphone, it's basically a computer with like a phone app in it, right? Like, the phone app is like the least interesting part of the phone. And I think even for an AI device, it'll probably have some phone app in it. But everything else I feel like will be magical. I feel like I would definitely benefit from a device that's constantly listening to everything that I'm listening to, constantly processing all the video audio input that I'm processing. Something that's paging things to memory. Like, maybe it'll like write things down and be able to look it up later. I see it almost like an extension of my brain.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Before we move into the Quick Fire round, I do just have to ask, what does your market and the data provisioning market look like in 10 years? I always try and think about like market composition and dynamics. Is it a winner take all? Is it a very fragmented. Is it a three or four? What does that look like?
Jonathan Siddharth
The market will reward players with research depth because the pace of AI research is so rapid. Like all these RL environments like this spiked in like the last 12 months after O1 came out in December and Deep Seq came out in Jan. So now it's like in addition to imitation learning, we are in this reinforcement learning regime. One year later, it could be something totally different. So I think the market will reward a company with research DNA and it'll reward a company that can move fast and adapt very quickly. Because this is. I mean, when you're.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
So do you think this is a monopoly market or do you think there will be many winners?
Jonathan Siddharth
I think there'll be a few winners. I do think for the labs, it helps them to have a few partners for resiliency, I imagine also for price competitiveness. I think there'll be a few winners. In the realm of robotics and embodied AI, we are still very early at Turing. We are scaling up on the robotic side as well in terms of data that we generate. But there's so much data that's missing that the models need to see that they haven't seen yet. I can totally imagine some newer companies also coming up that, that don't exist today.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
If you were to invest in companies in your space, where would you invest?
Jonathan Siddharth
Probably in robotics or embodied AI. The vertical stuff. Like, I mean we are scaling up pretty massively in generating data for different verticals. I don't see that as like a big white space, but I think everybody is relatively early with robotics and robotics is such a vast realm that there could be interesting things to do there. One way I see the space, again, think of like these three dimensions. The first dimension being the type of intelligence that you're baking into the models that could be encoding in stem in functional expertise like sales, marketing, software engineering, or vertical expertise like healthcare, legal, finance, et cetera. So the first dimension is the type of intelligence. I do a cross product of that with the modality, audio, video, image, computer use. So that's multimodality. That's the second dimension. The third dimension is multilinguality, like different languages. The fourth dimension is different learning paradigms like imitation learning, reinforcement learning, pre training as well, which is unsupervised learning. All of those may require different platforms to be built. Like we've had to adapt our platform for imitation learning, for reinforcement learning, for multimodal modality. So I feel like in this matrix there's like all sorts of new opportunities that could emerge. And I only listed the digital intelligence. I didn't talk about physical intelligence. So I think robotics is like wide open. I mean the kind of data that a robot that is in someone's home is totally different from like a robot that's doing things in a factory and humanoid versus non humanoid robots.
Harry Stebbings
I could talk to you all day.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
I do want to move into a quick fire answer. I say a short statement, you give me your immediate thoughts. What's one widely held belief about AI that you think is wrong?
Jonathan Siddharth
I don't think we'll see rapid takeoff. I think we'll see incremental continuous improvement in AI. I actually think this is good for the world because if what we believe happens, which is all types of digital knowledge work, gets automated, I think humanity needs time to prepare its workforce. I think we could use the extra time to upskill humans, to rethink education, to make sure there isn't massive job displacement. I also think in the steady continuous improvement in AI models, there'll be value realized every step of the way. Unlike self driving cars. I feel like people have this wrong model for AI that comes from self driving cars, where you get it 99% of the way accurate. And if you can't solve the last 1%, they're not useful. AGI is not like that. I think when we automate the job of an underwriter or a claims processor or a CEO, there's incremental value that's unlocked for every percentage. Improvement in the model is becoming more reliable. I believe in slow and steady takeoff and that's actually going to be great for the world.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
You mentioned Deepseek a couple of times. Do you think we underestimate China?
Jonathan Siddharth
It depends on who you ask. The folks that I work closely with don't underestimate China. I think it's very impressive. Like the progress that they've made in open source with Deepseek. Kimiketu Kwen these models are state of the art. So no, I don't think, at least among the frontier AI circles that I'm in, I think there's a clear realization of how close they are.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
The world seems to be moving to closed models. Is that good or bad?
Jonathan Siddharth
I think it depends on the application. Firstly, in enterprises it's often a mix between closed and open models. We do see demand from enterprises that want either. The closed models often are easier to get started with, but there are some cases where enterprises prefer open models for cost customizability. And I'm talking about the small language model regime between the half a billion parameters to 10 billion parameters. I worry a little about frontier models. I feel like for frontier models there is some value in keeping some of the technology closed just because of how powerful they are. And I feel like the US labs are extremely responsible and safety conscious in how they think about training these models, deploying them.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Do you think Elon is. You mentioned reading his book earlier. Elon is often chastised for his lack of care around some of the training elements. Do you think he is and do you think actually he'll benefit from not having that God rail?
Jonathan Siddharth
I think Elon is also. He cares a lot about humanity too, at least in his book. One of the things I recall reading is his motivation for getting into AI was that he wanted an AI that was speciest and loves humanity. That was one of his reasons to get into it. Everything I see about the GROK team, I feel like their goals are much like any of the frontier labs, like quite noble in terms of having this powerful AI that can help humanity understand the universe, solve some of our biggest problems.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
What did you believe that you now no longer believe?
Jonathan Siddharth
I used to believe that to build a enduring, valuable company, you hire a strong exec Team operate with a lot of leverage, hire great people and get out of the way. I used to believe that. Now I believe you hire great people and work really closely with them and their directs and their directs and their directs. Get as close to the ground as you can. Where ground truth usually exists with the customers. The next step to customers is the engineers writing code and the salespeople talking to your customers. Now I believe in being basically I used to like, for lack of a better word, follow the org chart a little bit. And this was also part of one of my learnings from Elon's biography is that he was so hands on, like he would be like walking the factory floor and asking an engineer why this door in the Model 3 has three bolts instead of maybe two. It is a different way to operate where you're in the details of the most important things that matter completely working in like a flat structure, operating as close to the to the ground truth as you can. I feel like in the early days of starting Turing, like now that I think about it, I may have had a subconscious desire to be liked. I think I must have. Now I don't. Now I just think about just doing things that would solve our customers problem the best.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
What was the most unpopular decision you've taken with Turing?
Jonathan Siddharth
Turing is switching from a distributed team to a hub and spoke model. So we are now working from an office in San Francisco and we've recently opened an office in Palo Alto. We're going to be opening an office in London as well. For some people, like that wasn't very popular. Some of them left and yeah, we like in person.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
We're big fans of in person here.
Harry Stebbings
Final one.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
When you look forward to the next decade, what are you most excited for? So like for me, my mother has Ms. I think that we'll have some pretty groundbreaking breakthroughs in our mass drug discovery that we haven't had for whatever that excites me.
Jonathan Siddharth
I'm excited about AI making new discoveries, automating AI research itself to get to a point where AI is in some self improvement loop so that we could get to superintelligence faster. So automating AI research and getting AI to the point of making new breakthrough discoveries. I'm excited by that. And I've always been fascinated by AI as like this exoskeleton that makes you a lot more productive. Have you watched the Iron man movies? Yeah. In the early Iron man movies he's wearing the suit and the suit is obviously giving him superpowers. Right. And then in the later ones. The suit is quote unquote agentic where like he has these drone suits, right? Like where there's like an army of his suits that go off and do things. I'm excited about a future like that where every human on the planet has access to agentic AIs that help them amplify their fullest potential. Today Harry might have 100 ideas, but Harry's able to do maybe two of them really well. I like a future where Harry can do the remaining 98 and I like that for like the 7 billion humans on Earth.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
Jonathan I love conversations which are very natural and free flowing. You can tell that I don't really pay much attention to the schedule, but you've been fantastic. So thank you so much for joining me.
Jonathan Siddharth
Thank you Harry for having me.
Harry Stebbings
But before we leave you today, are you drowning in AI tools? ChatGPT for writing, Notion for docs, Gmail for email, Slack for comms and you're constantly copy pasting between them all, losing context and losing time. This is the AI productivity tax and it's killing your output. At 20 VC we're all about speed of execution and Superhuman. Superhuman is the AI productivity suite that gives you superpowers everywhere you work. With the intelligence of Grammarly, mail and coda built in, you can get things done faster and collaborate seamlessly. Finally, AI that works where you work, however you work. Superhuman gets you from day one with zero learning curve and it's personalized to sound like you at your best, not like everyone else using generic AI. Get AI that works where you work. Unlock your superhuman potential. Learn more@superhuman.com podcast that's superhuman.com podcast and just like Superhuman gives you superpowers in your inbox, Fanta gives your company superpowers.
Guest or Interviewer (possibly Rory O'Driscoll or another industry expert)
In security and compliance.
Harry Stebbings
Customer trust can make or break your business and the more your business grows, the more complex your security and compliance tools get. It can turn into chaos. And chaos is isn't a security strategy. That's where Vanta comes in. Think of Vanta as your always on AI powered security expert who scales with you. Vanta automates compliance, continuously monitors your controls and gives you a single source of truth for compliance and risk. So whether you're a fast growing startup like Cursor or an enterprise like Snowflake, Vanta fits easily into your existing workflows so you can keep growing a company your customers can trust. My listeners can get $1,000 off Vanta by going to vanta.com 20vc that's v a n t a.com 20vc 20vc for $1,000 off Vanta. And like Vanta builds trust with automated security. Angellist drives momentum by helping you find and fund breakout startups. If you're listening to 20 VC you know we have a really freaking high bar. Well Angellist is the modern platform used by the best in class venture funds where over 40% of top endowments and banks are LPs. Their customers include a top five venture firm, 20VC and they now have, check this out, $171 billion of assets on the platform. They combine an all in one software platform with a dedicated service team that moves as fast as you do. One manager said this awesome quote, angellist feels like an extension of my fund. Another said Angel Angellist gives me total peace of mind. The attention to detail, lightning fast response time and just real sense of ownership from the team are exactly what I need to stop worrying about back office ops. So if you're starting a new fund, don't be a moron. Just use Angellist. They're incredible. Head over to angellist.com 20vc to learn more.
Episode Title: Scale, Surge, Turing, Mercor: Who Wins & Who Loses in Data Labelling | Is Revenue in Data Labelling Real or GMV? | Why 99% of Knowledge Work Will Go and What Happens Then? | Why SaaS is Dead in a World of AI with Jonathan Siddharth @ Turing
Date: December 1, 2025
Guest: Jonathan Siddharth, CEO & Founder of Turing
This episode centers on the evolution and future of data labeling, research acceleration, and knowledge work automation in the era of AI with Jonathan Siddharth, founder and CEO of Turing. The conversation explores the differences between data labeling and research accelerators, the validity of revenue claims in the space, the implications of automating all knowledge work, the future of SaaS in an AI-driven landscape, and what real-world AI deployment looks like. Jonathan shares insights into where value pools are shifting, what moats exist in a world where software is commoditized, and how work—and workforces—will transform as AI progresses.
Defining the Shift
Complexity of Data Has Changed
Agents, Not Just Chatbots
On Automating Knowledge Work:
“All knowledge work is going to be automated. It's only a matter of time.”
— Jonathan Siddharth (00:00)
On Why SaaS is Over:
“I absolutely believe that SaaS as we know it, I think is over. I think it's completely over.”
— Jonathan Siddharth (46:36)
On AI Model “Overhang”:
“There is a significant model capability overhang ... the full potential of the model has not been unlocked by humans yet.”
— Jonathan Siddharth (41:40)
On Enterprise AI Adoption Pace:
“I believe in slow takeoff. I'm sorry to pour cold water on all the AI doomers ... but we are not in a rapid takeoff scenario.”
— Jonathan Siddharth (11:39)
On Moats in AI:
“One moat will be data-driven feedback loops ... The importance of PageRank was known ... but the advantage Google had was ... data from clickstream ... that helps your algorithms improve at a much faster rate.”
— Jonathan Siddharth (28:21)
Jonathan Siddharth paints a future where traditional data labeling and SaaS business models are relics, replaced by intelligent research accelerators and deeply integrated, agentic AI systems. The winners will be those who combine deep research capabilities, rapid adaptation, and a feedback-driven approach to both enterprise and model training. While disruption to established business processes and roles will be profound, Jonathan remains optimistic that productivity, innovation, and entrepreneurship will flourish as AI supercharges human capabilities.