
ElevenLabs CEO and co‑founder Mati Staniszewski joins Jennifer Li to explain how the team ships research‑grade AI at lightning speed—from text‑to‑speech and fully licensed AI music to real‑time voice agents—and why voice is the next interface for human‑computer interaction. He shares the small, autonomous team model, global hiring approach, and how the Voice Marketplace has paid creators over $10M while evolving into an enterprise platform.
Loading summary
A
We don't want to become same as previous generation of the editing suites. So instead let's solve it on the research level where it will know based on the voice exactly how it should speak with the speed to be able to cater to all those different use cases. You need such a big array of different voices, different languages, different accents, different styles. So we launched Voice Marketplace where you could create your voice and then share it. And when the voice is shared, you earn money in the return. Today we have almost 10,000 voices. We paid $10 million back to the people in the community. There are some crazy stor from the voices just speaking through technology showing the examples of the global race initial knee jerk reaction.
B
Today you'll hear from David Sack, Mark Andreessen and Ben Horowitz Voices for America for human computer discuss the Trump administration. Today you'll hear from AI Policy co founder Innovation building at Lightning and how the US can lead on energy chips and avoiding the real time voice regulation as well as how small autonomous teams and global hiring power the company's product velocity. We discussed the ethics of AI audio, the voice marketplace paying creators over 10 million and the shift from creator brand to enterprise platform. Plus why speed is the moat in the race to define the future of sound. Let's get into it.
C
I'm excited to welcome our first speaker, Maddy, co founder and CEO of Elam Labs. All right, so good to have you here.
A
Matti, thanks so much for having me here. Great to see everyone and good morning.
C
That was the Wallcom music generated by eleven Labs, was it?
A
It was. We expand continuously across the audio space. So we started with voices, then created orchestration of how to build voice agents and now also create a fully licensed music model so can produce AM to go alongside of it.
C
Awesome. We'll talk all about that. I had the opportunity and also the luck to get to know from the very early days when 11 labs got started and got to partner over the last three years to just see your execution everywhere from product launches to shipping new lines and models like you just mentioned, everything from text to speech models, speech to text. And then we started doing music sound effects and now the AI agent platform. I'm very curious. First I'm still in awe of the shipping speed after all this three years. But I want to ask how do you actually maintain both the speed and quality when you have such expensive product roadmap?
A
So first of all we partnered almost three years ago and so it's great to hear all the kind notes but also what I didn't realize when we partnered, the infrastructure team was free people and of course now I'm 11Love's founder. We love number 11 and the company infra team is 11 people. So we've seen the growth of the other side as well. And I hear that the companies here raise $66 billion in the total fundraising. So the number 11 is everywhere here. But I think that to start off, I think first piece, I think the smartest person I got to know as my co founder, Piotr, who has been the research brain for creating a lot of the models and then being able to assemble what we think are the most incredible researchers in the voice space to really create the first text to speech model that could understand the context in a better way and turn that into the emotion intonation. Then find a way to capture the characteristics of the voice so you have the voice sound with the right style, with the right age, with the right gender, dialect, everything in one. And then the researchers across of course now expanded that to speech, to text, music and other work. So that's our foundation. And then the way we structure it to be able to ship quickly, especially with so many things happening in AI space, is a lot of small teams. So today we have roughly 20 product teams, each of five to 10 people size, which with full independence can go ahead and ship products. Of course that carries some of the sometimes issues of duplicative work or sometimes people going at different speeds. But at the positive end, the ownership of each of the teams is extremely high. So people know that this is down to them to really deliver and ship and it allows us to move extremely quickly. We bucket our work into creative space. So creative platform where we help with narrations, voiceovers, dubs for creating creatives and creatives in the media entertainment space. And then on the agents side where we help people recreate voice agent experience, conversational agent experience across customer experience, all the way through to immersive media.
C
Great. Eleven Labs has labs in the name very similar to many of the other big labs. Which means you're doing your first party R and D and model development, but also building all these 20 products. How do you think about balancing both? Like keep progressing on the model research, but at the same time not delaying sort of the product launches.
A
Yeah, it's very tricky. I'm sure many of you have the same thing, like do you build a product when you don't know if the research innovation will displace the product you just built? We had this in the early days too. So one of the simple examples was we had a model at work. And one of the most common requests was could we do a different speeds for voices. So could you have additional slider to modify the speed of how audio gets generated and how quickly it speaks? And we are very against this idea of no, we don't want to do any sliders, any toggles, we don't want to become same as previous generation of the editing suite. So instead let's solve it on the research level where it will know based on the voice exactly how it should speak with the speed. And we resisted this for I think good amount of nine months. And we couldn't solve it on the research side. And then the product was super simple solve that got all the users across. And now the approach we take in looking at this is if we think the research work will take more than three months, then the product can do anything they want to start adding other models, adding some of the extensions. Of course, sometimes the timeline is tricky to predict, but roughly the guidance we have from our internal research team, what are the initiatives we hope to ship this quarter, what are long term initiatives? And then for anything long term, you can use any other work to close that gap and make it better.
C
I guess first you kind of have to figure out if the research commitment is going to meet the timeline first and then go on to align with the product teams. That make a lot of sense as everyone is moving to San Francisco and building in person and locked in in the same space. 11 has always been building globally and having people more distributed. But you now have centers, I guess in different locations from London, Warsaw, San Francisco to New York and other places. How do you think about building this global expansion and finding talent globally versus I guess the trade offs of building in the same place.
A
Yeah, so me and my co founder are Polish. We started between Warsaw and London at the time. And I think 11 labs wouldn't have existed if we weren't starting from Europe. It's a very peculiar thing. But in Polish, if you watch a movie in Polish language, like a foreign movie in Polish language, all the voices, whether that's a male voice or a female voice, get narrated with one single character. No emotions, no intonation. As you can imagine, it's pretty terrible. And it's still happening today for most of the content out there.
C
I've had a similar experience growing up in China that we have a lot of western movies dubbed in Chinese monotone.
A
So bad, so bad. And it's like in Poland, of course, post communist country. It's a cheaper way to do it. You don't have to hire as many people. You have one monotone audiobook reading of a movie. And that was kind of where the company started. And we started initially in Europe and we realized that if we want the best people to solve what was a research problem at the time, we need to hire wherever they are. And we couldn't lock ourselves to just San Francisco or look at the west Coast. We knew that we need to find them across Europe, across Asia and bring them into the company. So we started fully remote and started looking at those people. And then on engineering, we also were very against this traditional hiring method of looking at LinkedIn, looking at traditional background and trying to figure out could we go and figure out a different method to hire people. That led to some very interesting hires. So we hired a person that had incredible open source text to speech model and was working in the call center at the same time as a recipient of the calls to make money.
C
Wow.
A
And he's now on the team, one of the most brilliant researchers we have, doing all of the data processing, but the same pattern kind of followed. And of course the early team was very distributed. And then as we started scaling so beyond 30 people, we realized that the new people joining, there's a benefit of them having a space to be next to others, to get deeper into the culture, understand what are all the projects that are happening in the company. So we started the hubs where you can go into London and Warsaw and San Francisco where you can work with others in person. And that's how we tried to marry those two. If you are early in your career, we try to hire you in the hub so you can immerse yourself in the company. If you are used to remote work, completely fine. But then if you want you can always come and join us in the hub. And that worked really well. Currently we continue hiring very untraditional backgrounds in some of the place of the company and then fusing that to very traditional backgrounds which can teach the others. And in sales, for example, we've done some of those experiments too where that combination worked really well.
C
The lesson is you can really find talent everywhere. It's just how hard and how you look for them.
A
And I think in Europe also this was an interesting1. In US, people are very keen and excited to work. And if you go for any social event, you want to talk about work. And in Europe I didn't have this feeling where it's like most people don't want to do that. It's like the cultural piece is different, but then you do have the pockets of people that actually strive it too. They just don't have the companies where they could do that in. So I feel like our team from Europe is the most motivated and passionate set of people that we are lucky to have.
C
Yeah, I can attest to that given I've met some of them. Very hardcore, very good work ethic for sure. And you have also maintained a pretty flat org structure and have people own quite laterally a lot of responsibilities. Can you talk about the rationale behind that? And I guess there was also a no title policy.
A
Yeah, so we removed titles a year ago and it's going well, it still works. And I do think that they thought a lot of AI companies kind of do it too already with member of technical staff being like the usual piece you have for engineering. And then in a lot of the go to market, you are just go to market, not VP of sales or other roles. I think it's actually a pretty common pattern. But in our case we had a small team approach where you have extremely small amount of people, usually the five to 10. And we wanted to make it very clear that every team we create those teams, you have six months to prove it. If it's proven that team will stay and continue working. But it really is that the moment you join, you can have any impact on the company. So you can have any role in that team. The tenure will not define your position in the hierarchy. If you are smart and quick and passionate, you can elevate yourself very quickly, which this really helped. And also it's a common layer to the external world where Everybody looking at 11 labs knows that we are the go to market team is go to market team. There's no positioning to the same extent. What this allows us to do is I think when we speak with a lot of our partners, with a lot of our customers, they also know that they are getting the best people always. And we can also send people to different conferences, different events, regardless of that positioning. I think the tricky thing in the flat structure is not only positives in the way we currently have, it's a set of leads effectively for the subdivision. So the research, creative work, agents work go to market, self serve and sales LED and of course ops only. That's the layer of leads. And then under that there's pretty flat small team approach across the world. But then you really want the leads to be able to carry the complexity around the team. So suggest things between one team to another if they see that there's something valuable between them happening. So I think picking Those people that can context switch between is super important. And then letting the team fully focus on that and then having, which was interesting learning where if you put a person into all the slack channels and give them transparency, they actually get frequently distracted because then they read all the messages. You can still choose not to read them, but they still do. So you kind of need to cut the access to a lot of those pieces to force the attention and that kind of works. All those small things work really well.
C
Maybe we can borrow some of that lesson too. Let's switch in gear a little bit. You're on the front line seeing a lot of the creative work, whether it's from art, music or advertising, that are starting to adopt AI tools. And in the beginning that was not the case. There was a lot of resistance and now we're just seeing the adaptation and the welcoming of using more of the generative AI tools, including AI audio. And you have done some really smart things from the marketplace payouts to working with these creative industries since day one. Actually, I remember how much you stress we have to find a way to work with them and sort of observing market shift over time. So the question is, how do you actually adapt to these changes and find the ways to work with the industry in the infancy, in the beginning, and how did you navigate some of the challenges in that?
A
So I think the first piece is actually spending time with the industry and trying to understand what are their priorities, their incentives. Of course, it's sometimes tricky, sometimes you then end up being starstruck. We had an honor and pleasure to work with Jared on some of his incredible work and learn from him on what is important and which parts of the production process you can actually use AI, which ones you want to keep where. Is it actually helpful? I think that's the super important thesis across all the partnerships in the space. In our case, we tried to figure out how to do that on the voice space, which is of course with that technology. A how will the voice acting space look like in the future? And then two, of course, to be able to cater to all those different use cases, you need such a big array of different voices, different languages, different accents, different styles. So we launched Voice Marketplace where you could create your voice and then share it. And when the voice is shared, you earn money in the return. Today we have almost 10,000 voices. We paid $10 million back to the people in the community. There's some crazy stories from the voices. One of our first voices will say a deep Spanish voice. And the magic of the technology is that the same voice now is available on all different languages in the same way. So it's 30 different languages at the time. Now it's 70, but 30 languages at the time. And we had the Spanish voice join us and it wasn't picking up on the Spain. Nobody really liked it as much. And then it picked up in an English speaking country. That same voice, that deepness and now it's our top three voice for all the use cases. So hidden messages, you can all register to our voice marketplace and maybe earn some money too. So that's, I think the second important thing is like figuring out how we can be part how we can bring the industry together to disrupt together rather than just to disrupt. And with labels, I think I'm still learning how to interact. We worked with labels Merlin and Cobalt, so fourth majors to bring their music into the music model. So we can do it in a licensed way so you can generate that and give commercial rights so you're fully protected. That was a hard process. It took us 18 months to figure out the agreement that works. And in the end I think the main thing was adding set of forcing functions or forcing timings to find effectively a trigger of like, okay, this is when we do it and we either do it together or we do it separately. And those forcing functions really help add urgency. Then we needed to move that forcing function a few times, but it still worked to a large extent to go after that and then two is of course finding the compromise wasn't easy. But then in our case working with the labels there was protecting what they are caring about. And they of course also care about how they continue doing well by their members, by their artists that they work with. So we would spend a lot of time working with their members speaking about how we think about technology, what's going to happen in the next couple of years and that really helped. So just speaking through exactly the technology, showing the examples and kind of avoiding this initial knee jerk reaction that AI is bad has been tremendous.
C
And maybe tying back to the earlier question as you are navigating this landscape, how do you think about bringing the right talent that can head and lead some of these functions? And these are mostly unknown territories of how to navigate it. Where have you been seeing success in bringing the right people here for the.
A
Spaces that are completely new to us? So this, and like legal is another example, we would always kind of bring at least one or two people that were in that space that kind of have interacted with the same parties full time in the past, but then would actually Adjust that with a lot of consulting people that would help us in a specific conversation. So in this case in music, we had music lawyers that worked very closely with us that consult across a few of them. And the good thing is that they know all the players and they effectively were this bridging gap between both of us so we could speak the same language. And then that was really helpful.
C
Yeah. And they have had a very specific taste for people that are risk tolerant enough and also understand the commercial and business opportunities to help guide the right chain of actions in each of those domains. I found that very fascinating.
A
100%. I mean, legal. I don't know how many of you are trying to find a first legal counsel or have a number of those. For us, this was, I think, one of the trickiest roles to hire for because you are hiring into the space you know very little about. And then we had the first couple of legal people that were clearly not fed. So we separated paths. Then we hired a third person. And that person came from like a number of Fortune 500 companies. And they never worked in startup space, never worked in venture. And what resulted is every conversation was pointing out the risks that we see. So anything we wanted to do was the number of risks that this could carry. And it was really tricky to work because it's like you kind of get risks, but you got that risk advice of like, okay, and this is where we should draw the line. But everything was back to decision. And now we hired a person working previously in a number of companies and don't poach them. They are increasing and they understand the risk equation a lot better, where they are not only like a counterpart to figuring out what the risks are, but also like, okay, this is what other companies do. This is what we should potentially do. And then they are like a true thought partner. And a tremendous change for sure.
C
11 labs started as more of a creator brand everywhere, from the individual creators to the creators that are building businesses. But now you have been having a lot of success moving into enterprise. Not just started from the AI agent platform, but even with the text to speech, speech to text models. How have you been navigating that transition? Because that's one of the very common place where a lot of really great consumer creator brands fell down. But you have had so far a pretty smooth transition.
A
So when we launched, we had a lot of early inbound when we started the classic plg, a lot of inbound from enterprise. And I remember speaking with a 60Z team when they joined us, where our initial take was of Course we want to be an engineering company. We don't want salespeople. We would like to reinvent that and have engineers do the sales. We did hire one traditional salesperson and one non traditional salesperson like an engineer and we told them do sales now. And that really, as you can imagine, didn't work out in this specific case. But we learned our lesson and we now do invest in. And a combination of that, it's 80% sales, 20% engineering. So still a little bit of that. But this was like super important lever of understanding who are the customers, what they care about and working deeply with them to bring it back. And then that kind of working with them was kind of opening of what we need to actually do. On the product and research side. Moon Jal from Hippocratic is here. He was one of the earliest incredible use cases in the healthcare space where they would create effectively agents that would take inbound calls that are calling the hospitals to take and schedule appointments. And beyond that they would do all the other parts of outboarding to the patients to remind them about taking medicine or reminding them about the appointment that's happening. And to be able to do that, that suddenly shifts from using a one foundational model into combining the speech to text, the LLM, the text to speech to orchestrate them together. Then the integrations you need to build, then you actually need to deploy. And they were one of the areas that was 2023. But then we've seen this repeated pattern across a number of other customers and customer experience space and many others. And we decided to invest more into helping with the entire orchestration. So instead of just doing text to speech, we can help combining our research to make this whole combination of that more fluid. But then if you are thinking about enterprise, you do need to build the combination of knowledge base inside a system. You need to help deploy that with telephony providers, with Twilio, with SIP trunking. How do you do that in a templatized and easier way? And then of course the biggest gap, that's the most common, it's easy to do a demo, but how do you actually build it to production? How do you test how you version control, how you evaluate monitor over time, fine tune over time based on the results. And all of that has been a big part and underlying all of that. And we spoke a little bit with Matt before coming here. The foundation needs to be there, which is the security, the compliance, serving the customers across that will rely on that infrastructure. That's something that we want to shine through at C11 Labs where if you are using the software, it's going to always be reliable and always the 4, 9 or 5, 9, hopefully one day will be there, which is tricky in AI space. That's the goal, of course. One obvious difference between PLG and sales is the cycle to work through and identify the right customers is much longer. And I think that's where eagerness from our internal team was interesting to observe, where you had a lot of people that didn't work in an enterprise setting and then you had other side of the company that did, and the side that didn't was very skeptic about going enterprise and waiting the six months or 12 months to results. And in the early days we needed to shield them from that information and trust us, we'll do this and it will work. But they were very skeptic. And then of course, after 12 months it worked out. But that was probably the hardest culturally of how you still keep everyone jumping on the same train.
C
That's exactly right. A lot of companies actually, at least I observed, sort of slowed down after start adopting more of the enterprise sort of product launching and like building for the customers request that started to thank you so much to delay sort of the product launches. Is that something you're seeing or is there still like a good balance of like, we still want to be able to put out demos and PoCs and early teasers quickly, but at the same time we'll get to deliver a very robust and reliable product.
A
So there are two parts. The first part is so we have a difference on the team structure and then we have a difference on the external product structure. On the external product structure, we want to ship very quickly. But of course, if you are shipping to enterprise, you want to make sure that it's stable and reliable. So we delineate very clearly what's alpha, what's not alpha, and then we go for transition through that period. And then as we work with the customers and then our partners, they can decide whether they want access to alpha in the first place. And when they do, that's clearly shown that this is an alpha product. It might not be as stable. And so they get a choice. And I think that choice has been the most important lever, like, do you want it or not? And some are incredible on doing that innovation and showing some of their work or experimenting with that work. Deutsche Telekom with John here is creating some of the incredible new podcast experiences, and that came from testing early models of turning a text into a more notebooklm style of a podcast with Incredible voices that you can select for. German speaking voices, English speaking voices that sound good. And then there's the second which is team structure piece. And that's something that we didn't do until later when we had more than 100 of us is that we delineate inside a company products that are pre product market fit and post product market fit. On the post product market fit you are working for the long term. You test and evaluate a lot before. You only deploy when that's truly ready. The pre product market fit your mission is to ship until you think we've hit the product market fit. And usually we give the six month period of proving it out. If not we kill the product and we've killed product in the past this way. But that's like the main important piece of like okay, until we know there is a big potential user base, we will continue iterating.
C
I have been able to observe some of those, I guess hard decisions in the moment, but it's the right decision later on to let go of some of the products. This is all my favorite questions. My partner Martin Casado always say companies go through three phases. There is the product phase, there's sales phase and there's a scaling phase. And given you have been through some of those phases, what has been the hardest transition for you as a CEO?
A
There is a lot of mini ones of course. I have my co founder next to me across each of those which is the. I know him for 15 years, he's my best friend since high school. So I have the most luck to have that combination. Of course you, Jennifer and all the partners to help us through those transitions which has been incredible. But I think the, the recent realization was when we are now 350 people company and of course that means our go to market team and the incentive structure around that has evolved pretty strongly. And what wasn't clear to me and now in hindsight is obvious is that in early days everybody would just operate on a passion basis. They would just operate what they think is best for the company. That's our go to market team enlarged. We realized that the incentive structure really matters. If you are building that machine and that transition where you shift from a lot of the people that are helping create that machine are part of that machine. Those incentive structures will eventually drive the behaviors which might be slightly different to what you had in mind. If you don't make it extremely clear and in some ways the quota, the commissions are effectively a lagging indicator of strategy. And then strategy is kind of leading of what will happen in the future. So you need to find a way to resolve those two together where you want to make sure the quad end commissions and the strategy that you want to drive are closer together and the disparity as close as possible. And so here for me the biggest realization was that we are becoming a bigger company because there are clear behaviors that happen based on the commissions and then two to actually resolve those we need to be very upfront in terms of making it explicit that sometimes even if commissions are just this and you think it's the wrong thing, come back to us, let's speak about it and adjust course. So now we are explicit of all our sales teams that if they are seeing a deal that let's say might be competitive in nature and our pricing table would suggest that they can go very low and earn higher commission. But they think it's wrong, it's better to come to us. We are happy to still grant commission but kill the deal and go. We had this case recently where one of our foundational level competitor came to us wanting to license our models for demos and of course the incentive would suggest that you should sell to them. But luckily we didn't. Yeah, you granted commission though.
C
Yeah. In the early days you can definitely and adjusted that.
A
Now it's in the policy you cannot sell to the foundational model companies.
C
So it's clear. Clear to all the Very clear internally. That was incredible. Mati. Thank you so much for sharing all the lessons and learnings with us. Let's give a round of applause to Matt.
B
As a reminder, the content here is for informational purposes only, should not be taken as legal, business, tax or investment advice or be used to evaluate any investment or security, and is not directed at any investors or potential investors in any A16Z fund. Please note that A16Z and its affiliates may also maintain investments in the companies discussed in this podcast. For more details, including a link to our investments, please see a16z.com disclosures sa.
Date: November 4, 2025
Guest: Mati Staniszewski – Co-founder & CEO, ElevenLabs
Host/Panel: a16z team
This episode dives into the meteoric rise of ElevenLabs, a leading AI voice technology company. Mati, its co-founder and CEO, reveals how ElevenLabs balances cutting-edge research with rapid product launches, why voice is poised to be the next major interface in AI, and how the company navigates the complex intersection of creativity, ethics, and enterprise-grade reliability. The conversation also addresses global talent strategies, adapting to the creative industries, and lessons learned in scaling a modern AI company.
Timestamps: [00:00] – [01:56]; [13:32] – [17:02]
Voice as the Ultimate Interface: ElevenLabs began with AI-generated voice, progressing to orchestrated voice agents and fully-licensed AI music creation.
Serving Diverse Use Cases: The need to support a wide range of voices, accents, and speaking styles led ElevenLabs to launch a “Voice Marketplace,” where users can create and monetize their own voices.
Impact: Over 10,000 voices on the platform; $10 million paid to creators.
“We launched Voice Marketplace where you could create your voice and then share it. And when the voice is shared, you earn money in the return. Today we have almost 10,000 voices. We paid $10 million back to the people in the community.”
— Mati, [00:00]; [13:32]
Timestamps: [01:56] – [06:45]
Fast Shipping & Small Teams: Success attributed to 20 small, autonomous product teams (5–10 people each), ensuring speed without losing quality.
Research/Product Tradeoffs: Preference to solve problems (like adjusting speech speed) through foundational research, but pragmatic about implementing product-level fixes if timelines demand.
Decision-Making: When research hurdles are likely to take over three months, product teams have freedom to patch with practical solutions.
“We are very against this idea of...sliders, any toggles. We don’t want to become same as previous generation of the editing suite. So instead, let's solve it on the research level...We resisted this…for nine months. We couldn't solve it on the research side. And then the product was a super simple solve.”
— Mati, [04:50]
Timestamps: [06:45] – [12:34]
Origins in Europe: The lack of natural-sounding dubbing in Polish media inspired the company’s founding.
Non-traditional Hiring: Early hires included people from surprising backgrounds (e.g., a call center employee who’d built a text-to-speech system).
Hubs + Remote: Blend of fully remote work and physical hubs (London, Warsaw, San Francisco) to balance deep focus and strong company culture.
Flat Organization & No Titles: Encourages impact regardless of tenure; six months to prove team efficacy; promotes lateral responsibility and quick advancement.
“We started fully remote…We hired a person that had incredible open source text to speech model and was working in the call center at the same time…He’s now one of the most brilliant researchers we have.”
— Mati, [08:16]
“We removed titles a year ago and it’s going well…The tenure will not define your position in the hierarchy. If you are smart and quick and passionate, you can elevate yourself very quickly.”
— Mati, [10:04]
Timestamps: [12:34] – [19:53]
Early Resistance Melting: Initially, creative professionals were wary of AI, but voice AI is now widely adopted in music, advertising, and entertainment.
Working with Industry: ElevenLabs emphasizes partnership over disruption—consulting with creators and learning how AI can add value.
Marketplace Payoffs: Unique voices sometimes find unexpected success in new markets.
Music Licensing: Secured deals with major labels (Merlin, Cobalt) to allow commercial rights and integration into AI models, after lengthy negotiations.
“With labels, I think I’m still learning…We worked with labels to bring their music into the music model so we can do it in a licensed way…That was a hard process. It took us 18 months to figure out the agreement that works.”
— Mati, [15:50]
Hiring Domain-Specific Talent: For new areas, pairs in-house hires with experienced consultants (e.g., music lawyers) to bridge knowledge gaps and accelerate industry alignment.
Timestamps: [17:02] – [19:53]
Building the Right Legal Team: Hiring effective legal counsel is tough; initial hires from big companies brought a risk-averse mindset incompatible with startup needs. Success came with risk-tolerant, commercially-savvy hires.
“Every conversation was pointing out the risks…And now we hired a person…who understands the risk equation a lot better, where…they are like a true thought partner. Tremendous change for sure.”
— Mati, [18:28]
Timestamps: [19:53] – [26:50]
Smooth Transition: Started as a tool for creators, but significant inbound from enterprises (especially for AI agents in healthcare, customer service).
Adapting Sales & Product: Moved from engineering-led sales to dedicated sales teams (80% sales, 20% engineering).
Enterprise Features: Built out orchestration tools, integrations, enterprise-level compliance and reliability—now a cornerstone of the offering.
“One obvious difference between PLG and sales is the cycle to work through and identify the right customers is much longer. In the early days…had to shield [teams] from that information and trust us, we'll do this…After 12 months it worked out. But that was probably the hardest culturally.”
— Mati, [23:37]
Product Management: Differentiates between pre- and post-product-market fit within teams; products not capturing a sufficient user base in six months are shelved.
“On the pre-product-market fit your mission is to ship until you think we've hit product-market fit. Usually we give the six month period...If not we kill the product.”
— Mati, [25:32]
Timestamps: [26:50] – [29:55]
Transitioning Incentives: As the company grew (now 350+), incentive structures became critical; had to align commission with strategy—explicitly encouraging teams to escalate “close calls.”
Maintaining Clarity: Set explicit rules (e.g., not selling models to foundational model competitors) and ensured open communication about misalignment.
“In early days everybody would just operate on a passion basis. [Now] incentive structure really matters. If you…don’t make it extremely clear...strategy [and] commissions…need to be as close as possible.”
— Mati, [27:16]
“We had [a competitor] want to license our models for demos…and the incentive would suggest that you should sell to them. But luckily, we didn’t.”
— Mati, [29:39]
“The voice is the next AI interface—not just for accessibility, but for creativity, immersion, and productivity.”
— Mati (paraphrased theme, throughout)
“You can really find talent everywhere. It’s just how hard and how you look for them.”
— Host, [09:12]
“Avoiding this initial knee jerk reaction that AI is bad has been tremendous.”
— Mati, [16:59]
ElevenLabs has become a pivotal player by treating voice as the linchpin of future human-computer interaction, accelerating through autonomous team structure, global hiring, and relentless R&D. Their collaborative approach to working with creative industries and strict alignment of incentive structures as they scale stands out as a model for other AI-driven startups. As voice becomes increasingly central to AI, ElevenLabs’ learnings on research, ethics, scaling, and global teamwork provide valuable lessons for any tech innovator.