
Martin Casado speaks with George Fraser, cofounder and CEO of Fivetran, about the future of data infrastructure in the age of AI. The conversation covers Fivetran’s merger with dbt, the changing role of data platforms, and why Fraser believes many companies are overestimating the threat AI poses to enterprise software. They discuss open data access, the backlash against AI agents accessing systems of record, and why businesses still need centralized data foundations even as agent-based workflows become more common. Along the way, Fraser shares his views on data gravity, coding agents, enterprise AI adoption, and how AI is changing the way software companies build and operate products.
Loading summary
George Fraser
There is a new reason to have all your data in one place, which is AI agents need context. If you don't do that, then it's sort of like using ChatGPT from before ChatGPT was connected to the Internet. Postgres, contrary to popular belief, is very old technology. It is not a good database simply because it was written a long time ago. It has a lot of technical debt.
Martin Casado
Satya has said that there's going to be the collapse of SaaS. Do you think the SaaS apocalypse is a thing and we're going to see a massive shift.
George Fraser
The bigger threat is that AI native companies will just zoom and catch up to the established incumbents and maybe be better.
Martin Casado
Like we'll actually have an HR and that HR team will onboard AIs as they come. They'll be part of teams, they'll join the slack. And in that world, these aren't software. That's actually more seats, more consumption of software. And so do you think that for enterprise agents we're moving more to these, you treat them like humans or do you think that that's too far?
Podcast Host
For years, companies built data infrastructure to answer questions about the business. Now they're building it for AI. As AGENC agents become more capable, the challenge is no longer collecting data. It's making sure the right systems can access the right context at the right time. That shift is forcing companies to rethink everything from Data platforms and APIs to enterprise software and systems of record. Martin Casado speaks with fivetran co founder and CEO George Fraser about AI data infrastructure and why the next wave of enterprise software may look very different from the last.
Martin Casado
Guest today is George Fraser, who is the CEO of fivetran. Fivetran announced the merger with dbt. So maybe to start, just give a quick overview of what fivetran does.
George Fraser
So fivetran, we've been around for a while. We've been around since 2013. Had customers since 2015.
Martin Casado
2013. 13 years.
George Fraser
Yeah, exactly. I've been doing this long enough that a slide about the past state in my own slides is the same slide as the future state from when I started. But what fivetran does is we help our customers get all of their data from all their Systems, like Salesforce, Netsuite, all their SaaS tools, their own databases into one place. Getting all your data in one place, it's not a new thing. Businesses have had the need to do this since filing cabinets. The primary reason historically that people used fivetran to get all their data in one place was to do business intelligence was to build reports about things like what's your revenue, what's going on with your sales team, what are we forecasting for this quarter, all those great things. And now there is a new reason to have all your data in one place, which is if you want to use AI agents in business, AI agents need context. And it turns out that the same data foundations that work well for business intelligence and reporting, with some additions and some modifications, actually can work really well for AI agents as well.
Martin Casado
I mean, talk about a sector of the industry which is under a lot of change because of AI. And so maybe could you give like a high level overview of how it is evolving? What are some of the considerations about the shifts in data? And in particular like we're seeing a lot of changes how vendors view their own data, how the big labs use data. So just talk a bit about what the industry is.
George Fraser
The thing about data in the context of business is it is always born somewhere else. It's always born in systems of record like Salesforce, like Workday, like SAP. And even if it's your own applications data, if you're a software company and you run your own database, the data is born in that database. And since as I said, time immemorial businesses have had the need for internal use to centralize a copy of all their data in another location, it doesn't work to just go and do all of your reporting and ask all your questions in each system individually. Some kinds of questions require you to look across the entire system. And so that is not new. However, these AI agents are new. And there has been in the last year a reaction which really started with the stock market. As we saw the saaspocalypse happen and as we saw the stock prices of all of these systems of record that I'm talking about plummet, people view them as under threat from AIs. We have seen some of these companies start to think that a great strategy for dealing with AI might be to lock it out and to say your data is our data now and you can't take it elsewhere. And if you want to use AI on it, you have to use the AI tools that we provide. Notably, just a couple weeks ago, SAP announced a new API policy that literally said all AI agent access was banned except in a way specifically approved by SAP. Now if you're an SAP user, don't panic. This is just a policy. You have contracts with SAP, those are authoritative as to what you are and are allowed to do. So don't overreact. To these policy memos, but it just shows how extreme the reaction of some of these companies has been.
Martin Casado
I just want to tease this apart because I think there's a lot of confusion on what exactly is going on. Right, so this is locking down access to the data that an agent would use instead of an app. Right. It's not access to data because you're going to train your own model.
George Fraser
That's right. Very few people are in the business of training their own models. Most people, when they want to access their own data in their own systems of record, even if those systems of record are managed by vendors, they are using it for context. They're using it in order to ask and answer questions about what's going on in the business.
Martin Casado
So the concern is my SaaS app has less value as an interface because now the agents can access the data directly and basically perform the same functions the SaaS app was before. Is that the concern?
George Fraser
I think there are many concerns. I think that.
Martin Casado
Can you just straw man the set of concerns? Because this is one of the biggest reactions I've seen in the industry in a very long time, and I'm kind of trying to come to grips. So what the actual worry is, I
George Fraser
think people are worried that their systems that they've spent many years building will simply be less valuable in a world where their users are no longer humans, but they're agents. I think they're worried that.
Martin Casado
But why isn't this just another seat? I mean, it seems like, I mean, arguably this is positive because there's gonna be more consumers of policies.
George Fraser
So agents don't need as many individual identities. When you have AI agents accessing systems that you really just need roles. You don't need the same granularity of users. If you have many product managers, each will have their own identity in a system. But if you have a product manager bot, you really might just have one role that it uses and it might have a single identity and yet do the work of hundreds or thousands of people. So there's not an easy answer like that. Furthermore, these companies have a history of having open APIs. Open APIs are a good thing. If these companies did not have open APIs, they would have been consigned to the dustbin of legacy SaaS decades ago. I mean, this is a thing that happened in the 90s, right? The evolution of open APIs and their customers have been using them and depending on them for decades. And those same APIs are the primary target of AI agents. So it is very hard to differentiate whether the Users are accessing the APIs in the same way that they always have been, or whether they are accessing them in agentic ways that may substitute for human workflows.
Martin Casado
I'm going to keep poking on this because I still think it's not a real concern. So let me make another straw man argument. So let's say you had opened up all your APIs in the 90s, which is the case. Like, why couldn't I just write a procedural app which is my own version of the SaaS, and therefore also disintermediate your SaaS? Like, why are agents somehow different than me just writing my own software or my own dashboard?
George Fraser
Maybe they're not. This may all be kind of much ado about nothing. I think it's foolish for them to close down their APIs. So you're putting me in a weird position.
Martin Casado
No, I. I do. I do too.
George Fraser
Trying to defend a position that I think is stupid.
Martin Casado
Yeah, yeah, yeah, I agree. Yeah.
George Fraser
So I think a lot of these threats are not new. Like, well, maybe they'll use programmatic access and thereby use less seats. Maybe they'll move some functionality to their own interfaces. I mean, that is a real thing that has been happening.
Martin Casado
I just want to let you know, I am old enough to remember these discussions in the 90s. The rhetoric was exactly the same, the reaction exactly said we could never open up APIs, we can never let them do this because they're going to disintermediate us. And it just turns out that if you're buying into a business process, like the operational flow of something that is set up by the company that you're buying it from, Salesforce knows how to run the Salesforce. And so whether it's an agent that's consuming it or SaaS, I would argue that there's still the value there.
George Fraser
I completely agree. And I will point out another piece of evidence for that claim, which is if you look at the budgets of real companies that are heavy consumers of software, they spend 5 to 10% of headcount on software. Software costs are immaterial in the grand scheme of things. Software, compared to everything else a typical business spends money on, is so cheap, the idea that they're going to use AI to value engineer the number of seats they have on Slack or something is ridiculous. They're going to use AI to go make their business work better in whatever it is that they're actually good at. They're not trying to take that 5% software spend and turn it into four and a half. That is not the highest, best use of AI.
Martin Casado
Famously, all the big AI labs, including Andreessen Horowitz, and we're all very heavy users of AI, like still use these SaaS tools. And so four years into this, we don't have a lot of evidence that
George Fraser
like, as do we, as do OpenAI and Anthropic.
Martin Casado
Exactly.
George Fraser
Who are both Fivetran customers. And we replicate lots of data from these very SaaS tools on their behalf into their data lakes. So if they're still using them, do we really think the company of the future is not going to be.
Martin Casado
Right? Right. But one thing we can both agree on is this is bad for customers. Right. This kind of locking down the APIs is bad for customers. So maybe talk through like a, you know why it's bad, which may just be obvious and then B, kind of your recommendation for how to manage that, like assuming that this is happening industry wide.
George Fraser
So the reason it's bad for customers anytime vendors put up walls and try to regulate data access is that you need to have all your data in one place in order to do meaningful reporting, in order to understand just what the heck is going on in your business. And, and in order for AI agents to work in the context of business. If you don't do that, then it's using ChatGPT from before ChatGPT was connected to the Internet. If you used it back then, you remember it used to have this knowledge cut off. And it would tell you, I can only answer questions that happened by in my training window.
Martin Casado
Six months ago.
George Fraser
Yeah, six months ago was when I was trained. I don't know anything after that because I'm not connected to the Internet. That is what using today's AIs is like. If you're using them in a business context and they don't have access to your business data. And so it's very important for every company who wants to do things with data to create their own data platform where they have a copy of all of their own data and it's being kept continuously up to date. And anytime vendors start putting up barriers, it just makes it harder to get that done. And the customers will still do it. They'll just work around these barriers at great cost and complexity.
Martin Casado
One of my favorite things that you've done as a company to educate the customers on this is this benchmark. Can you maybe talk through what that is?
George Fraser
Well, we have a website. Is that what you're talking about? Open Data Infrastructure.
Martin Casado
Open Data Infrastructure.
George Fraser
Open Data infrastructure.
Martin Casado
It's a benchmark like it does scoring right?
George Fraser
Yeah. So we score. The list is growing. We're trying to make it as long as possible. We score as many vendors as we've been able to catalog so far on their data access policies. So we basically score them on whether they try to charge egress charges, whether they try to make you pay for getting your own data out, whether they, whether they make it impossible to get a complete copy of your data because some vendors will do that or they'll make it just very difficult and whether they have terms of use restrictions on accessing your own data. So there's a big grid on there and it rates.
Martin Casado
Are you comfortable saying who the worst offenders are or well, or should just point people?
George Fraser
Just, just go to the website and read it. It's all very fact based and very well, well evidenced. I mean the worst offenders historically have been. SAP has always been really bad.
Martin Casado
I mean even when I was running a large business, I mean that was, I was.
George Fraser
It's interesting because they were getting better. And I've sensed in the last few years that there's sort of two camps within SAP. One of whom who regards it as. It's the customer's data. The customer's gotta be able to do what they need to do with their own data. And then there's sort of the. Who views it as SAP's data and you'll do with it what we tell you to. And then, you know, historically Salesforce has been really good with the exception of Slack where they are terrible.
Martin Casado
Yeah, I know.
George Fraser
But they've started to get squirrely about this. So it's a moving target. I am hopeful that this is merely a brief flirtation with closed data by most of these vendors and they will realize this is not a good idea, that they are at war with their own customers and it's not even going to work anyway. And we will go back to the trend towards ever more openness.
Martin Casado
You just answered my next question, but I'm asking anyways, which is like, do you feel like this is just like a repeat of the Open Data API? It'll resolve quickly. We always kind of go through this soul searching and then we will resolve back to where we were, which is like open data is the right thing. Do you think that's the path that you think is different this time?
George Fraser
Yes, I think it will be the same. That's my prediction. It will be the same path. As mentors will discover that they cannot provide inside their own platform a solution to every data problem that their customers have because they are simply so diverse. And instead they simply create a mechanism for the customers to replicate it to their data platform of choice and do with it what they will. And even if they charge fees for that, it's not the end of the world. On opendatainfrastructure.com, you really get yellow for charging fees. It's only red when you try to actually block it. At the end of the day, if you want to have a little toll, that's not the end of the world. The problem is when you start actually blocking it, there is no option and saying, oh, if you want to do anything with data, you have to come use my tools inside my walled garden, which never works because all the rest of your data is not in that walled garden and it's not going to be. And you can never create enough tools to support all the different things customers want to do with data.
Martin Casado
You know, this is probably related to this notion of, you know, or this belief in data gravity. And I mean, one thing that I've loved working with you over the years is like, you're exceedingly smart and you're exceedingly contrarian. Just so fun to like, kind of, you know, watch your opinions diffuse and more often right than wrong. And one thing you have said is that data gravity is either overrated or not real. So do you think that like a, you know, do you stand by the statement? And B, do you think that this is driving people to like, try to do these walled gardens?
George Fraser
I, I think data gravity is completely fake. I am the only person who thinks this.
Martin Casado
I think you're the only person that
George Fraser
says, if you want, if you want
Martin Casado
to see evidence, maybe describe what data gravity means. People use the term a lot, but like, I don't think they understand the implications.
George Fraser
Data gravity is the idea that business data is so large that it's very expensive to move around because of egress charges of cloud vendors. And if you want to see, and that therefore it's very important that you choose a physical region in the world where all your data is going to live in a specific region of a specific cloud and then you build all of your data consuming services in that same location. Or you can partition, especially because of egressvs. Well, this is aversion. The term data gravity does get used to mean multiple things. This is one particular incarnation of the idea of data gravity. And this is the one that I am saying is, Vic, that egress charges are so important. And if you want to see evidence against this, come look at the Networking dashboard of FiveTrans, various AWS and GCP accounts. You will be astonished. Despite replicating huge data sets for thousands of companies, we have 7,000 customers of size and thousands more little ones. The amount of data being moved at any given time is tiny. And the reason is that we're doing change data capture. You can have a huge data set, but if you just replicate the changes, the changes are always much smaller than people think. And I think that a lot of this idea of data gravity came from dumb data pipelines that people wrote where they would copy their entire company's data sets out of their database every day, once a day at midnight. And so they just had this crazy read amplification. You know, they were just repeatedly copying the same data over and over, and it gave them the impression that they had so much data, but they really don't.
Martin Casado
Maybe a good argument not to roll your own on these things anyways.
George Fraser
Yes, yes. This is what happens when people roll their own data pipelines is they fall back on these patterns that are easy to get right, like that pattern I just described. It's extremely easy to get a correct replica just by copying it all over and over, but very expensive to operate in the long run.
Martin Casado
Maybe before we leave this topic, so let's say there's a CIO listening to this right now, and the CIO is like, oh, how do I navigate these kind of uncertain three to six months? Like, what kind of leverage do I have with these SaaS passengers as they're closing things off? Do you have any sort of guidance for them?
George Fraser
Yeah, I think, number one, you should insist on having a copy of all of your own company data in a data lake that you control. Don't let go of that. For any vendor, you have a lot of leverage. These vendors actually have a lot of obligations to let you do that. The reason why they get away with blocking people is simply because people don't fight. So pick fights and write it into your. If you have big contracts with vendors and you're redlining your MSAs, write language guaranteeing your own data access into those MSAs.
Martin Casado
Are you seeing those show up now? Actually, I hadn't even considered that.
George Fraser
We have model language on opendatainfrastructure.com that we recommend that you incorporate into your MSAs. And even if you don't get it, just by asking for it, you are sending a signal. So if it's a $10,000 contract, don't, don't do it. But if you're, if you're signing, you know, 500k million dollar contracts insist on data access in your MSA and you will find surprisingly often that you get it.
Martin Casado
I want to talk a little bit about agents now. So I feel like we've kind of gone through multiple phases already in agents. So the first one is they're just purely treated as software. They're like, it was almost like search, like take all your data, put it into a data lake and then you have an LLM that has access to. And then we go to that and it's like enterprise search all over again. And then we went to like the agents like openclaw. And then that model is like a personal agent, but it kind of like was part of you. So you'd give it access to your email and you would give it access to like your API keys and give it access to your accounts. So it was like part of you, an extension of you. Now we're seeing.
George Fraser
I actually set up openclaw.
Martin Casado
That's great. I use Nano Claw.
George Fraser
Well, I set up openclaw and then I turned off of it onto Nano Claw because it's such a monstrous piece of over complex software. And then I actually turned off nanoclaw onto Nanobot, which is what I've stuck with. But I use it to manage my tennis team. But it has its own identity.
Martin Casado
Okay, so this is what I was going to say.
George Fraser
It has its own email.
Martin Casado
So this is, I think we've all come to this conclusion which is like I don't want it having access to my email, I want it. So I also, I also have the Mac Mini, I run the Axi now I'm working with somebody like build a harness. But like they're all the same, right?
George Fraser
I run it on a VM.
Martin Casado
Yeah, perfect. Yeah.
George Fraser
It's got its own WhatsApp number, its own email address.
Martin Casado
So I got own phone number, its own email address. And then, you know, now as we think about a 16Z, we're actually thinking about like, you know, why don't, why don't we just treat all these agents like this, like we'll actually have an HR and that HR team will onboard AIs as they come. They will train them, they will show them the access to the documents that you need. They'll be part of teams, they'll join the slack, you know, just like humans do. And in that world, and we touched on this a little bit before, but in that world these aren't software. That's actually more seats, more consumption of software. And so do you think that, that for enterprise agents we're moving more to these, you treat them like humans or do you think that that's too far?
George Fraser
I think it is a intermediate form. I think the reason this works well is because you can slot it in to the existing workflows without having to refactor the whole universe. So in my example of my agent that manages my USCA tennis team, it can email with the players, it can go to the USTA website, check the schedule, make lineups, check availability, and it works well having its own identity because it can slot into all of these existing workflows that were designed for humans. At fivetran we have an AI agent that helps respond to support tickets that goes and inspects the logs, inspects the code. It uses all the context that we centralized with fivetran to find out what is going on with this customer, what might be the solution to this problem. And it drafts responses and right now it slots into the system a lot like a person would. But we are working on making the whole thing just a closed loop pure AI system where it will only have one identity. There will just be the the connector Troubleshooter, Borg, Hive Mind.
Martin Casado
And there may be, but let me give the argument for the multiple. So why do I use a Mac Mini for my agent, right? I mean there's a couple reasons. One of them so it can access imessage because there's no programmatic way to access imessage. And so like you know, it has a desktop. Another one is like everybody's moving headless. Like Salesforce is doing their headless thing and there's headless browsers. But it turns out, let's just talk about browsers. Like if you have a headless browser, all of the anti scraping software kicks in and then like it's not as functional. So it's actually much better to just give it a fully functional version of Safari. So you could argue that the interfaces that have evolved over the last 30 years to deal with unpredictable users that know how to use computers is the UIs we have today. And actually the simplest thing to do, rather than try to rewrite all of that stuff is just to give these agents that have been trained on human data access to full end to end systems.
George Fraser
I don't know the systems that I've worked on, browser use has never been necessary. Browser use has a big cost which is very slow and it consumes a lot of tokens.
Martin Casado
Yes.
George Fraser
And I have found that in my tennis example, the USDA website does not have any anti scraping provisions.
Martin Casado
And you're not trying to read from LinkedIn or Zillow. Try that if you.
George Fraser
Yes, and so I'm just using. I'm using just like Python browser automation. I'm actually using Selenium. And then I'm actually. There's actually a skill that emits just exactly what you want to know so that the agent is not reading HTML and consuming all those tokens all the time. And then at FiveTrans, for example, we are working right now on a Salesforce administration agent to integrate, to basically do continuous integration of small changes into our Salesforce, which is very labor intensive right now. And it also does not use browser automation because the Salesforce CLI we found does everything we need to do. We'll use browser automation if we need to, if we come to something that can't be done with the Salesforce cli. But to their credit, the Salesforce CLI is quite comprehensive. Pretty much anything you can do in the UI you can do with a CLI command. And the agents seem to already know how to use it.
Martin Casado
So can I do a $1 bet with you when we're on this podcast in five years? So I think the majority of use for agents in five years is going to be the same interfaces that humans are using, just because it's a long tail of integration has already been solved and like all the protections and all the sharing and everything else. And would you say the majority would be through APIs that are kind of more of a traditional computer software?
George Fraser
Yeah, they'll just hit the APIs.
Martin Casado
Yeah. What do you think about things like, you know, these, these technologies that mediate dao, like mcp, which have emerged to kind of try and solve that problem. Do you think there's a future for those? Or do you think that those just give way to like strict tool API usage?
George Fraser
You know, in theory it seems like they're an unnecessary layer because these agents are great at calling APIs and calling command line tools. So why bother having this other layer? In practice, when you sit down and actually try to build systems, MCPs do solve important problems, particularly authentication and just like discoverability of what's available. So even though you can say kind of from first principles that maybe they shouldn't exist, when you actually sit down and write a real system that's accessing context, you almost always end up sticking an MCP server into it.
Martin Casado
And this is authorization. Authentication, authorization and discoverability. Like the fact that these things exist.
George Fraser
Yeah, and there's just also a lot of little affordances in the AI tools that are built around mcp. Like User granting authorization for specific tasks are done at the tool level in mcps. And that's just a rule that's been baked into the harnesses that ends up working well for a lot of situations. It also works badly in some situations. But the tools on the consuming side have started to grow around mcp and thus even if maybe theoretically you don't need it, I think in practice it's. It's taken hold.
Martin Casado
The thing that I find so strange about this is like, even the tool use itself, like, you could argue that, like, as smarter models come out, they could build better tools anyways. And so sometimes I wonder if, like an agent should just be the most minimal thing ever. Like, it manages like durable state, it manages compute. And then, you know, you run like, whatever, like the Anthropic SDK or the OpenAI SDK, and then you just tell it to build its own tools. Like, you know, build your connection to this.
George Fraser
Well, that's how Nanoclock, that's how nanocloud works.
Martin Casado
No, it's exactly how it works right now. But you could argue that there's so much money being poured into these foundation models, like tens of billions of dollars maybe become hundreds of billions. So the most intelligent thing at any point on the planet is one of these models. So why would you use an old tool that could build a better tool?
George Fraser
Yeah. So I mean, nanoclaw, for those who don't know, is a personal AI agent. Sort of like openclaw, except the way it works is you fork the repo when you start up and then you just sort of vibe code it into whatever you want.
Martin Casado
Yeah, whatever you want, exactly.
George Fraser
And the problem I encountered is it sort of went awry and I didn't really want to go and troubleshoot all of the details.
Martin Casado
I debug Nanoclaw.
George Fraser
Exactly.
Martin Casado
Curse. So I'm in there.
George Fraser
Like, I should go through it at some point. I spent enough time debugging nanoclaw that I was like, I want something that has more separation of concerns, where there's sort of an agent over here that has an API and it works the way it works. And then I just write like, skills and stuff on top of that. Ironically, even when I did that, I ended up having to add things to Nanobot because, like, it couldn't differentiate WhatsApp group messages from DMs. And I think I actually have a PR against nanobot because of that. So I sort of ended up back in the same place a little bit. But I take your point. Like, if they get smart enough they may, you know, they may just build their own intermediate abstractions as needed.
Martin Casado
Yeah, yeah, that's right. As you go. Just going to maybe wrap this bit up. I mean, you know, Satya has said that there's going to be the collapse of SaaS. And you know, we know databricks is trying to rebuild a lot of SaaS on top of their platforms. I mean they've been very public about that. So do you think we're going to see like a massive, like, do you think there's SaaS Apocalypse is a thing and we're gonna see a massive shift or, and like, you know, it's gonna be agents and it's gonna be on new types of infrastructure that they run. Or do you think that, you know, SaaS is fine, the markets are overblown. Satya is, you know, wrong.
George Fraser
Well, to give credit to the public markets, I, I think that they are accurately. I mean whether the magnitude is right, I cannot say, but the direction is right. There's a lot more uncertainty embedded in all these SaaS companies than there was a year ago. And that's reflected in the changing in the decline prices. But I don't think it's for this. I don't really buy into this reason that all the SaaS categories are going to disappear and be replaced with five coded software. I think there will be some, but I think the bigger threat is simply new companies coming along. It is just so much easier to write software that AI native companies will just zoom and catch up to the established incumbents and maybe be better in some way.
Martin Casado
You know what's so interesting? I mean you just mentioned that about fivetran. But if you look across the companies we work with that are traditional companies and I actually would not include you in that, but I mean, listen, both of the labs are your primary customers, but if you look at, across the traditional companies, you don't actually see in the data that they're slowing down like 5 trans doing rate. It's actually the company's accelerator rating. And so how do you reconcile this high risk concern that you just voiced with the actual, the business? Is it more just kind of existential angst?
George Fraser
No, it's like X risk. You know, some, some, some new way of doing what you do comes along that's dramatically better. And this will happen to some companies.
Martin Casado
I know, of course it will.
George Fraser
Right, but and, and more, we're four
Martin Casado
years into this at this point. You know, I don't know, I just feel like you do, you, you, you can't derive this from the data as far as I can tell. And like maybe it's coming or maybe
George Fraser
it's not even really in the public company data.
Martin Casado
No, it's not, no across the board.
George Fraser
It's when you start to see like less than one net dollar retention then you know it's come.
Martin Casado
Yeah, yeah, yeah, for sure. Yeah. And maybe it will, maybe it's in the future. But like we've been having this conversation. I remember, I actually remember when I was, when you put in the call and it was a year ago or so and you're like, Martin, this AI stuff is like really real and you know, like you can kind of code connectors with it pretty good and it's pretty good and it's coming. But that was actually quite a while ago when you put in that the company's done fantastic since then. So it could be the case that like rather than someone trying to redo an existing company that you know, has figured out like a long tail of stuff, they'll go work on different problems that are more suitable.
George Fraser
So we have, you know, in our particular case we have been trying ourselves to use AIs to build data replication connectors, which is the core of what we do for years since GPT3. And they continue to improve in terms of what they can put out. They still do not discover this long tail of complexity. It really surprises people how difficult it is just to make an accurate copy of a system and keep it up to date. And now we are actually starting to see new capabilities inside FiveTrans to push the bounds of quality, particularly quality, even further, like completeness of coverage of the sources and the correctness of replication. You can imagine how you can use AIs to do that more comprehensively than you ever could with human beings. And so I think in addition to the sort of, you know, the AI threat, it is getting closer, but it's still, I think, a way away from what we do. We're actually starting to see the opportunity pull us forward. So we're starting to get better at our own core business by leveraging AI internally in extremely non obvious ways that I don't think anyone else has discovered yet.
Martin Casado
Can you talk to those or is that trade secret?
George Fraser
Well, at the end of the day, what's going on inside FiveTrans is just this crazy mass troubleshooting effort that never ends.
Martin Casado
Wait, that sounds like every startup ever.
George Fraser
Well, we have it is, but it's the breadth of it is much larger for us because we have 750 connectors to different systems of Record everything from Oracle to SAP to Qualtrics to, you name it, they all have different idiosyncrasies. And you only discover these idiosyncrasies when real customers bump into them and they show up as performance problems, correctness problems and failures. And you know, the way we have always solved it is I always like to say the trick is there is no trick. It's just a lot of effort behind the scenes. And it's kind of an economic trick. We only have to fix every bug once and then every customer who uses that connector benefits from it. But you can imagine how you can use AI coding agents, which are basically an infinite supply of junior engineers. That is a particularly valuable tool for this kind of problem. And the details of putting that into practice turn out to be quite tricky. But we've really, especially the last couple months, started to see it work and started to see improvements at scale. Many, many, many small improvements. Start the, we've seen the flood start to come and I think you'll see the quality and reliability of fivetran take yet another leap this year because of that.
Martin Casado
You know, you're in this very unique vantage point because you have the big labs as customers like OpenAI and Anthropic or customers. So you know, they are AI native. They're at the forefront. Do they use Fivetran differently than traditional enterprises or anything that the enterprise can learn from that? That.
George Fraser
No, they're use cases are very typical. They use fivetran to replicate data from lots of different systems of record into, into a centralized data lake. And they do analytics with that. They feed that as context into their own internal AI workflows. So they, they have built data foundations that look very much like the data foundations of many other companies. The data, the, the system at Anthropic, one of the people who helped set them up was a consultant who had set up fivetran and DBT at many other companies. So their data platforms looked very typical. And I think this is a very important message. If you are thinking about data foundations for AI, do not make the mistake of thinking you need to build some exotic new system as a data foundation. For the right data foundation for AI is probably the one you already have. If you have a reasonably modern data platform, something like Snowflake Databricks or BigQuery or maybe even you have transitioned to an iceberg data lake with those compute systems running on top. That is a great foundation for your context for AI as well.
Martin Casado
You know, there used to be this idea and again we've Touched on in the context of fivetran, but more broadly, there's this idea that AI commoditizes infrastructure. Broadly. Right. And so the idea was like, well, it can write anything. The opposite seems to be true. More software is being written than ever before. The software is actually pretty buggy. It needs kind of stable infrastructure below it, you know, and so most infrastructure companies have seen a lift as a result of this. You know, in your, in your sense, is this transitory, like the eventual AI consumer infrastructure is coming? Or do you actually, I mean, let me just give you my, my quick view on this, which is building rock solid software that you can operate for long periods of time is just not what AI is best at and you're probably better spent focusing it on other things. But is my view blinkered in how powerful it's going to get over time?
George Fraser
No one knows how powerful it will get over time. I mean, the nice thing about that is that if it gets sufficiently powerful, all these questions become sort of moot because we'll just be living in a post scarcity world. But I think if we look at the present day, I think it is mostly true that AIs are just creating more demand for infrastructure and not commoditizing it at all. You can think of infrastructure as having layers and at the bottom are like data centers. And then you go to cloud vendors like aws and then you have systems like convex sort of serverless platforms that try to make the cloud vendors easier. And then you even have, you know, systems that sit at a higher level of abstraction than that, which could include, you know, databricks, a lot of their businesses hosting notebooks. Right. I think that last layer is the one that is, is threatened by AI.
Martin Casado
Yeah. Consumption layer.
George Fraser
Yeah, yeah. AI is, is quite good at navigating slightly more complicated infrastructure. So if you have an AI agent, maybe you don't really need that very most user friendly layer. You can drop down to the next one and use that.
Martin Casado
Yeah. I mean you could argue that whenever the consumption layer is up for grabs, which also happened with the Internet.
George Fraser
Right.
Martin Casado
Like you kind of went to different places to go do things, like it changed the ui, like it changes a bunch of stuff, but like the core infrastructure stays in place. Right. Like you still have operating systems, you still have chips, you still have databases. And like you did, they kind of evolve over time rather than they get replaced.
George Fraser
Yeah. And maybe you peel one layer or maybe you peel three.
Martin Casado
Exactly.
George Fraser
You're not going to peel it all the way back down to glass.
Martin Casado
Yeah, that's Right, let's talk on. Let me actually just do a quick time check here just because I just enjoy talking to you so much. We just talk forever. Okay, let's change topics a bit to the DBT merger.
George Fraser
Yeah.
Martin Casado
So you acquired census in 2025, and then Tobiko data and SQL mesh and then you sign with DBT Labs. And so, I mean, this has always been a space that's been relatively acquisitive, but I would say for the new style companies, Fivetra has been the most acquisitive. So maybe can you talk through the strategy and the plan or was this ad hoc? Is there some grand strategy?
George Fraser
Well, I am the child of an investment banker, so maybe I'm just realizing my destiny.
Martin Casado
Not just investment bankers, pe. Right.
George Fraser
Well, my brother did PE and my cousin and a bunch of other people in my family.
Martin Casado
There you go.
George Fraser
But not my parents. My mother was a commercial banker and my dad was a M and A investment.
Martin Casado
Oh, there you go. Okay. M and A Wells. Yeah, yeah.
George Fraser
But in the between M and A.
Martin Casado
Okay, I see. But anyways, listen, for a startup and having watched you do it has actually been very impressive to watch you run this strategy.
George Fraser
Well, but seriously, it was not something we set out to do. Fivetran does not have a corp dev function. I felt with every single.
Martin Casado
I never really thought about that. That's true.
George Fraser
Yeah. I've always felt that any acquisition or merger and the first big one was really hpcr.
Martin Casado
I remember very well.
George Fraser
It should feel like it's for these unique reasons, and it feels like it's the last one you're ever gonna do. And it's not gonna be the last one you ever do, but the reasons to do it should be really, really strong. You shouldn't. Nonetheless, we have found these strong reasons several times. I think the DBT one is a great fit. These are two products that have historically almost always been used together and they kind of go together. A fivetrain is the tool that gets all your data in one place. DBT is the tool that you use to organize it and turn it into a model that reflects the particular details of your. Your business. And then that is what feeds into all of the data consumers.
Martin Casado
You know, you've actually said publicly that DBT is going to be one of the biggest beneficiary of coding agents. Can you kind of pencil that out a bit?
George Fraser
Yeah, yeah. So there's this great.
Martin Casado
I don't even really know what that means.
George Fraser
Well, I think there's going to be way more usage of DBT I think coding agents are going to write tons of DBT models.
Martin Casado
I tell you that. We're actually already seeing that. I mean, that's.
George Fraser
Yeah. And it's going to be a great beneficiary. There is this great, great quote from Dykstra, I think, which is Dijkstra's algorithm. Dijkstra, that. Yeah, that computer code should be seen as a means of communication between humans and only incidentally as an execution format for computers. And nowhere is that more true than in SQL queries, in DBT projects. It is a great way to express. These are the rules of data at my company. And even if it's being written by AIs, you still want to have that artifact. That is an executable documentation of how your business works.
Martin Casado
All right, so you have the pleasure of being the CEO of a relatively large company during the AI wave. You've gotten back to writing coding, which a lot of us have. You're running a lot of experience. Like you mentioned experiments. Your nanoclaw experiment, though, you're now, now running what?
George Fraser
Nanobot.
Martin Casado
Nanobot, that's right. So how much of this is, you know, George, the scientist, the techie versus, like you actually do this like pragmatically useful for the CEO of a large company?
George Fraser
Yeah, I don't know if it's a good idea. I just can't resist. And coding agents are great for CEOs who want to write code on the side because they work sort of asynchronously. So you can have a lot of things spinning in the background. I have a lot of projects going right now.
Martin Casado
Can you name them mean?
George Fraser
I mean, I have things I am just doing as hobbies, like, like the system for managing my tennis team. I'm working on a little on a. On a tennis statistics machine vision app. But then I have many things at fivetran. They're all experimental proof of concepts that I share with people. And we talk about there's a. A potential like Nano Data Lake catalog that I have going that attempts to. If you. When you use data lakes, you have to adopt this additional service called a catalog. And it's the answer to the question, could we make the catalog invisible? So that's an example I'm working on just for the hell of it. A from scratch, classic OLTP SQL database. What I think that sounds like there is an. It's crazy. It's crazy, right? But the whole point of the project is no, that'd be not distributed. What it attempts to do is to be like SQLite, except S3 is the backing store.
Martin Casado
Oh, that's cool. That's a great idea for it.
George Fraser
Because when you build AI workflows, you have this need for zillions of tiny databases. And it's a proof of concept. It's mostly an exploration of like, could you, with sufficient AI coding, just take on something absolutely ridiculous? If you're hearing this and you want to work on this and you are an expert in database.
Martin Casado
Is it public? Devopublic, GitHub?
George Fraser
I don't, I don't. But if you want to work on this, come talk to me and maybe you can come do this at fivetran. You don't even have to use my proof of concept. I think there's actually a real opportunity right at this moment. I really lament that we're sort of stuck with Postgres forever. Postgres, contrary to popular belief, is very old technology. It is not a good database. Undergraduates writing class projects write better databases than postgres, not because the people who built Postgres were not smart, but simply because it was written a long time ago. It has a lot of technical debt. And I really think the world should create a new operational database rather than just just endlessly repackaging postgres.
Martin Casado
But that's an amazing take.
George Fraser
Yeah, that's another one of my contrary. Postgres is bad. Actually, I should also say.
Martin Casado
No, no, no, even better. Like an undergrad in a database course is writing better databases than underlying.
George Fraser
In many respects, the storage engine that you would write as an undergraduate in a database course is better than Postgres. Heap storage engine. Postgres storage engine. I don't think the creators of the people who, who promote it today would admit it is not a good design. It's been patched up in a lot of ways. But the Postgres Store agency.
Martin Casado
I'm not going to digress on this a million questions. But I will say, just like this conversation, do you ever find this becomes a bit of a distraction? It is so dazzling and so fun and so interesting to work on these things. And you're like, oh, I should be doing that one on one, but instead I'm here in my office.
George Fraser
No, I think about that a lot and I make sure you gotta keep it at bay because it is like, you know, there is a danger of Claude psychosis, which is a term that I love, but you can just get sucked in. But you know, I have a lot of time. I don't have any kids, so I have a lot of time. I mean, people with children Will say, it's like having a whole nother self
Martin Casado
if you don't trust me. And I do. Yeah. Yeah. I mean, one thing I do really appreciate about you as an executive is this, you as a founder is you're actually quite reflective. And one of the mental exercises that you've been doing as long as I've known you was, like, pretending. If you were a new CEO brought in by the board to fix fivetran, like, what would you immediately unwind? Which, by the way, always happens when you bring in a CEO. And I know you kind of do this mental exercise. So is there anything recently that you've thought, like, boy, if I was brought in to, like, run things, I would
George Fraser
change this many things? That's an exercise I do regularly. I'm trying to think of a recent example. I mean, a not so recent example is I did do some things to try to simplify pricing. I think those have been successful. They were painful for the company because they mostly cut prices for small customers. Another exercise I do is I ask myself, what should other CEOs do? It's hard. It's a trick to get yourself to do things that are big and scary. So in the DBT merger, when I was reflecting on that, on whether that was a thing we should seriously consider, one of the tricks I use is I ask myself, what should start do sort of as the CEO of Snowflake, who we've worked with for a long time? It's a good mental exercise.
Martin Casado
Oh, yeah, that's right.
George Fraser
And then I just go do that. And that was, like, a clear answer in my mind was like, merge with dbt. Absolutely. By that framework. Even though it seems very big and scary, when you imagine, is this a good idea for someone else, then you can kind of get there.
Martin Casado
Yeah, maybe just kind of a softball cliche. Final question. But, you know, you do have, like, the unique perspective of managing a CEO during this transition. So, first, what are you most kind of existentially worried about? And second, what are you most excited about?
George Fraser
Well, I think the thing I most worry about is just, you know, that the. At some point, the coding agents will get so good at writing connectors that people will just shift to diy. I think that's a real threat to fivetran. I think some different businesses are more and less threatened by, like, maybe the customers will just vibe code it themselves. That is a thing I worry about with fivetran. And we will find a way to thrive in that world if we get there. And we will provide the tools that you use to do that, if that indeed becomes possible, even if it comes at a short term cost to ourselves. But I do worry a lot about that. And then the biggest opportunity I think is, is that AI is just a whole new set of things to do with data. The need for getting all your data in one place, organizing it, is so much greater now than ever before. I think that there's a whole set of tools that people are going to need on the other side of that data platform, and I think we, and especially we and dbt are perfectly positioned to provide them.
Martin Casado
Amazing. Well, it's always a pleasure to have have you. George, thanks for coming.
George Fraser
Good to see you. Martin.
Podcast Host
Thanks for listening to this episode of the A16Z podcast. If you like this episode, be sure to like, comment, subscribe, leave us a rating or review and share it with your friends and family. For more episodes go to YouTube, Apple Podcasts and Spotify. Follow us on X16Z and subscribe to our substack at a16Z substack. Thanks again for listening and I'll see you in the next episode. As a reminder, the content here is for informational purposes only, should not be taken as legal, business, tax or investment advice, or be used to evaluate any investment or security, and is not directed at any investors or potential investors in any A16Z fund. Please note that A16Z and its affiliates may also maintain investments in the companies discussed in this podcast. For more details, including a link to our investments, please see a16z.com disclosures.
Episode: AI Agents and the Fight for Customer Data
Date: June 5, 2026
Host: Martin Casado (Andreessen Horowitz)
Guest: George Fraser, CEO & Co-founder of Fivetran
This episode dives deep into how AI agents are transforming the landscape of enterprise data infrastructure and software, particularly with regard to access, ownership, and centralization of customer data. George Fraser (Fivetran CEO) sits down with a16z general partner Martin Casado to address the evolving tensions between AI-native companies and traditional SaaS incumbents, the recent trends toward API lockdowns and "data walled gardens," the myth (or reality) of data gravity, and why data centralization is becoming even more critical in the AI era. The conversation also touches on pragmatic strategies for CIOs, the future of AI agents in enterprise settings, and Fivetran’s own recent acquisitions and internal AI usage.
AI Needs Context:
Fraser opens with a potent reframing: while data centralization traditionally supported business intelligence (BI), the emergence of AI agents now makes unified data access essential (“AI agents need context. If you don’t do that, it’s like using ChatGPT before it was connected to the Internet.” – Fraser, 00:00, 10:28).
Fivetran’s Role:
Explains how Fivetran has evolved from enabling BI and reporting (“what’s your revenue, sales team, forecasts,…”) to also enabling the next wave of AI-powered use cases by centralizing data.
“AI native companies will just zoom and catch up to the established incumbents and maybe be better.”
– Fraser (00:26)
Vendor Anxiety about AI:
Recent industry moves (notably SAP’s restrictive API policy) reveal incumbent fears about losing the value of their applications if AI agents can extract and act on data directly.
“Some of these companies start to think that a great strategy for dealing with AI might be to lock it out… This just shows how extreme the reaction… has been.” – Fraser (03:17–04:34)
Customers vs. Vendors:
Fraser unequivocally calls such API lockdowns “bad for customers” because centralized, up-to-date data is both essential for AI context and a basic right for businesses that own the data:
“If you don’t do that, then it’s using ChatGPT from before it was connected to the Internet.” (10:28) “Anytime vendors start putting up barriers, it just makes it harder to get that done. The customers will still do it. They’ll just work around these barriers at great cost and complexity.” (11:20)
Historical Parallels:
Both hosts recall API debates from the 1990s and argue similar fears proved unfounded—open APIs ultimately won because customers demand flexibility.
Software Spend is Small:
Fraser notes that in most enterprises, spending on software is a “rounding error” compared to headcount and other business costs.
“Software costs are immaterial in the grand scheme of things… The idea that they're going to use AI to value engineer the number of seats they have on Slack or something is ridiculous.” (08:54)
AI Doesn’t Necessarily Reduce SaaS Consumption:
Even heavy AI users (OpenAI, Anthropic, Andreessen Horowitz) still rely on SaaS tools. For now, AI is more likely to augment or increase software consumption than cannibalize it.
Evaluating Vendor Openness:
Fivetran’s “Open Data Infrastructure” initiative scores vendors on their data access policies (egress fees, data completeness, contractual/API restrictions).
“SAP has always been really bad… Salesforce has been really good—with the exception of Slack, where they are terrible. But they've started to get squirrely about this.” (12:44–13:29)
Trend Watch:
Fraser hopes current API lockdowns are a “brief flirtation” and predicts vendors will return to openness because customers' needs are too diverse for monolithic, vendor-only solutions.
“Data Gravity is Completely Fake”:
“You can have a huge dataset, but if you just replicate the changes, the changes are always much smaller than people think.” (16:10)
Advice: Don’t Build Your Own Pipelines:
Inefficient legacy pipelines exacerbate the “data gravity” illusion by copying full datasets unnecessarily.
From Software to Team Members:
Casado and Fraser share how their own AI agents now have separate emails, phone numbers, and user accounts—treated just like human team members.
Interim Phase:
Fraser: Right now, giving AI agents distinct identities is a practical way to plug them into workflows designed for people (e.g., support ticket responses, HR onboarding).
“It can slot into all of these existing workflows that were designed for humans.” (21:29)
Will Agents Use UIs or APIs?
Casado bets that in five years most agents will still use the same interfaces as humans due to the long tail of integrations; Fraser bets agents will primarily use APIs.
Mediator Layers (MCPs):
MCPs (Middleware Control Protocol/Platform): Even if in principle tools should be unnecessary, in practice they help manage authentication and discoverability, so “have taken hold.”
Coding Agents Creating Their Own Tools:
As agents get smarter, they may just “build their own intermediate abstractions as needed,” further blurring any lines.
SaaS “Collapse”?
Fraser: There is reasonable market uncertainty but doubts about a sudden SaaS apocalypse. More likely the threat is “AI-native companies will zoom and catch up.”
“I don’t really buy into this reason that all the SaaS categories are going to disappear…” (29:23)
Incumbents Accelerating:
Despite “existential angst,” Fivetran and other established players aren’t actually seeing declining demand. The AI “threat” is real but not yet visible in business metrics.
AIs as Mass Troubleshooters:
Fivetran now uses internal AI “coding agents” to fix connector bugs at scale across 750+ supported apps, leveraging AI as “an infinite supply of junior engineers.”
“Putting that into practice…we’ve really, especially the last couple months, started to see it work and improvements at scale.” (33:09–34:53)
AI-Native vs Traditional Use Cases:
Even OpenAI and Anthropic use Fivetran in “very typical” ways—centralizing data from diverse systems for analytics and AI context.
Modern Data Foundations Remain Valid:
Companies don’t need exotic new infrastructure for AI: existing data lakes/warehouses are still solid foundations.
“AI is quite good at navigating slightly more complicated infrastructure, so… you can drop down to the next [layer]… You’re not going to peel it all the way back down to glass.” (38:16–38:58)
Not a Grand Plan, But Strategic Fit:
Fraser discusses Fivetran’s notable acquisitions and the merger with DBT Labs:
“These are two products that have almost always been used together and they kind of go together.”
DBT as a Coding Agent Beneficiary:
Agents will write “tons of DBT models,” turning DBT into the “executable documentation” of company data rules.
“Even if it's being written by AIs, you still want to have that artifact…” (41:30–42:24)
CEO as Active Tinkerer:
Fraser still codes for fun (AI tennis team manager, a prototype SQL database for S3). Coding agents make it easy for execs to tinker asynchronously.
Reflections on Focus:
The dangers of being sucked into endless AI tinkering (“Claude psychosis”) are real, but having time enables this play.
Mental Exercises:
Fraser regularly imagines what he’d change if he were a turnaround CEO. This reflection led to major decisions like the DBT merger.
Biggest Worry:
AI coding agents getting so good at writing connectors that companies do it themselves, threatening Fivetran’s raison d’être.
Biggest Opportunity:
AI vastly expands use cases for clean, centralized, organized data. The need for data platforms is greater than ever—Fivetran + DBT well-positioned.
On API Lockdowns:
“Anytime vendors start putting up barriers, it just makes it harder to get that done. And the customers will still do it. They'll just work around these barriers at great cost and complexity.”
— George Fraser (11:20)
On Data Gravity:
“I think data gravity is completely fake. I am the only person who thinks this.”
— George Fraser (15:37)
On SaaS Value:
“Software costs are immaterial in the grand scheme of things. The idea that they're going to use AI to value engineer the number of seats they have on Slack or something is ridiculous.”
— George Fraser (08:54)
On Future AI Agents:
“You can imagine how you can use AI coding agents, which are basically an infinite supply of junior engineers. That is a particularly valuable tool for this kind of problem.”
— George Fraser (33:28)
On Legacy Databases:
“Postgres, contrary to popular belief, is very old technology. It is not a good database simply because it was written a long time ago. It has a lot of technical debt.”
— George Fraser (00:00, 44:47)
On Reflection as a CEO:
“If you imagine, is this a good idea for someone else, then you can kind of get there.”
— George Fraser (48:14)
Spirited, technical yet practical, and peppered with healthy skepticism and humor. Fraser is notably contrarian and direct (“Postgres is bad…data gravity is fake”), while Casado grounds the conversation with market context and personal anecdotes of deploying AI agents. Both maintain a pragmatic, builder’s view—focusing on what’s actually working now and where the real risks and opportunities lie.
This structured summary captures the breadth and practical depth of the episode, making it accessible and actionable for listeners and non-listeners alike.