wavePod

TECH006: Open-Source AI That Protects Your Privacy w/ Mark Suman (Tech Podcast) - The Investor's Podcast (We Study Billionaires) - The Investor’s Podcast Network | Wave AI Podcast Notes

Back to The Investor's Podcast (We Study Billionaires) - The Investor’s Podcast Network

TECH006: Open-Source AI That Protects Your Privacy w/ Mark Suman (Tech Podcast)

The Investor's Podcast (We Study Billionaires) - The Investor’s Podcast Network

Wed Oct 29 2025

Can AI be both powerful and private? Mark Suman, ex-Apple engineer and co-founder of OpenSecret and Maple AI, reveals how end-to-end encrypted AI is shaping the future of inference and trust.

Summary

Podcast Summary

Episode Overview

Podcast: We Study Billionaires – Infinite Tech Series
Episode: TECH006: Open-Source AI That Protects Your Privacy w/ Mark Suman
Host: Preston Pysh
Guest: Mark Suman, founder of Maple AI
Date: October 29, 2025

This episode delves into the profound implications of open-source, decentralized AI—specifically, how individuals can reclaim privacy from big tech’s grasp while still accessing the benefits of cutting-edge AI models. Mark Suman draws on his years at Apple and his current work founding Maple AI to discuss trusted execution environments, verifiable AI, and the philosophical and pragmatic need for privacy-preserving intelligence.

Key Discussion Points & Insights

1. Mark Suman’s Background in Privacy and AI

Mark’s journey: Began with privacy-focused cloud backup software ("the aughts")—emphasized encrypted personal backups ([02:22]).
Apple experience: At Apple, privacy wasn't just marketing. Legal and technical structures required new, privacy-preserving ML workflows ([02:22]–[04:02]).
- “From like probably the third week... I was engaged with a privacy lawyer... It made things difficult. We had to innovate and invent new things that nobody was doing.” – Mark ([03:07])

2. Apple’s Unique Pace and Privacy-Driven Approach to AI

Apple behind in the AI race? Not just leadership: It’s organizational size and commitment to privacy making things slower but more user-centric ([04:49]–[06:05]).
- OpenAI, Google, XAI build fast with massive hardware; Apple is more cautious, using secure enclaves and auditors.

3. Terminology: Verifiable AI vs. Open Source or Decentralized

Why "verifiable" matters: The host and Mark agree that “don’t trust, verify" from Bitcoin applies.
- Verifiability = Ideological transparency, not just open-source code, but the ability for anyone to inspect and validate what runs on servers ([06:40]).
Trusted Execution Environments: Secure enclaves allow users cryptographic proof that open-source code is what actually runs, removing blind trust from the process ([06:40]–[07:37]).

4. The Real Threats of AI Data Centralization

Addiction and convenience: Users (including both host and guest) acknowledge they trust and use centralized AIs out of convenience, even at the cost of privacy ([07:37]).
Long-term dangers: Proprietary AIs can capture your thinking and memories, and once data is given, “you’re not getting it back” ([08:43]).
- “If you’ve kind of given up that thinking process to another machine ... we might be giving up the thing that makes us uniquely human.” – Mark ([09:24])
- “We can dive into that ... I’m calling [it] subconscious censorship ... these proprietary systems ... can be instructed ... to alter your memory to be more mainstream.” – Mark ([09:53])

5. Psychological Manipulation and Algorithmic Influence

Social media analogy: How algorithmic feeds have already altered emotions and beliefs; same could happen with AI but even more intimately ([11:16]).
- “We’ve seen how, just by the way that they order the posts, they can affect your emotional state... Take those tools ... and apply it to AI ... now AI knows you intimately.” – Mark ([11:37])

6. The Need for Verifiable, Open AI Ecosystems

Not doom & gloom: Mark sees technology as a “gift” but urges verifiable setups to avoid manipulation and data harvesting ([13:05]).
Maple AI’s value: Total transparency—open-source code, verifiable execution, and privacy as the core feature ([15:25]–[18:03]).
- “We know that people don’t want to give up their convenience just for the sake of privacy ... So we are going to build ... ChatGPT, but it’s going to have privacy at the core.” – Mark ([18:59])

7. Practical Demonstrations: Maple AI’s Architecture

Mathematical attestation: Every user session receives a cryptographic “green check” proving code untouched and verifiable ([18:03]).
Analogy: Like HTTPS (browser lock icon)—Maple takes it one step further: HTTPSe for “Secure Enclaves” ([18:03]).
Hybrid privacy: Local encryption and private keys ensure even Maple can’t see user data—it’s decrypted temporarily only in secure enclaves ([29:12]).

8. The Competitive Landscape: Open Models vs. Proprietary Giants

Open LLMs improving rapidly: The accuracy gap is closing; “90% of the way there for most use cases” ([24:24]).
- “Really most people don’t need to have that extra 3% ... to really get a lot of value out of it.” – Mark ([25:20])
- Open models like Quin 3 coder now match or beat proprietary models on specific tasks ([24:24]).
Why go open? Sometimes ideological—state actors want their worldview embedded ([26:29]). Sometimes because proprietary competition is impossible.

9. Specialization, Model Routing, and the Future of AI

Specialist models: The next phase is “routers” directing user prompts to expert subsystems (coding, medical, legal, etc.) ([27:51]).
User experience focus: Maple wants to hide complexity; initial model pickers to be superseded by smart, automatic selection ([29:12]).

10. AI Memory, Context, and Privacy

Personal long-term memory: Custom “memory banks” (user-controlled) will let AI recall prior context without leaking private data ([31:40]).
- “We want to build is a truly sovereign AI memory where you can go and see what the system remembers about you ... and you can edit it.” – Mark ([32:49])
Engineering challenge: Preventing overfitting—making sure past memory doesn’t dominate irrelevant future conversations ([38:28]).

11. Inference: The Emerging Competitive Moat

Inference speeds & cost: As chips evolve (e.g., Xai’s custom ASICs), inference cost and speed will divide winners from losers ([41:27]).
- “These apps that we’re building on top of the inference are going to be the competitive moat.” – Mark ([42:32])
- Hybrid models: Local small models preprocess, cloud large models crunch expensive tasks ([43:21]).

12. The Economics and Sustainability of the AI Arms Race

Ongoing bubble: Billions flow between chipmakers, cloud providers, and AI labs—a circular “meme pump” of equity and investment ([44:58]–[47:18]).
- “I think we’re definitely going to have a bubble at some point that’s going to pop. I view it very similar to the Internet ... the winners are going to remain.” – Mark ([46:31])

13. Building in the Age of Reflexive AI

Small team, rapid iteration: AI enables tiny teams to compete; 90–95% of Maple’s code is written by/with AI tools ([49:14]).
- “If we were doing this prior to AI, ... we probably would have had to have two more people, three more people ... so we’re definitely seeing an acceleration.” – Mark ([52:39])

14. The Future: Local, Sovereign AI Hardware at Home?

Vision: AI servers in every home—maybe as common as a modem or heater, owning your own data and sovereignty ([53:53]).
But: Most people still prefer the convenience of the cloud (like Gmail vs. self-hosted email). Will people draw “the line at their brains”? ([54:41]–[55:23])

15. Nostr and Verifiable Identity in AI

Public/private key authentication: Protocols like Nostr could become foundational in digital identity, privacy, and verifiable communication.
- “I see it all coming back to that word verifiable. ... being able to say, hey, this little piece of memory that went into my AI, that’s signed with my private key.” – Mark ([56:03])

16. Practical Takeaway and Final Call

Maple is an “extra tool”—not a total replacement. Use it for conversations where privacy matters most ([57:15]).
- “You get this refreshing feeling knowing that this is just a private room with you and an AI and nobody else is listening ...” – Mark ([57:34])
- Try it: Trymaple AI

Notable Quotes & Memorable Moments

On Apple’s Privacy Culture:
“I had to innovate and invent new things that nobody was doing ... it’s truly part of who they are.” – Mark ([03:07])
On Data Surrender:
“If you’ve kind of given up that thinking process to another machine that has now captured it ... we might be giving up the thing that makes us uniquely human.” – Mark ([09:24])
On Subconscious Censorship:
“These proprietary systems capture your memories and capture your thought process ... then they can be instructed ... to alter your memory to be more mainstream ... they can guide you ...” – Mark ([09:53])
On Verifiable AI:
“It’s being able to inspect, it’s being able to verify everything that you’re running ... you want to be able to look at everything so that nothing is kind of hidden in there that you don’t know about.” – Mark ([07:09])
On Combining Convenience and Privacy:
“We are going to give people all of those core amazing features that they get out of charge and grok. But they’re also going to have privacy built into it ...” – Mark ([18:59])
On the Economics of AI:
“I think we’re definitely going to have a bubble at some point that’s going to pop ... the winners are going to remain.” – Mark ([46:31])
On Ownership and Local AI:
“Maybe this is finally the line in the sand where it’s like, you can have our emails, but you can’t have our brains. Our brains need to live at home.” – Mark ([54:41])

Key Timestamps for Important Segments

| Time | Segment/Topic | |-----------|------------------------------------------------| | 02:22 | Mark’s privacy journey; Apple culture | | 04:49 | Apple’s slow AI pace explained | | 06:40 | The meaning of “verifiable” AI | | 08:43 | Risks of data surrender to proprietary models | | 09:53 | The idea of “subconscious censorship” | | 11:16 | Manipulation via social feeds → LLMs | | 13:05 | Not doom-and-gloom: solutions in openness | | 15:25 | Maple AI’s privacy-by-design architecture | | 18:03 | Secure enclaves, HTTPSe, and verification | | 24:24 | Open models catching up in performance | | 27:51 | Specialization and model routing | | 31:40 | Building user-controlled AI memory | | 38:28 | Challenges of making AI memory not overfit | | 41:27 | Inference as the new AI moat | | 44:58 | Economics of the AI/compute “arms race” | | 49:14 | Programming with AI—90%+ of code AI-written | | 53:53 | The vision (and barriers) for home AI servers | | 56:03 | Nostr and public key/private key for identity | | 57:15 | Parting advice: treat Maple as a privacy tool |

Episode Takeaway

The episode offers a clear-sighted warning about the stakes of letting third parties control and harvest the data that defines us. Mark Suman argues—and demonstrates through Maple AI—that combining open-source, verifiable infrastructure with user-centric design can protect privacy without sacrificing the immense value of AI. The path forward is not just technological; it’s ideological, rooted in the mantra "don’t trust, verify." For those who cherish autonomy and privacy, adding tools like Maple to their digital toolkit may become non-negotiable as AI becomes ever more embedded in daily life.

Loading summary...

Transcript

A (0:00)

You're listening to tip.

B (0:03)

Hey, everyone. Welcome to this Wednesday's release of Infinite Tech. Just like Bitcoin separated money from the state, decentralized inference is now separating AI from big tech. It's a quiet revolution, shifting control of intelligence itself from the centralized data centers to individuals and small developers who can run powerful models privately, securely and anywhere in the world. Today I'm joined by Mark Suman, founder of Maple AI, to unpack how this is being possible through trusted execution environments, secure hardware that protects both data and computation. It's a glimpse into the foundation of a truly open AI ecosystem. And so, without further delay, let's jump right into the interview.

A (0:46)

You're listening to Infinite Tech by the Investors Podcast Network, hosted by Preston Pysh. We explore Bitcoin AI, robotics, longevity and other exponential technologies through a lens of abundance and sound money. Join us as we connect the breakthrough, shaping the next next decade and beyond, empowering you to harness the future today. And now, here's your host, Preston Pish.

B (1:20)

Hey, everyone. Welcome to the show. I'm here with Mark Suman and I'm really excited to have this conversation, sir, because this is such an important topic, like crazy importance, and I think it's only getting started, but I think everybody's going to come to the realization how important this topic is in the coming.

A (1:39)

Five to ten years.

B (1:40)

So welcome to the show. Excited to have you here and really excited to get into this.

A (1:45)

Thank you. Yeah, I'm excited to be on here. I've listened to your show quite a bit, so it's cool to be on here and chatting with you. So I'm honored, sir, I'm honored.

B (1:53)

Let's start here because I'm fascinated by your background. You worked at Apple for many years as a software engineer working on privacy, machine learning and computer vision. On the privacy front, I think this is something that is super relevant to where we're going to go with open source, decentralized AI, which is what you're building here with Maple AI. But what did you see while you were there at Apple that encouraged you or gave you the motivation to go out and start what you're doing right now?

A (2:22)

Yeah, sure. So privacy has been part of my career from the beginning. I started off doing online backup software for people back in, like, the early, I don't know, the 2000s. Right, the aughts. And it was all about how do we save your computer into this new cloud thing that everybody's talking about. But we wanted to offer people a private way to do it because, like, you could back up all your photos to someone's computer and that person who runs a computer can see everything. So we would provide people with this private key that they could use on their computer and encrypt everything before they sent to the cloud. That's kind of where I got my start. And so privacy was always kind of part of who I was. Fast forward to when I joined Apple and on day one, my, you know, my new manager sits down with me and says, I want you to build this thing that we're going to use in the retail stores. But we have to do it in a way that's totally private. Because Apple cares about privacy. Right. It's one of the core things, what I can say, like it truly is one of the core tenets of Apple seeing it on the inside. So from like probably the third week of my project, I was engaged with a privacy lawyer and they were kind of part of the journey throughout the whole thing. And it's like, okay, how do we build this thing? Normal companies would just capture someone's face and capture their identity and look at their banking transactions and all these things. Right? Normal companies would do that. Apple doesn't do it that way. Right. We have to separate all this stuff. We have to find ways to do it that is totally privacy preserving. So it made things difficult. We had to innovate and invent new things that nobody was doing. We had to invent totally new tools for tagging and annotating AI training data and machine learning training data in ways that were totally privacy preserving. So it's some really cool stuff. And I will just say, like, it's truly part of who they are.

A (8:43)

Yeah, I mean, I don't fault anybody for using these tools. Convenience is, is an amazing thing, right? It's, it's why technology exists. Technology comes around things more convenient for people, it adds value into their life and so they grasp onto it and then there's always trade offs. And so I have an OpenAI account, I've got a GROK account, I use them and I use them in a way that I'm trying to minimize my exposure, my privacy exposure. Right. I also obviously build Maple and me and my co founder and so I use that for different purposes. But the threat that I see you talk about like five to ten years down the road, the Difficult thing I heard it described recently, chatting with a friend, is that as you kind of give away your thought process to a proprietary AI service, right, it takes that and there's really no getting it back. Right. They have it now forever, and they can make as many copies as they want to, and then they can choose to put that into their model, they can choose to manipulate it if they want to, they can do whatever they want to with that data, and you're just not getting it back. So five, 10 years down the road, if you look at what's unique about you as a human, it's really the way that you think your face is unique for sure. But you could probably find a pretty good doppelganger out there that looks similar to you. But your memories, your thought process, the way that you perceive the world, is probably the most unique thing about you. And if you've kind of given up that thinking process to another machine that has now captured it and can train off of that, we might be giving up the thing that makes us uniquely human. It's. I think a threat could be viewed in that lens of are we turning over some of our humanity to a proprietary system? And I'm working on a long form article about this right now. It's a phrase I'm calling subconscious censorship. And we can dive into that if you want to. But it's really this notion that these proprietary systems capture your memories and capture your thought process, and then they can be instructed, given directives to alter your memory to be more, more mainstream or be less mainstream. You know, they can guide you and direct you how they want.

A (15:25)

Okay. Yeah. And I, and I think what you just described is what most people want. Right. This is the tale as old as time with privacy technology and freedom. Technology is, we know that we should be probably being better about our data and better about our information that we share, but it's just so useful and so convenient and we see so much productivity gains from using some new technology that we're willing to kind of like close our eyes and plug our nose as we use them. And I'm just as guilty as everybody else with doing that because we have to make trade offs in our personal life to do that. And I mean, we haven't really talked about the potential for data leaks, right. And maybe I'll just drop this in here really quick and then I'll answer your question directly. But we've seen with ChatGPT, with Grok most recently, both of them had bugs in their software where chats were being indexed on Google search results. And so I don't know if you saw this was like a month or two ago. So people were searching for stuff and finding personal chats that they had made. These were specifically around the share button. So in ChatGPT you can click share. It gives you a private link that you can send to somebody and now somebody else can read that chat. And so it was still meant to be private between you and someone else, but Google was picking up on those and now people could search and it would be things like somebody's chatting about their marriage, you know, and marriage difficulties they're having. And then they send it to their spouse and say, okay, here's what our AI therapists on ChatGPT told us. And now their marriage details are spilled out onto the Internet. So it's when you give somebody else your data, that's a risk you're taking on, is have stuff like that happen. What we're trying to build is we're trying to build verifiable AI that people can use so they can see everything through the process. We build everything in the open. All of our code is online before we push it in the servers, before we push anything as an update that you can download. You can see the code first so people can inspect it. And then we also know that local AI is really the most private AI. Something you can run on your phone, you can run on your computer. It's never going to get more private than that. Turn off the Internet, talk directly to it. You can inspect it before you use it. Like that's the utopia right there. But not everybody has a powerful enough device yet to do that. They cost tens of thousands of dollars to run these big, largest open source models. So we're trying to give people an in between. We run secure enclaves in the cloud, we push our code there. And then what it does is it gives you, it's called an attestation, which is really just a mathematical proof and it's a way to match. So it's a way to say, okay, you have this code that's Open source on GitHub, but how do I know that you're actually running that Exact same code on your servers. So there's a lot of other private AIs out there that say, hey, here's our open source code. We're private, we're not tracking what you do. But you can't actually check. You can't verify that.

C (21:01)

It simple Listen, if you're a money nerd like me, then you need an easy way to track your net worth in real time. And that's why I'm thrilled to tell you about today's show's sponsor, Kubera. I've tried all sorts of different apps for this and Kubera is definitely the best one I could find I it links in all my bank accounts, investment accounts and private investments and it gives me a clear real time overview of everything I own. One of the issues I found with other apps is that my accounts constantly get disconnected or I'm not able to get them connected at all to give me that big picture financial overview that I need. And it's not just about tracking where I am today, it's about understanding where I'm headed. So Kubera has this really cool feature called the Fast Forward feature. It lets me model out different life scenarios so I can say if I'm making this much money in five years, how much will my net worth change over time? Or if I decide to buy a house next year and put this much money down, what will my net worth be in five, 10, 20 years down the line? That way I always have a clear picture of what's next to ensure I continue to hit my financial goals again. I'm thrilled to have Kubera as a sponsor. I use their app all the time and if you want to stay on top of your wealth I you can get $100 off your first year subscription@kubera.com WSB that's k u b e r a.com WSB startups move fast. And with AI, they're shipping even faster and attracting enterprise buyers sooner. But big deals bring even bigger security and compliance requirements. A SOC 2 isn't always enough. The right kind of security can make a deal or break it. But what founder or engineer can afford to take time away from building their company? Vanta's AI and automation make it easy to get big deals ready in days. And Vanta continuously monitors your compliance, so future deals are never blocked. Plus, Vanta scales with you backed by support. That's there when you need it every step of the way. With AI changing regulations and buyers expectations, Vanta knows what's needed and when. And they've built the fastest, easiest path to help you get there. That's why serious startups get secure early. With Vanta, our listeners get $1,000 off at vanta.com/billionaires. That's V A N T A dot com billionaires for $1,000 off.

A (24:24)

Yeah, it's a valid concern. Especially in the early days, the open models were significantly worse than the proprietary ones. But we've seen an acceleration. I mean, ChatGPT came on the scene two and a half years ago, maybe three years ago is when it really caught on. And we've seen the open models catch up a ton in that timeframe. You know, they were like 50% as good, then 75%. Now they're like in the 90% range. You have a coding model Quin 3 coder, which is scoring just as good as some of the proprietary models on programming in some areas, not all areas. And we're seeing a point where like the benchmarks, however you want to define that are really getting similar. And then it comes down to just using it and seeing how it behaves for you. And really most people don't need to have that extra like 3% in their model to really get a lot of value out of it. And then the other thing we're seeing is you bring up GPT5 and arguably GPT5 is an incremental increase over GPT4. And a lot of people complained and wanting them to go back to GPT4 and make it available again. And some people have done like introspection into the routing technology behind GPT5 and think that there's actually still a lot of GPT4 just under the hood that's helping to power version 5. And so you look at that and you say, okay, maybe their progress has slowed down just a little bit. And then also they open sourced GPT OSS, which is really just kind of 4, 0 under the hood. And so you're seeing that open source from their standpoint is starting to catch up. And then you have this whole market dynamic of the Chinese models. You have Deep seq, you have Quinn, you have these other ones, I'm blanking the other one right now, but you have these other ones that are coming up and in order for them to compete with these big proprietary models, they're going open first and they're trying to be as good so that they can compete. So I think we are seeing this world where the open source catches up just enough that it becomes just as valuable to a regular person, mainstream person, than these other models.

A (29:12)

Sure, yeah. So first off with Maple. Right now, we're only in the cloud. We want to provide local stuff as well. It's like a hybrid local cloud where all your data is encrypted locally first on your device. And we use a private key. Coming from the Bitcoin world, we understand the power of a private key. Same with Nostr. Right. And so we apply that here. So you're chatting with AI locally on your phone, it encrypts it and then sends it to the cloud for processing. In the cloud, a secure enclave uses your private key, decrypts it, gives it to the AI, it comes with a response, re encrypts it with your private key and sends it back to you. So we are not in the middle. We can't see anything going across the wires. Only the secure enclave sees the personal data. But you can go look at the source code and see that there's nothing going on there. So how do we tailor that experience? Right now we just give you a model picker and you're having to choose. And we have a lot of users telling us they get almost like low key anxiety from like trying to pick which one's the best model that I should use right now. So part of our big 2.0 push that we have coming up is we want to build something that helps guide the user and say, like, I am in big brain thinking mode right now. So I'm going to click on this thing and it's going to drop me into a model that helps me do that. Or I'm just in quick trivia mode. I want to look up something, so it's going to drop me in there. In the beginning it'll be like just an easy picker to do for the user. But then we'll switch over to an auto mode where they just chat and it knows what to do. And you just put a simple classifier in front of it so it looks at your prompt and it can quickly determine itself what should be used. Just like you would, you know, you would say, I'm in this mode, here's what I'm thinking. And you would pick that. Well, it'll do that for you. So we want to get more automatic with that, but always provide these advanced features that people can turn back on and be more selective if they want to love that.

A (31:40)

Yeah, definitely. What you just described. There are two different implementations that kind of help out with the same thing. So one way that the ChatGPT does it is they have these custom GPTs where you can set up this thing and it has a lot of the context pre built into it. It's basically like you typed in a system prompt with all the stuff that you want. And for people listening who maybe aren't like super into AI, a system prompt is basically the instructions that you give to the model, you give to the AI to say, hey, when I talk to you, I want you to kind of be in this personality or this frame of mind, or as this character as I'm talking to you. So you can get silly and you can say like, I want you to talk like a pirate to me. So every time you chat with it, it's going to talk like a pirate. That's like the extreme example, but more nuanced. You can be like, hey, I am going to have a legal discussion with you right now. So I want you to be a lawyer. I want you to be a contract lawyer and I want you to have these qualities about you. So to me, that's one part of what you just described and we definitely do want to do that. We have a system prompt you can edit. We show you what the system prompt is. We want to have multiple in the future where you can customize those and maybe a dropdown or something, say, hey, I'm in legal mode right now. Switch into that mode. The other aspect you just described is the memory side of it and we are definitely working on that. We're going to have, you know, an open source memory component to it. We don't know exactly which direction we're going to go yet with it, but it's going to be something where you will see. Okay, here's everything that the AI has learned about you. And AI memory is really fascinating because I like to view it as you're sitting down with a biographer, right? Say you're Steve Jobs and you want to have everybody know about your life, so you get the best person out there to write biographies. That's what's happening with you every day. As you're using ChatGPT or any AI product, it is sitting down and trying to learn everything it can about you. Here's how he thinks, here are his childhood memories, you know, yada, yada, yada. The difference is in a proprietary system, you don't get to read that biography. You don't really get to see what's in there. They will show you an interface that says, oh, here's the things we know about you, and we'll even let you delete it. But there's no guarantee that that's actually happening, right? If you delete that thing out of there, it probably still remembers it, but it's just like, oh, we'll tell them that we're not going to use it, but we could use it if we want to. What we want to build is a truly sovereign AI memory where you can go in and see what we remember about you. Not we, what the system remembers about you. And then you can edit it, you can add to it, and then that will get pulled into future chats. And so with those two combined, the system prompt personality thing, that's more proactive. You can say, I want you in this mode, whereas the memory side is more passive. It's like, hey, this is my context about me. So use it selectively as you see fit. Let's take a quick break and hear from today's sponsors.

C (34:20)

Picture this. It's midnight. You're lying in bed, scrolling through this new website you found and hitting the add to cart button on that item you've been looking for. Once you're ready to check out, you remember that your wallet is in your living room and you don't want to get out of bed to go get it. Just as you're getting ready to abandon your cart, that's when you see it. That purple shop button. That shop button has all of your payment and shipping info saved, saving you time while in the comfort of your own bed. That's Shopify. And there's a reason so many businesses, including mine, sell with it. Because Shopify makes everything easier, from checkout to creating your own storefront. Shopify is the commerce platform behind millions of businesses all around the world. And 10% of all e commerce in the US from household names like Mattel and Gymshark to brands like mine that are still getting started. And Shopify gives you access to the best converting checkout on the planet. Turn your big business idea into reality with Shopify on your side and thank me later. Sign up for your $1 per month trial and start selling today at shopify.com WSB that's shopify.com WSB There's a better way to meet Face to face Remarkable Paper Pro Move It's a paper tablet A digital notebook that combines the familiar feel of paper with the digital powers of a tablet. Start by taking notes with any of the dozens of built in templates. Then turn your handwriting into typed text and share it by email or Slack. Think about it. The ways we capture thoughts on the go all have their drawbacks. Paper is hard to organize and easy to lose. Laptops are too bulky and uncomfortable and on our phones our attention can quickly get hijacked. Remarkable Paper Pro Move is nothing like your other devices. It has a display that looks, feels and even sounds like paper. It can fit all of your notes and documents and lasts up to two weeks on a single charge. But it slips easily into your jacket pocket. And most importantly, Remarkable's mission is about helping you think better. That means no apps, social media or any other distractions. Just you and your thoughts. You can try the Remarkable Paper Pro move for 50 days for free and if it's not what you're looking for, you get your Money back. Visit remarkable.com to learn more and get your paper tablet today. That's remarkable.com support for this show comes from public.com you're thoughtful about where your money goes. You've got your core holdings, some recurring crypto buys, maybe even a few strategic options plays on the side. The point is, you're engaged with your investments and Public gets that. That's why they built an investing platform for those who take it seriously. On Public, you put together a multi asset portfolio for the long haul. Stocks, bonds, options, crypto. It's all there plus an industry leading 3.8% APY high yield cash account. Switch to the platform built for those who take investing seriously. Go to public.com WSB and earn an uncapped 1% bonus when you transfer your portfolio. That's public.com WSB paid for by Public Investing. Full disclosures in the podcast description all.

A (41:27)

Yeah. So inference versus training, the cost involved and the power involved are very different. Right. So the training part costs, I don't know what the exact numbers are. Let's just say 10x. It might be bigger, it might be smaller. But yeah, it's like it takes 10x the amount of resources to train something. Just like as a human, it takes all this time to train you over decades for you to live life and learn all these things and then eventually you can sit down and have a fruitful conversation like we are right now. Right. And so it's easier for us to have this conversation than it was for us to learn everything we learned up to this point so that we are capable of having this conversation. So inference should be viewed that way. Now you're ready to have a chat. And I see that. What's the moat? What's the unique thing that's going to be competitive? And that is just the user experience. And so these apps that we're building on top of the inference are going to be the competitive Moat and what different qualities they have. And we're already seeing that with ChatGPT and some of these others. They're trying to build apps on top of their inference layer now that really pull people in. The latest is the Sora video app that's pulling people in and trying to make it more engaging. Right. As far as inference goes, I think that just only comes down in cost over time. And even though we're going to get bigger models, we're going to build chips that are more efficient for processing those models. Apple, even though they don't have the right AI solution yet, according to the market, they have built these chips into every single device that are just highly specialized at processing these models. So one thing that we're looking at with Maple is doing a hybrid approach where you actually have smaller local models that run incredibly fast and are extremely cheap to run because they're just running on your spare cycles on your device. And they will do a bunch of the initial processing and on some of the most sensitive information and they will come up with the most efficient prompt to give to the cloud model. So you might go in and bang out like this massive prompt, paste in a whole PDF of information and then the local model will crunch that all and say, okay, this is all good and dandy, but really what I need to pass on is a smaller chunk. It'll pass it on to the cloud, that'll get processed on the more expensive servers and then come back to you. And I think in a model like that, inference continues just to drive into the ground as far as price goes and gets faster as well. And so we end up with a better user experience. And so the people that can console that kind of user experience are going to have a better moat, be better as a competitive advantage.

A (44:58)

Yeah, it's crazy. I mean, I saw somebody comment over the last 24 hours about the whole Broadcom and OpenAI thing where it's like, hey, OpenAI, we want to buy all these chips from you, but we don't have the money to pay for them. And so they basically say, let's do a press release together. Broadcom stock goes up now their market cap has gone up $150 billion. And it's like, boom, there's your money that you needed. So we'll loan you, we'll loan you our market cap basically to help you out. So it's kind of this crazy thing, a lot of money being tossed around. Where does this resolve? Oh man, I wish I had a crystal ball to understand, but I think that we are going to have big players out there. We're going to have these people who are building these big, massive mainstream solutions. And I also think that you look at the government contracts that have come to these major models, right? They gave 200 million to Xai, 200 million to OpenAI to Anthropic. I think Meta got that too. I can't remember. So there are bigger things at play here with Department of Defense and other governments around the world. So I think that there is going to be a need for large scale systems like that. There's also a need for other people to build out systems. And I think there's a world where they all exist together. I don't think this is going to be a race to where there's going to be one winner take all. Because really there are so many different ways to approach intelligence in this life and there are so many different avenues and so many different needs that Grok's not going to be able to solve them all. ChatGPT is not going to solve them all. And so I don't know where all the money resolves. I think we're definitely going to have a bubble at some point that's going to pop. I view it very similar to the Internet and so we're going to have all these companies that over invest and then there's going to be a retraction. And a retracement back and the winners are going to remain. So I don't have a perfect answer for you on that, but I just think that there will be some over investment, but I don't think it's going to pop and go away. There's too many benefits. People are seeing too much productivity from AI, too much value coming out of it, that it's going to survive. It's just which people and which companies will remain standing.

A (49:14)

So there is a lot of salesmanship going on when it comes to the Vibe coding space. And so a lot of progress has been made. And there are definitely stories where people go on and say, I wrote like a few sentences and it gave me an entire app that I can use. And those are great. I think for proof of concept, we've definitely seen a lot of great things in that space. But getting an app that you wrote with just one paragraph of text into production that millions of users can use, that has covered all the edge cases and stuff, that's a totally different story. So I definitely see a lot of memes where it's like, oh, and software engineers are so cooked. But really what I think the great power is is software engineers using AI and accelerating their abilities. Yes. So that's what we're seeing. You know, I don't want to cast throw shade at the companies that are doing Vibe coding for people who are not software engineers, because what I see as a huge value add right there. Being a software engineer myself for decades, I get approached all the time. Someone's like, I've got a great idea for an app. I want to build this. You know, can you please go build this? And they'll draw on a little piece of paper, they'll maybe draw, build a PowerPoint presentation. But they try to give me requirements. And I can see very quickly that the requirements haven't thought of everything or the idea just is kind of off base. So where Vibe coding comes in now is they can take that, and instead of coming to someone like me, they can give it to an AI. It can build in the proof of concept, they can play with it, and they can say, oh, this is a piece of garbage, or this is a great idea. Let me iterate a bit on this. And so now when you go to approach someone to build it for you, you've got this really refined proof of concept that conveys your idea and has thought through a lot of the initial things. And then for us, we've taken AI and built it into all sorts of parts of our process. So we're using tools locally where we are running coding environments, you know, ides locally with something like Claude code. We've tried out Codex from ChatGPT, we're using factory right now also as kind of a new hotness that's come on. So we're using all of these. We also use Maple, we've got Quinn 3 coder, where we got that plugged into IDE. So we're using all these tools. And then when we check in code to GitHub, do a pull request, we have two other AIs that hop in there as code review agents. And so they're both reviewing the code and they come from two different models, two different companies, and so they give a different perspective. And so they drop in their comments and say, hey, this line of code is maybe, you know, you should think through this more. There's potential bug here, that kind of stuff. And then we go in and we say, hey, Claude, like, respond to these comments from the pull request. And so you have these agents that are really helping out. But ultimately in the end, we are the ones reviewing the final say on the code. And we might say, hey, we don't like the approach they all took, so let's get in there and bang things out a little bit differently. But truth be told, I think probably like 90% of our code, maybe 95% of our code, is written by AI with the human in there, directing it, guiding it, inspecting it, and making sure that it comes out correctly.