#143: ChatGPT Revenue Surge, New AGI Timelines, Amazon’s AI Agent, Claude for Education, Model Context Protocol & LLMs Pass the Turing Test - The Artificial Intelligence Show

Summary10 min read

Podcast Summary: The Artificial Intelligence Show - Episode #143

Title: ChatGPT Revenue Surge, New AGI Timelines, Amazon’s AI Agent, Claude for Education, Model Context Protocol & LLMs Pass the Turing Test
Hosts: Paul Roetzer and Mike Kaput
Release Date: April 8, 2025

Introduction

In Episode #143 of The Artificial Intelligence Show, hosts Paul Roetzer and Mike Kaput delve into a multitude of pressing AI topics shaping the current landscape. From OpenAI's explosive growth and funding to groundbreaking advancements in AGI timelines, Amazon's latest AI agent, educational applications of Claude by Anthropic, and the evolving Model Context Protocol (MCP), the duo provides an insightful exploration into the future of artificial intelligence.

OpenAI's Revenue Surge and Massive Funding

Timestamp: 04:23

OpenAI continues to dominate the AI realm with unprecedented growth and significant financial backing. The podcast discusses OpenAI’s recent achievement of raising $40 billion in the largest private tech funding deal in history, elevating its valuation to $300 billion. This colossal investment primarily stems from SoftBank, with plans to scale compute resources, advance AI research, and fund the ambitious Stargate project in collaboration with Oracle.

Key Highlights:

User Growth: ChatGPT now boasts 20 million paying users and 500 million weekly active users, reflecting a 43% spike since December.
Revenue Projections: OpenAI's revenue is projected to reach $12.7 billion this year, potentially tripling from the previous year, with expectations of $28 billion by 2026.
Cash Burn: Despite soaring revenues, OpenAI anticipates cash burn rising from $2 billion last year to nearly $7 billion this year, peaking at $20 billion by 2027.

Notable Quotes:

Paul Roetzer [00:00]: "They think that their system is basically going to do the work of an entire organization with a couple people orchestrating maybe millions of agents. Like that may sound sci-fi, but that is absolutely what they're thinking is going to happen."
Mike Kaput [04:23]: "ChatGPT has hit 20 million paying users and 500 million weekly active users. That is a 43% spike since December and it is translating into some serious revenue, at least $415 million a month, which is staggeringly up 30% in just three months."

Discussion: Paul and Mike explore the dual nature of OpenAI’s mission—balancing consumer tech advancements with the broader goal of achieving AGI. While ChatGPT's meteoric rise underscores OpenAI's market dominance, the substantial investment signifies a hefty bet on reaching AGI, despite the escalating cash burn.

New AGI Timelines: The "AI 2027" Report

Timestamp: 13:11

A significant portion of the episode is dedicated to discussing a new report titled "AI 2027" by the AI Futures Project. Spearheaded by former OpenAI researcher Daniel Cocotaldjo and AI researcher Eli Lifland, the report presents a dramatic timeline predicting that by the end of 2027, AI will surpass human capabilities in areas such as coding, research, and even self-improvement.

Key Highlights:

Agent Evolution: The fictional scenario introduces an AI company developing "Agent 1," rapidly evolving into "Agent 4," an autonomous system achieving breakthroughs weekly.
Global AI Race: By 2027, the US and China are projected to be locked in a heated AI arms race.
Criticism: Critics argue the report borders on fearmongering, likening the accelerated AGI timeline to science fiction rather than grounded forecasting.

Notable Quotes:

Paul Roetzer [18:13]: "I just wouldn't recommend it [AI 2027] because it's just a lot to handle. If you're mentally in a place where you can consider the really dramatic dark side of where this goes pretty quickly, understand it is all based on fact."
Daniel Cocotaldjo [not directly quoted]: "We predict that AIs will continue to improve to the point where they're fully autonomous agents that are better than humans at everything by the end of 2027 or so."

Discussion: While acknowledging the credentials of the report's authors, Paul expresses caution about the extreme nature of the predictions. He recommends a balanced approach, suggesting listeners start with more moderated analyses, such as Kevin Roose’s New York Times article, to gain a nuanced understanding. Additionally, the episode highlights Google DeepMind’s proactive efforts to safely build AGI, emphasizing the importance of preparedness in managing potential risks.

Notable Quotes:

Paul Roetzer [18:51]: "There's a lot of people who are still trying to process what ChatGPT can do today, but the people who are on sort of the frontier are so far beyond that."

Amazon’s Entry into the AI Agent Race with "Nova Act"

Timestamp: 29:13

Amazon has unveiled its new AI system, Nova Act, marking its foray into the competitive landscape of AI agents. Nova Act is a general-purpose AI capable of autonomously performing tasks via a web browser. Currently in its research preview phase, it is bundled with a software development kit (SDK) aimed at developers to create AI agents capable of tasks like booking reservations or filling out web forms.

Key Highlights:

Integration with Alexa: Nova Act is set to be integrated into Amazon’s upcoming Alexa upgrade, potentially expanding its reach exponentially.
Development Lab: Nova Act emerges from Amazon’s new AGI lab in San Francisco, led by former OpenAI and Adept executives David Luan and Peter Abbeel.
Comparison: Unlike competitors like OpenAI’s Operator or Anthropic’s tools, Nova Act’s integration with Alexa could provide a significant advantage in user adoption and utility.

Notable Quotes:

Mike Kaput [30:25]: "Buy for Me uses encryption to securely insert your billing information to third-party sites."

Discussion: Paul and Mike examine the implications of Amazon's entry into AI agents, noting the potential for widespread consumer use due to Amazon’s extensive ecosystem. While Nova Act is still in its infancy and not yet benchmarked against more established models, its integration with Alexa signifies a strategic move to embed AI functionality into everyday consumer interactions.

Notable Quotes:

Paul Roetzer [32:51]: "It's starting to change the conversation around Amazon and their AI ambitions."

Claude for Education: Anthropic’s Initiative

Timestamp: 43:20

Anthropic has launched CLAUDE for Education, a tailored AI model designed specifically for colleges and universities. This initiative emphasizes fostering critical thinking over merely providing quick answers, utilizing Socratic methods to guide students through problem-solving processes.

Key Highlights:

Learning Mode: CLAUDE prioritizes engaging students by asking probing questions to enhance their understanding rather than simply supplying answers.
Partnerships: The program is being rolled out at institutions like Northeastern University, LSE, and Champlain College, with plans to integrate into platforms like Canvas.
User Base: Northeastern University alone will have access to CLAUDE, covering 50,000 users across 13 campuses.

Notable Quotes:

Mike Kaput [43:20]: "This is a pretty comprehensive initiative in education."

Discussion: Paul underscores the importance of responsible AI integration in education, highlighting personal experiences where AI effectively guided his daughter's learning process. He advocates for proactive measures by AI developers to ensure AI tools enhance educational outcomes without undermining the learning process.

Notable Quotes:

Paul Roetzer [46:32]: "If kids just have ChatGPT and they just go say, hey, give me the answer... then we lose."

Model Context Protocol (MCP): Bridging AI and Data Systems

Timestamp: 60:39

The Model Context Protocol (MCP), introduced by Anthropic, is gaining traction as a universal standard for connecting AI assistants to various data systems like Google Drive, Slack, GitHub, and more. Often likened to a "USB-C connector" for AI data access, MCP aims to eliminate the need for custom integrations by providing a standardized method for AI models to fetch and utilize relevant data across different platforms.

Key Highlights:

Standardization: MCP offers a consistent way for AI models to interact with diverse data sources, enhancing their functionality and relevance.
Support and Adoption: Backed by industry giants like OpenAI and Microsoft, MCP is already supported by hundreds of connectors.
Potential: Experts believe MCP could significantly improve AI assistants' capabilities by enabling seamless access to a vast array of data repositories.

Notable Quotes:

Paul Roetzer [61:16]: "This is an abstract one for me. Usually, there's AI topics that just like my brain generally does pretty good job of understanding the context. This is one I struggle with still."

Discussion: While MCP is recognized for its innovative approach, Paul admits to finding the concept abstract and technically challenging. However, he acknowledges the transformative potential of MCP in enabling AI systems to integrate more deeply with organizational data, thereby enhancing productivity and decision-making processes.

Notable Quotes:

Dharmesh Shah [62:54]: "Once you see it work, it will be magical."

AI and the Turing Test: Can AI Truly Pass?

Timestamp: 37:48

A recent study has reignited the debate over the Turing Test's relevance in assessing AI intelligence. The study demonstrated that advanced language models like GPT-4.5 and Meta's LLaMA can often be mistaken for humans in text-based conversations when appropriately prompted.

Key Highlights:

Experimental Setup: Participants engaged in conversations with both humans and AI without knowing which was which. GPT-4.5, when prompted to emulate a shy, slang-using youngster, fooled 73% of participants.
Implications: This raises questions about the authenticity of AI-driven interactions and their potential applications in fields like therapy and customer service.

Notable Quotes:

Paul Roetzer [40:05]: "I just feel like we're there, like right. People are convinced these things are more human than human in many cases."

Discussion: Paul and Mike discuss the ethical and practical implications of AI passing the Turing Test, emphasizing the shift from theoretical assessments to real-world applications. Paul reflects on the potential psychological impacts and the evolving role of AI in human relationships, noting the fine line between utility and dependency.

Notable Quotes:

Paul Roetzer [43:10]: "I think a lot of society is actually going to come around to that pretty quick."

AI Product and Funding Updates

Timestamp: 68:07

The episode briefly touches on several noteworthy AI product launches and funding rounds:

Runway:
- Funding: Raised $308 million, doubling its valuation to over $3 billion.
- Innovation: Introduced Gen4 AI models for media generation with enhanced fidelity and controllability.
- Vision: CEO Chris Valenzuela aims to transform AI into a digital film crew, significantly shortening the filmmaking timeline.
Replit:
- Funding: In talks to raise $200 million at a $3.3 billion valuation.
- Product: Launched a full-stack AI agent capable of writing and deploying software autonomously.
Sesame AI:
- Funding: Finalizing a $200 million round, valuing the company at over $1 billion.
- Focus: Developing lifelike voice assistants, competing with players like ElevenLabs and OpenAI.

Notable Quotes:

Mike Kaput [66:31]: "They have backing from General Atlantic, Softbank, and Nvidia, betting that all this AI video stuff is not just a gimmick."

Discussion: Paul and Mike highlight the intense competition in the AI landscape, particularly in AI-driven video and voice technologies. They discuss the challenges companies like Runway face in maintaining their edge amidst rapidly evolving technologies and increasing market competition.

Listener Question: Preparing for AGI Without UBI

Timestamp: 72:17

Question: How do you prepare for AGI? Short of having a serious discussion of a meaningful Universal Basic Income (UBI), how do you actually prepare?

Discussion: Paul addresses the complexity of preparing for AGI, expressing skepticism about UBI as a feasible solution due to its myriad psychological and societal implications. He emphasizes the importance of staying informed, understanding model advancements, and engaging with ongoing research and discussions.

Notable Quotes:

Paul Roetzer [68:56]: "My approach to how to prepare for AGI is to stay informed. It's to try and project out the improvements in the models. It's to read the reports of other people who are trying to look to the future."
Mike Kaput [71:45]: "Give yourself a little credit. I know you said you didn't have an answer, but it's a pretty good answer. Isn't the answer? That's the one."

Conclusion: While acknowledging the uncertainty surrounding AGI’s impact, Paul advocates for continuous education and proactive engagement with AI developments as the most practical steps individuals and organizations can take to navigate the future shaped by AGI.

Final Thoughts

As the episode wraps up, Paul and Mike urge listeners to stay abreast of imminent announcements from tech giants like Microsoft and Google, anticipating a flurry of developments in the AI sector. They highlight the importance of leveraging resources like the Marketing AI Institute newsletter for comprehensive updates and ongoing AI education.

Notable Quotes:

Paul Roetzer [72:17]: "Thanks for listening to the AI show. Visit marketingaiinstitute.com to continue your AI learning journey and join more than 60,000 professionals and business leaders who have subscribed to the weekly newsletter, downloaded the AI blueprints, attended virtual and in-person events, taken our online AI courses, and engaged in the Slack community."

This episode of The Artificial Intelligence Show provides a thorough examination of current AI advancements, funding landscapes, ethical considerations, and future projections. Paul and Mike offer a balanced perspective, urging listeners to remain informed and engaged as AI continues to evolve at a breakneck pace.

Loading summary

Transcript68 lines

[00:00]
Paul Raitzer
They think that their system is basically going to do the work of an entire organization with a couple people orchestrating maybe millions of agents. Like that may sound sci fi, but that is absolutely what they're thinking is going to happen. Welcome to the Artificial Intelligence show, the podcast that helps your business grow smarter by making AI approachable and actionable. My name is Paul Raitzer. I'm the Founder and CEO of Marketing AI Institute and I'm your host. Each week I'm joined by my co host and Marketing AI Institute Chief Content Officer Mike Caput as we break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career. Join us as we accelerate AI literacy for all. Welcome to episode 143 of the Artificial Intelligence Show. I'm your host Paul Raitzer along with my co host Mike Kaput. We are recording on Friday, April 4, 8:40am I'm expecting Microsoft is making announcements about Copilot today, so timestamps relevant today. We won't, we won't have the latest other than we know Microsoft is announcing something Google Next Google Cloud Next event is next week in Las Vegas, so we're expecting a lot of news from Google very soon. I will actually be out there all week, so if anybody happens to be at the Google Cloud Next conference, drop me a message. Maybe we can meet up in person. So that is why we're doing this on a Friday. I will not be here on Monday to do this. So we have still a lot to cover. Even though it's a short week, there was quite a bit going on. Some interesting reports released related to AGI, some additional thoughts about AGI. Timing is great. Given that we just launched our Road to AGI series, a lot of new information starting to emerge. This episode is brought to us by the Marketing AI Conference or Macon. This is the sixth annual event. It's happening October 14th to the 16th in Cleveland. This is the flagship event for Marketing AI Institute. If you're kind of new to this and aren't familiar with some of the things we do, the Marketing AI conference was the first major thing we launched in 2019. So I had started Marketing Institute in 2016 as more of like a research entity and you know, sharing the story of AI. And then 2019 is when we launched the marketing AI conference. So last year we had about 1100 people from I don't know, I think it was close to 20 countries came to Cleveland. So we're expecting at least that many. I the team always gives me a hard time when I throw out numbers. But my optimistic is I, I think 1500. So there I just did it anyway, 1500 in Cleveland this fall. I'm excited because it's the first time we're actually doing it. Like Cleveland is our hometown. So I guess get excited for people to come and experience Cleveland anyway. But fall in Cleveland is like my heaven. Like I love fall in Cleveland. The leaves are changing, it's, you know, crisp air. It's just my absolute favorite time of year in Cleveland. So I hope people can come and join us. We just announced the first 19 speakers. So you can go to Macon AI, that's M A I C O N AI and check out the list of speakers. The agenda still shows the 2024 agenda. It'll give you a really good sense of the type of programming we do. And then we'll be updating with the 2025 agenda soon. You can go look at the four workshops that we have planned. So there's four pre event workshops on October 14th that are optional. Mike is leading an AI productivity workshop that's going to be all about use cases and tangible actions. I'm leading an AI innovation workshop. This is a workshop I've been thinking about and kind of working on for a couple years. This is the first time I'm actually going to run this one. Um, we have AI for B2B content and lead Generation with Andy Crestadina, who's amazing. And then we have From Facts to Acts, How AI Turns Marketing Measurement into Results with Christopher Penn and Katie Robert. So that those are going to be amazing. Again, those are optional, but you can go read about all those workshops and check it out. And we have a price. The price goes up April 26th. So you've got a couple weeks here to get in at the current early bird pricing. Again, go to Macon AI. That's M A I C O N A I. We would love to see you in Cleveland October 14th to the 16th. All right, Mike, Chad GPT. OpenAI just kind of keeps growing, huh? Kind of wild, yeah.
[04:23]
Mike Kaput
Our first main topic today concerns just the crazy growth numbers coming out of OpenAI. So they first off just pulled off the largest private tech funding deal in history, raising $40 billion at a 300 billion valuation. This puts their valuation, their size, in the same league as SpaceX and bytepants in terms of private companies and of course well ahead of any private AI competitor. Now that money is coming largely from SoftBank and they apparently plan to spend big. OpenAI wants to dramatically scale, compute, push AI research and fund its Stargate project with Oracle that we've talked about in the past. Now there is a catch here. SoftBank can cut its investment in half if OpenAI doesn't fully convert to a for profit structure by the end of the year, which is also a struggle we have documented in the past as well. In the meantime, ChatGPT has hit 20 million paying users and 500 million weekly active users. That is a 43% spike since December and it is translating into some serious revenue, at least $415 million a month, which is staggeringly up 30% in just three months. Now with enterprise plans, API charges 200amonth. Pro tiers in the mix. OpenAI is now pacing towards a whopping $12.7 billion in revenue this year, which means it could triple last year's numbers even as, however, its cash burn is soaring. However, investors clearly think they've got quite a long Runway and increasingly which we'll talk about, they believe that the destination of all this money is AGI or artificial general intelligence. So first up here, Paul, maybe talk to me about the uses of this funding. Like on one hand, OpenAI is a consumer tech company. It's in a ruthlessly competitive market. It's trying to win and retain users like any other company. So having a huge war chest makes sense. On the other though, there's this kind of regal where they say they've come out and published that they really need the money to build AGI. So which is it?
[06:52]
Paul Raitzer
Yeah, I mean, I think it's a little mix of both. The growth is nuts. I Sam omen tweeted on March 31, I'm not, I can't remember if I said this one on last week's episode or not. I don't remember when this tweet came out, but he said the ChatGPT launch 26 months ago was one of the craziest viral moments I'd ever seen. And we added 1 million users in five days. We added 1 million users in the last hour. So when he was trying to give context to like how, how dramatic the growth from the image generation launch was. So this is all came from the image generation launch. It was massive. And so you hit on this 500 million weekly after users we had just reported on 300 million I think in February is like, yeah, so pretty crazy in terms of how they're going to use the money. I went back to a February Information article. The information, which is a great source that we, you know, constantly reference on on the podcast and they kind of Broke down some details. They were obviously very well sourced in their reporting because everything has come true that they said back then. So they said OpenAI has told investors SoftBank will provide at least 30 billion of the 40 billion, which is what it is rumored or reported that they have provided nearly half of that capital, which will value OpenAI at 260 billion. That's pre money. So the 300 billion is after. The money will go towards Stargate. So they're saying of the 30 billion, well, I guess of the 40 billion total, half of that is being allocated toward the building out of the Data Centers with SoftBank and Oracle. The money will be used over the next three years to develop AI data centers in the U.S. openAI is planning to raise about 10 billion of the total funds by the end of March. It sounds like they got the commitments in place by the end of March for all of this. That article again from February that we'll put in the show notes said the financial disclosures also show how Entangled SoftBank and OpenAI have already become. The company forecast that One third of OpenAI's revenue growth this year would come from spending by SoftBank to use OpenAI's products across its companies. A deal the companies announced earlier this month. Then in addition to this, like, you know, they're now on pace to hit 12.7 billion this year. It says OpenAI expects revenue to hit 28 billion next year. So 2026 is looking at 28 billion with the majority of that coming from chat, GPT and then the rest through software developer tools and AI agents. But as you alluded to, the cash burn is massive. So it said OpenAI anticipates the amount of cash it is burning will grow at a similarly torrid rate. It expects cash burn to move from about 2 billion last year to nearly 7 billion this year. The company forecasted that its cash burn would grow in each of the next three years, peaking at about 20 billion in 2027 before OpenAI would turn profitable by the end of the decade after the build out of Stargate. So yeah, I mean they're just burning cash unlike any other and they, they need to like solve this fast. And they are definitely betting on that. When they build out all these data centers, they're going to follow these scaling laws and they're going to have an insanely valuable tool. We had talked on a recent episode about a $20,000 a month license for, you know, basically a human replacement agent. Some of the things we'll talk about in the next topic on AGI sort of starts to move more in this direction and I honestly, I'm not sure what the ceiling is on what you could charge for powerful AI. AGI, like whatever we want to call it. Like if you're, if you're building an. A system that basically functions like a whole organization, which is their level 5 AI. Like I'm. This is me making stuff up. Like level 5 on OpenAI's internal stages of AI is organization. Right. So they plan on building systems that function as companies. 20,000amonth may look cheap. Two years from now they may be charging a million a month, like, who knows? Because they think that their system is basically going to do the work of an entire organization with a couple people orchestrating maybe millions of agents, like, or an AI that orchestrates all the other AIs and the human oversees the master AI. Like that may sound sci fi, but that is absolutely what they're thinking is going to happen.
[11:20]
Mike Kaput
And this does relate to some of the top topics we've talked about in the past around like service as software. Because it's not like they're just going after the licensing fees of other tools though, they are a bit. It's more about the total addressable market represented by the actual labor costs of knowledge workers. We're talking, we spend trillions of dollars a year hiring people to do a lot of the jobs that it sounds like they expect. Their AI is something people would pay for to do the job instead of a human.
[11:53]
Paul Raitzer
Yeah, and that's, it's just the weird part is like we can't really project what this looks like, but we know it's significant. Michael Dell on April 1, you know, the founder of Dell Computer texted or tweeted, knowledge work drives a 20 to 30 trillion dollars global economy. With AI, we can increase productivity by 10 to 20% or more, unlocking 2 to 6 trillion in value every year. Getting there may take 400 billion to 1 trillion in investment. The return on this over time will be massive. So, yeah, I mean, the people who are closest to this stuff, whether it's, you know, Jensen Huang at Nvidia or Zuckerberg or Altman or Michael Dell or whomever, they're talking about what seems like some pretty crazy numbers, but to them it just sort of seems inevitable. And that's, I think what's going to come through as a theme of the next topic here today is like there's, there's a lot of people who are still trying to process what ChatGPT can do today, but the people who are on sort of the frontier are so far beyond that. And they are seeing a clear path to a very different world like two, three years from now. And to them it truly seems like inevitability. And it might be five years, it might be seven, but like it's coming one way or the other.
[13:12]
Mike Kaput
So let's talk about that. Our second big topic today is about a major new AGI forecast that is making some waves. So this is a new report called AI 2027 and it lays out one of the more dramatic timelines we've seen for AI. So this is primarily in the form of a website you can go to. We'll have the link in the show notes. It's a bit interactive in the sense as you scroll through it and scroll through their timeline, you'll see little widgets and visuals update as you go. It's really cool, it's worth visiting. But in it, the authors predict that by the end of 2027, AI will be better than humans at basically everything from coding to research to inventing even smarter versions of itself. And this whole website, this whole thought experiment they go through shows what the Runway looks like to this kind of intelligence takeoff. Now, this whole project comes from something called the AI Futures Project, which is led by a former OpenAI researcher named Daniel Cocotaldjo. And he actually left the company over safety concerns. He then teamed up with AI researcher Eli Lifland, who was also known as a highly accurate forecaster of current events. And together with their team, they turned hundreds of real world predictions about AI's progress to into this kind of science fiction style narrative on the website. And this is all grounded in what they believe will actually happen. So the vehicle by which they describe this is this fictional scenario which involves a fictional AI company building something called Agent 1, which is a model that quickly evolves into Agent 4, an autonomous system making a year's worth of breakthroughs every week. And by then, towards the end of the timeline, it's on the verge of going rogue. Now, along the way, they show how AI agents will start acting like junior employees by mid-2025. By late 2026, AI is replacing entry level coders and reshaping the job market. And in their forecasts, by 2027, we've got self improving AI researchers making weeks of progress in days. And China and the US are fully locked in an AI arms race. Now, there's plenty of critics of this high profile project. Some critics say it's much more fear mongering and almost like fantasy than forecasting. But the authors argue it's a serious attempt to prepare for what could happen if we do have this kind of fast takeoff of super intelligent AI. So in an interview, Coco Taljo actually said, quote, we predict that AIs will continue to improve to the point where they're fully autonomous agents that are better than humans at everything by the end of 2027 or so. Now, this also comes at the same time this past week, as we saw a couple other significant AGI pieces of news. One of them is that ExitNAI board member Helen Toner published an article on Substack, pointing out that all these predictions we're getting about the timelines for AGI are getting shorter and shorter. And she even writes, quote, if you want to argue that human level AI is extremely unlikely in the next 20 years, you certainly can, but you should treat that as a minority position where the burden of proof is on you. And then last but certainly not least, Google DeepMind actually came out with a vision for safely building AGI. In a new technical paper, the company literally says AGI could arrive within years and that they're taking steps to prepare. So they have this whole safety roadmap over dozens and dozens of pages that focus on what they say are the four big risks of AGI. There's first, misuse, which is a user instructing the system to cause harm. Second is mistakes, meaning an AI causes harm without realizing it. Third is structural risks, which means harms that come from a bunch of agents interacting where no single agent is at fault. And fourth is misalignment, when an AI system pursues a goal different from what humans intended. So Google says of this plan, this roadmap, this safety measures, quote, we're optimistic about AGI's potential. It has the power to transform our world, acting as a catalyst for progress in many areas of life. But it is essential with any technology this powerful that even a small possibility of harm must be taken seriously and prevented. All right, Paul, there's a lot to unpack here, but first up, what did you think of AI 2027? Like, the people behind it seem like they have some interesting backgrounds in AI. Did you find their predictions credible? Was the format of this fictionalized story, like, helpful? Harmful to getting your average person to actually care about this?
[18:13]
Paul Raitzer
Yeah, I, I mean, so my, my initial take is like a reader beware sort of warning on this. I, I, I, I honestly wouldn't recommend reading this to everybody. Like, I, I think that it could be very jarring and overwhelming, and it can definitely feed into the Non technical AI person's fears and, and maybe accelerate those fears. I think when you read stuff like this, whether it's situational awareness, you know, the, the paper series of papers from Liam Poldash and Brenner that we covered last year. Machines of. What was the, what was Dario Amades.
[18:51]
Mike Kaput
Of Machines of Loving Grace?
[18:54]
Paul Raitzer
Yeah, that one. The accelerationist manifesto from Andreessen. Like you, it, it, you have to have a lot of context when you read these things and you have to have a really strong understanding of who's writing them and their perspective on the world. And you have to appreciate that it's, it's just one perspective. Now they are certainly credentialed like they're, they have every, everything on their resume that would justify them taking this effort and writing this. And I think it needs to be paid attention to. And I think that in, I mean I got through the first probably 15, 20 pages of it and then started scanning the rest as it started going through these other different scenarios, but certainly enough to get the gist of what they were talking about and their, their perspectives. I did see Daniel, one of the, you know, the leads on this, he tweeted like challenge us. Like we're actually, they actually put bounties out to disprove them. They're like if you can come at us with a fact that that's counter factual to what we presented, we will pay you. So I, I don't know honestly that anything they put in there is actually that crazy. And that's why I'm saying like I, I, I just wouldn't recommend it because it's, it's just a lot to, to handle. So if, if you're, if you're at a point where you really want to know because like there was a key thing, the thing I would recommend is actually the Kevin Roose New York Times article is actually where I would start. Before you read the AI27, 2027 website, I actually read the Kevin Roose article. We'll put in the show notes because Kevin gives a very balanced take on this. And I thought one of the real key things is at one point Kevin said, so I'll actually, I'll jump to the Kevin article for a second. So he starts off, the year is 2027. Powerful artificial intelligence systems are becoming smarter than humans and are wrecking havoc on the global order. Chinese spies have stolen America's AI secrets and the White House is rushing to retaliate. Inside a leading AI lab, engineers are spooked to discover that their models are Starting to deceive them, raising the possibility that they'll go rogue. That is a, that is basically a summary of AI 2027 project. That is the scenario they're presenting. There isn't a single thing in that concept that couldn't happen by 2027. So that's what I'm saying. Like, I'm not disputing what they're saying. I'm just saying they're, they're taking an extreme position. But the, the key here is to understand who is writing this. So later in the article, Kevin says there's no question that some of the group's views are extreme. Mr. Coca Talio, how did, how did you say that? Okay, yeah, for example, told me last year that he believed there was a 70% chance that AI would destroy or catastrophically harm humanity. So he is. There's something called P Doom in the AI world. The probability of doom. Probability that AI wipes out humanity. And, and this is like a common question asked of leading AI researchers. What's your P dot? And there are some who, who are well above 50% that, that they are convinced that the super intelligence being built is going to wipe out humanity. There are others who think that is absurd, like a Yann Lecun who won't even probably answer the question of P Doom because they think it's so ludicrous. So you have to understand that there are different factions and each of these factions often has access to the same information, has, have worked in the same labs together on the same projects, seeing the models emerge and the capabilities. They have all seen the same stuff, but some of them then play this out as this is the end, like it's all gonna go awful here. But when you actually start getting into like these like fundamentals of like over dramatic dramatizing this, they actually kind of struggle to come back to reality and say, yeah, but what if it doesn't actually take off that fast? What if the Chinese spies don't get access to Agent 3, as they called it? Like, what if we. It's like chatgpt and like society just basically continues on with their life as though nothing happened. And a small collection of companies have this powerful AI that can do all these things and, and like the world just goes on. And that's a, that's honestly harder for them to fathom than this like, doom scenario. And so again, I just, I feel like it's, it's a good read if you're mentally in a place where you can consider the really dramatic dark side of where this goes pretty quickly. Understand it is all based on fact. There's nothing they're making up in there that isn't possible. It just doesn't mean it's probable. And and I still like think that we have more agency in how this all plays out than maybe some of these reports would make you think. But it takes people being kind of locked in and focused on the possibilities. There was one, the chief executive from the Allen Institute for AI AI Lab in Seattle who reviewed the AI 2027 paper and said he wasn't impressed at all by it like that. There was just nothing to it. So again, reader beware. If you want to go down that path, do it. If you want to get really technical, Dwarkesh actually has a podcast through our podcast with the authors.
[24:28]
Mike Kaput
Yep.
[24:28]
Paul Raitzer
And Dwarkesh we talked about before. We'll put the link in the show. Notes he does amazing interviews. They're very technical. So but again if you're into the technical side of this, have a field day. If you're not, read the Kevin Roose article and and move move on with your life basically is kind of my my notes here now on on the Hub but on the Google side the taking a responsible path to AGI also a massive paper like yeah you want to great notebook LM use case drop that thing in a notebook LM and have a conversation with it turn it into a podcast. But there is some interesting stuff within here. If you just read the the article about it that they published on the DeepMind website. They referenced the levels of AGI framework paper that I talked about in the Road to AGI series. They link to the new paper and approach to technical AGI safety and security. But then they also released a new course on AGI safety that I thought was interesting. I have not had a chance to go through it yet but it's looks like it's about a dozen or so short videos. They're like between four and nine minutes it looks like but they've got we are on a path to superhuman capabilities Risk from deliberate plan Deliberate planning and instrumental sub goals. Where can misaligned goals come from? Classification quiz for alignment failures like some interesting stuff interpretability like how to know what these models are doing. So again this is probably made for a more technical audience but it could be interesting for people if you want to understand kind of more in depth what's going on here. So big picture, I'm glad to see this sort of thing happening. Like this was my whole call to action with the AGI series is like we just need to talk more about it. We need more research, we need more work trying to project out what happens. I'm just more interested in like, okay, let's just go into like the legal profession or the healthcare world or the manufacturing world and let's play out more like maybe practical outcomes. And then what does that mean? Like, what happens to these fundamental things that we all are familiar with? Because if you take this stuff to a CEO, I, I, yeah, most CEOs are just still trying to grasp how to personally use Chat GPT and how to, like, empower their teams to figure this out. You start throwing this stuff in front of them and you're just going to have people pull back again. So I, yeah, important to talk about, but I, I just wouldn't let guide people don't get like, too consumed by this stuff.
[26:50]
Mike Kaput
Yeah. For instance, if you're, say, a marketing leader at a healthcare organization struggling to get approval for ChatGPT and get your team to build GPTs, this can send you into an existential.
[27:02]
Paul Raitzer
Yeah, you don't want to link to the AI 2027 board in your deck pitching this.
[27:07]
Mike Kaput
Yeah. All right, so our third big topic this week is that Amazon just entered the AI agent race with a new system called Nova Act. And this is a general purpose AI that can take control of a web browser and perform tasks on its own. So in its current form, this is fully a research preview. It's aimed at developers. It is bundled with a software development kit that lets them build AI agents that can, for example, book dinner reservations, order salads, or fill out web forms. So it's basically Amazon's answer to agent tools like OpenAI's Operator and Anthropic's computer use. But there's kind of one key advantage here that's worth talking about. It is being integrated into the upcoming Alexa upgrade, which potentially gives it massive reach. Now Nova act comes out of Amazon's new AGI lab in San Francisco, which we covered on a past episode, led by former OpenAI and adept execs David Luan and Peter Abbeel. And the lab's mission is to build AI systems that can perform any task a human can do on a computer. Nova act is the first public step in that direction. Amazon claims it already outperforms competitors on certain internal tests, but it hasn't been benchmarked against tougher public evaluations just yet. So, Paul, this is admittedly very early. It's a research preview, it's an agent, which we talk about all the time, is still a Technology that's really, really, really early. So it's not like tomorrow you're suddenly going to have Amazon's agent doing everything for you. But it does feel a little different and worth talking about than some of the other agent announcements because of Amazon's reach and how much it touches so many parts of consumer life. Like, do you think this could be the start of seeing agents really show up for your average person?
[29:14]
Paul Raitzer
Yeah, I mean, I. Generally speaking, we try to not cover like research previews too much. Like, we often will like give overviews of like, here's what's happening. But so often we've seen these things just don't really lead to much. But I think the key here is it's starting to change the conversation around Amazon and their AI ambitions. So, I mean, if you go through the first 130 episodes or so of this podcast, my guess is we talked about Amazon maybe like three or four times. Like it's just. And it's usually related to their investment in Anthropic. We talked about Rufus last year, which is their shopping assistant. So right within the app, you or website, you can just talk to Rufus. I'm going on a trip here. What should I be looking for? And it helps you buy things and they're using a language model underneath to do it. I think it's powered by Anthropic. Then we talked about Alexa Plus a couple weeks ago and now we're talking about not only Nova, but they also last Thursday announced this Buy for Me feature. And so I don't know. Mike, did you, did they say when this one's coming out? Do you remember seeing that?
[30:21]
Mike Kaput
I don't recall seeing the exact, an exact release date.
[30:25]
Paul Raitzer
Okay, yeah, we'll check. They put it out on their site and then TechCrunch covered it. But the basic premise here is Buy for Me uses encryption to securely insert your billing information to third party sites. So if you're searching for something and they don't have it on Amazon, their AI agent, kind of powered by this Nova concept, will actually go find it somewhere else on the web. It will buy it for you by entering your information into that site. And so it's different than OpenAI and Google's agents, which requires the human to actually put the credit card information in before a purchase happens. So if you say, hey, go find me a new backpack for a trip to Europe and The agents from OpenAI and Google go do it, when they get to the site, the human then has to do the thing. In this case, Amazon is Basically asking users to trust them and their privacy and their ability to securely protect your, your information to go ahead and fill this out. And they're trusting that you're not. That their agent is going to accidentally buy a thousand pairs of something instead of one pair of something. Right. So I think that what we're seeing is how Amazon is maybe going to start to play this out. And I think we talked on a recent episode that they're probably building their own models as well, in addition, you know, continuing to invest more heavily in building their own models. So I don't know, like, I think more than anything it's probably starting to move Amazon up in the conversation to where I'm starting to see we may be talking about Amazon a lot more than we used to talk about them. Yeah, because it really previously was robotics, their investments in AI. And then, you know, I always talk about Amazon as it's one of the like OG examples of AI in business was the prediction around like their recommendation engine, their shopping cart where they would predict things to buy. That was like old school AI and they'd been doing it as, as well as anybody for like 15 years. So they weren't new to AI, they just got sideswiped by generative AI. They were like, they had nothing. They, you know, they had Alexa, but it was not, not anything close to what needed to happen. And, and now here we are like two and a half years later, whatever, and they're still trying to play catch up now on a thing they, they should have been leading on. But you know, all of them missed it. Apple missed it, Google mixed it, missed it, Amazon missed it. So yeah, I, I just, I don't know. It's, it's interesting. I expect we'll hear more out of this, this lab. Um, but I think we'll probably also see it built out into their products pretty quickly.
[32:51]
Mike Kaput
And just a note here. According to the Amazon announcement, Buy for Me is currently live in the Amazon Shopping app on both iOS and Android for a subset of US customers. So they are beginning testing with a limited number of brand stores and products with plans to roll out some more customers and incorporate more stores and products based on feedback. So if you have access to this and you're brave enough, maybe you can go give it a, give it a try.
[33:19]
Paul Raitzer
But, but don't expect the same sort of ease of returns as buying from Amazon because they did note that you're, they don't handle the returns the way they normally do. If you bought it from a site you're, you are responsible for returning to that site. So I don't know what, how are you feeling about this stuff? Like, would you use like a buy for me or like you're more aggressive with using agents than I am?
[33:39]
Mike Kaput
I don't know if I have a personal worry about something going wrong or privacy that I couldn't reverse or that wouldn't really matter that much to me, but it does just seem like a hassle for me.
[33:52]
Paul Raitzer
I, I think I just know how unreliable AI agents are today, despite how they're being marketed, that I think I'm just, I'm, I'm letting, I'm willing to let everybody else work out the kinks. Like, I don't find that convenience un. Worth enough of the risk of this going wrong. Exactly. I'm kind of good with just filling out my own form and like going to the other site and, you know, paying for it there and knowing the terms of use and the return policy. And so I don't know, I'm a little more conservative when it comes to like, pushing the limits of AI agents today for sure.
[34:25]
Mike Kaput
All right, let's dive into this week's Rapid fire. Our first Rapid Fire topic is that OpenAI is finally releasing a new open weight language model. This is the first they've done since GPT2. So in a post on X, Sam Altman said the company has been sitting on this idea for a long time, but quote, now it feels important to do. This model will launch in the coming months with a strong focus on reasoning ability and wide usability. So it's important to note here, this is an open weight model and you kind of see confusion of these terms. A lot of people say, oh, okay, that's open source. Well, technically, not exactly, because open weight means the model's weights, which are the numerical parameters learned during training, are made publicly accessible. So the weights define how the model uses input data to produce outputs. However, an open weight model won't give you all the source code, training data, or architecture details of the model like a fully open source one would. So you can still like host and run this type of model at your company, train on your own data, which is what OpenAI is hoping people will do. But it's not exactly fully open source, which is not uncommon to see. Now, before launch, says Altman, the model will go through its full preparedness evaluation to account for the fact that open models can be modified or misused after release. And OpenAI is hosting developer feedback sessions starting in San Francisco and expanding to Europe and Asia and the Pacific to help make sure the model is useful.
[35:59]
Paul Raitzer
Out of the box.
[36:01]
Mike Kaput
So Paul, how significant do you see it being that OpenAI is at least dipping its toe back into the waters of open models?
[36:10]
Paul Raitzer
Yeah, I mean maybe the biggest play here is that Elon Musk won't be able to call them closed AI anymore. That's one of Elon's beefs, is that they were created to be open and then they weren't. And so maybe this is the counterbalance to that argument. I mean it's a strategy I would expect all the labs to do. So obviously Meta's main play has been to release powerful open source models or open weight models. Google, DeepMind, Demis Asabas has said this is their strategy basically that they will release the prior generation as open weight. So they build, let's say Gemini 2.5 is the model today. A year from now, let's say it's Gemini 4, whatever. Then they would probably then open source Gemini 2.5. So like they take the current frontier model that is like the paid for featured model that they don't consider safe yet to release, they build the next generation of models and then once they determine that the prior generation model is safe. Plus the key is compute costs are dropping roughly 10x every year. So now the cost to deliver that open weight model, open source model is dramatically less and now they can go and do it. So it's just like this. I think this is a logical strategy for Google and OpenAI and some of these other labs that you, you just kind of release these prior generation models and you hope that enterprises want to still pay for the most powerful thing because it has more reasoning, because it has better image generation, better, better video generation, all of those things. And the open source becomes more for the developer world to build on top of next up.
[37:49]
Mike Kaput
A new study has reignited some debate over whether AI can pass the Turing test. So the Turing test is a famous test thought up way back in 1950 by computer scientist Alan Turing that is designed to evaluate a machine's ability to exhibit human like intelligence. Now it basically consists of a human judge communicating with both a human and an AI through text based questions and trying to guess which is which. If the judge can't tell which is a human and which is AI, the AI is thought to have, quote unquote passed this test and displayed intelligence that's indistinguishable from a real human. So in this new research, the researchers ran two large scale experiments where people chatted simultaneously with a human and AI for five minutes, then guessed which was which. And the standout was GPT4.5. When it was prompted to act like a shy, slang using young person, it was picked as a human 73% of the time, more often than the actual humans it was up against. Meta's llama model also fooled participants more often than not, while older systems like GPT4O crazy to even consider that an older system were reliably flagged as bots. Interestingly here, the difference that really made all the difference was prompting. Without specific instructions on how to act human, the AI win rates dropped. But even then, some did still match human performance. So this is. Paul, you know, definitely interesting because Turing Test is this kind of legendary thing in AI. We obviously always have to take any claims about all this with a grain of salt. The researchers themselves admit that there's still a lot that's unclear about what this could actually mean and how much it matters in terms of making a judgment call about the level of intelligence being exhibited here. But I think in a practical sense, it is really striking that we have some good evidence now that today's AI prompted in the right way can be basically indistinguishable from a human in certain types of conversations.
[40:06]
Paul Raitzer
Yeah, and I, I think that the whole part about prompting it to act like a human, like, that's not hard. I mean, you can make that, that instruction choice in like the system prompt. You could have a company, it could be a startup that builds on top of an open source model that chooses to make a very human, like, chatbot, and out of the box, the thing feels more human than human. Um, we've talked about on the, on the show many times about like, empathy, and it's sort of, I used to think a uniquely human trait that I'm convinced is not anymore, or at least the ability to simulate empathy. And so you can teach these models or you can tell your model, like, you could go in and build a custom GPT and say, I want you to just be empathetic. Like, I, I just need someone to talk to who understands how hard it is to be an entrepreneur. And like, I just want you to be, you know, I, I just wanted you to listen and, and help me, you know, find my way through this. And it will do it, like, better than many humans would do it. And that's just a weird place to be in. So I, I mean, this constant, like, the. Do we pass the Turing Test? Like, I feel like the Turing Test sort of had its day in like, you know, maybe we probably got past it. In, in, like, certainly when ChatGPT came out. I think we're just now trying to find, trying to find ways to run the test to like, officially say we've now passed it. It's like, I, I, I don't even know that it's worth talking about continuing the research. Like, we're there, like, right. People are convinced these things are more human than human in many cases, especially if they're prompted to be that way. And I think that when it comes to different parts of, you know, psychology and therapy and things like that, like, that's how these things are being made already. Like, people are using them as therapists. And I'm not commenting on whether that's good or bad for society. I'm just telling you that's what's happening. And the VC firms are funding the companies to do that because they're so good at it. And that's the current generation. And, you know, it's not far behind where the voice comes along with it too. And now you truly just feel like you're talking to a therapist or an advisor or a consultant, and their, their system prompt tells them to be very, you know, supportive and empathetic and, and honestly, like, at some point you just, you're gonna just prefer to talk to the AI. I, I, I, I do think a lot of people are going to arrive at a point where they just prefer talking to the AI about these, these things, like the hard topics that awkward to talk to people about, like, it's not awkward to talk to your AI. Um, and I, I think a lot of society is actually going to come around to that pretty quick. It may end up being like, there was some data this week about how low adoption actually is to like, the vast majority of society. I could see like the empathetic chatbot with, with a human like Voice being like the entry point for a lot of people. And that's why I mentioned that in the road to AGI, like, I thought voice was going to become like a dominant interface. And I think it could be a gateway to generative AI for a lot of people who maybe are sitting on the sidelines still.
[43:10]
Mike Kaput
Yeah, it's almost like throw out the Turing Test and look at today, all the millions of people that use character AI for relationships or therapy that tells you everything you need to know.
[43:21]
Paul Raitzer
Yeah, it goes back to like, the, when we've talked about the evals, like these labs run all these like, really sophisticated evaluations to figure out how smart these models really are. And my feeling is like that's Great. And I, I get that the technical AI people want to do that. What I want to know is like, can it, how does it work as a marketer, how does it work as a psychologist, as a physician? Like, I want evals that are like, tied to real life. And I think that's the same thing as you're alluding to. It's like, we need to be practical.
[43:48]
Mike Kaput
Our next topic is about Anthropic. Anthropic has just launched CLAUDE for Education, which is a new version of its AI tailored specifically for colleges and universities. So the centerpiece of CLAUDE for Education is a new learning mode that prioritizes critical thinking over quick answers. Instead of solving problems for students, Claude gives them guidance using these, like, Socratic methods. So by asking questions like what evidence supports your conclusion? CLAUDE is going campus wide as part of this initiative at Northeastern University, LSE and Champlain College, giving every student and faculty member access to Claude at Northeastern alone that it's 50,000 users across 13 campuses. They're also focused on a campus ambassador program, giving free API credits to student builders and partnerships with Internet2 and canvas maker Instructure to weave Claude into existing academic platform. So, Paul, this definitely doesn't just seem like a press release. This is a pretty comprehensive initiative in education. You talk to tons of schools about the need for AI literacy. What do you think of how Anthropic has gone about this?
[45:07]
Paul Raitzer
Yeah, I think it's, it's great to see. I, OpenAI did something similar with their academy. They just announced last week they have like a AI for K to 12 where they're trying to get into like the education. And I, I, I don't think they had a higher ed one yet. OpenAI also announced, you know, not to be outdone, they love to steal the headlines from everybody else. I think they tweeted it was over the weekend, I believe or no. What day is today? Friday. So it was like Wednesday or Thursday that they're now giving like chat GBT free to college students I think for the next two months, something like that. So I think everybody's playing this space. I, I, I, I don't know, like, it's so disruptive and I don't know that, you know, schools are still grasping. I have seen some really impressive stuff. Like, I've seen some, some high schools, I've seen some universities that are being very proactive. But like, I, I don't, I don't think I shared this example on the podcast last week. But like, I was, I was, I was Home with my kids the other day my wife wasn't, wasn't here and my daughter was 13, seventh grade, doing like advanced pre algebra or something. She's like, I need help on math homework. I was like, that's a mommy thing. Like I not, I'm not the math guy. When you get into like the language, like let me know and we'll talk. She goes, no, mommy's not here, I need help. And so it was a math problem I have no idea how to solve. And so I pulled up the, you know, go into chat GPT hit my, you know, the camera open. I don't know what they call that, what do they call that? Is it live or I don't know.
[46:32]
Mike Kaput
When you're live showing it.
[46:34]
Paul Raitzer
Yeah, I just like turned on the camera and it could see what I was seeing. Yeah, I know project Astra for Google. But I don't know what they actually call it in OpenAI. But if you don't know what I'm talking about, just go into the voice mode and then in voice mode there's a camera. Click that and it now sees what you see. And so I held it over the math problem and I said, I'm working with my 13 year old, do not give us the answer. We need to understand how to solve this problem. And it's like, great, okay, let's go through step one. And it actually like would read it and then say, okay, do you understand how to do. And it, it like walked us through and then she was writing on paper the formula and like going through and doing what it was saying. And so I held the phone over what she was writing and said, you're doing great now when you get to this point, you know. And then it would ask her another question and then she would answer. So now she's interacting with the AI and we walk through the five steps of the problem with her actually doing it and being guided how to do it, not being given the answer. And to me that's just like so representative of where this can go if it's taught responsibly. If kids just have ChatGPT and they just go say, hey, give me the answer to this question, then we lose. So I think that having Anthropic and Google and OpenAI and others be proactive in building for education and building in a responsible way for education is a really good thing and we should support that and encourage more of that.
[48:00]
Mike Kaput
Yeah, it's really cool to see. Next up, the Tony Blair Institute out of the UK has released a sweeping New report calling for a, quote, reboot of UK copyright law in the age of AI. And their recommendations are already drawing some fire. One of the big reasons is because the report endorses a text and data mining exception to copyright law that would allow AI companies to train models on publicly available content unless rights holders explicitly opt out. It argues this opt out model would balance innovation and creator control. But longtime AI copyright commentator Ed Newton Rex, we've talked about a bunch on the podcast called this report basically quote, terrible and quote, a big tech lobbying document. He says UK copyright already gives creators control over how their work is used and that shifting to an opt out regime would reduce that control more sharply. He accuses the authors of misleading rhetoric, likening their claim, their arguments to claiming that using someone's AI art for training is no different from a human being inspired by it. So he basically says under this kind of scheme, creators would lose their rights, the public would put the bill, and AI firms would keep training on others work for free. Now, Paul, this is obviously UK specific, but we wanted to talk about it in the wider context of the copyright topics we covered last week. Artists and authors in many areas are up in arms about how AI models are being trained on their work without their permission. This certainly seems like some parties, whether they're actually lobbying for AI labs or not, are trying to make the argument that AI companies should be allowed to train on publicly available content, that we should exempt this from copyright. What do you think of this approach? And should we, should we expect to see more arguments like this in the us?
[49:58]
Paul Raitzer
I mean, these AI companies have a lot of money for lobbying efforts and I think at the end of the day, those lobbying efforts win. I, I think the opt out thing's a joke. I, I've always just felt that that was an absurd solution. It was just like an obvious thing to present. But like, I mean, if you're a creator in any way, you know how prevalent it is for people to steal your stuff. Like any, anything we've ever created behind a paywall, I guarantee you someone has stolen 10 times over and published it in different places of sites I would never like click through and download something from. But like, you know, whether it's, it's movies or courses or books or whatever, it gets stolen all the time and it's a game of whack a mole to try and keep up with it. Like we have an internal system to track all the stuff people steal from us and what, what can we do about it? Pay our attorneys every time we find it, and that's easy to find. Like you could just keyword search the thing and you can find the people stealing your stuff.
[50:56]
Mike Kaput
Damn.
[50:57]
Paul Raitzer
How in the world are we supposed to ever know unless someone leaks the training data, whether or not they stole it or not? I saw something last night that was like, they had proof now that one of the major model companies, who I won't throw under the bus right now, absolutely stole stuff from behind a paywall of a major publisher and they could prove it. So I just feel like, I don't know, the copyright thing is so frustrating to me because I have yet to hear of any sort of like, reasonable plan for how you acknowledge and compensate creators whose work made these models possible. And even if they come up with the plan, how do we know, like, how, how will we ever do it other than being able to audit the system and find out what the actual training data was or someone suing them? And then seven years later it's like, okay, yeah, sorry, your seven books were used in the training of the model. Here's your $15. Like, I don't know, I, I don't have a solution. But it's very frustrating that nobody seems to have a plan for how to do this. It's just like, yeah, we should probably pay them, but first we have to admit we stole it. But we can't admit we stole it because we're going to claim it's fair use and then eventually we'll like, have to pay a fine and maybe there'll be some class action lawsuit and we'll pay a billion dollars and that billion dollars will get spread across 200 million creators. And you know, here's your fifty dollar check. Like, I don't know, like, I, I, I hope someone much smarter than me in this area eventually comes up with a plan and the model companies agree to, to, to, to do something to compensate people for their work.
[52:27]
Mike Kaput
And in the meantime, like we talked about last week, expect the backlash to continue.
[52:32]
Paul Raitzer
Yeah. And it is growing. Yeah, for sure.
[52:36]
Mike Kaput
Our next rapid fire topic, Google DeepMind, has hit a new milestone in AI because it taught AI to find diamonds in Minecraft without any human guidance. Now, this breakthrough comes from a system called Dreamer, which mastered the game's notoriously complex diamond quest purely through reinforcement learning. So that means it wasn't trained on videos or handholding instructions and explored, experimented, failed, and learned. Now, if you're unfamiliar with Minecraft, doing this task, finding diamonds is not easy. It requires building tools in sequence, exploring unknown terrain, and navigating a world that is different every time. So what makes Dreamers special is in how it learns this stuff. Instead of brute forcing every option, it may build a mental model of the world and simulates future scenarios before acting much like how a human might visualize possible outcomes. That world model lets it plan more efficiently, reducing trial and error while still enabling reality discovery. Interestingly, Dreamer wasn't even designed for Minecraft. This diamond challenge was just a stress test. But the fact that it passed without ever seeing human gameplay shows really interesting progress toward general purpose AI. So Paul, this is obviously not just us being fans of Minecraft over here. One of the researchers involved in the work said why this matters. Quote, Dreamer marks a significant step towards general AI systems. It allows AI to understand its physical environment and also to self improve over time without a human having to tell it exactly what to do. That is a much bigger deal than Minecraft.
[54:23]
Paul Raitzer
It sounds like. Yeah, and this, I mean this is very similar in, in terms of past research that like, you know, Google has done where like they had AlphaGo learning the game of Go, but then they build AlphaZero that could basically learn from the ground up. And, and Google Divine has been doing this stuff since like the early teens. Yeah, and this is why like I often come back to like I, I don't, I just don't know how you bet against Google. Like, I don't think people realize the amount of breakthroughs that they have had and the knowledge and capabilities that they're sitting on that aren't in these models yet. And when you can start introducing this kind of capability, even if it's just an internal model that they don't release, it's kind of hard to process. So I think there's the, this is a significant line of research. The ability for these things to sort of learn and pursue goals on their own is it matters. I ironically have been listening over the last few days to a podcast, big technology podcast with the Roblox CEO David Baszucki. And and so I, in my head I have this because my kids play Roblox and Minecraft and I, I know that to them the process of doing these things is the point. So like in, in Minecraft you build block by block. It is repetitive, it is mind numbing, but they love it and they create insane things. Like my daughter has showed me, like castles she's built. And I'm like, how long did you work on this? Like, this is amazing. And like you did this with blocks? Like, it doesn't even make sense to me. And it might be something she spent, like, 20 hours on over, like, months where or maybe more. And that is the point. Now, if you can go in and just say, like, build me a fantasy castle. And, like, and I'll. Now you have the same beautiful castle, but zero effort from the human to do it, other than, like, I'm envisioning a castle here and I want to moat there, and now I want a dragon. That's the world the CEO Roblox is presenting, that they're enabling. You're going to be able to just go into roadblocks and, like, just text the characters you want and the scenes you want and eventually entire games. And so this line of research also just, like, I don't know, concern is the right word. There's parts of it that just make me sad because I. I feel like so much of what makes games so fascinating that I loved them as a kid, my kids love them now, is the repetitive nature of doing something yourself and, like, figuring it out and finding a solution and finding diamonds. Like, instead of going and say, hey, find me 50 diamonds. Then you sit back and, like, sip your Coca Cola while you're, like, waiting for the. I don't know. So it just continues on, this whole, like, creator thing. Like, when the AI can create, like, where's the human element? Where is the AI element? And again, I don't. I don't know. I just. I find myself thinking about this stuff a lot. And as these things get better and I see image generation, I watch VO2 from Google DeepMind. Like, I see the Runway stuff. We'll talk about. Like, I just have. I continue to really struggle to envision, like, the next few years and what it means to creators and creativity.
[57:30]
Mike Kaput
Well, it is so cool to be able to summon these kind of pieces of art or creativity out of thin air. But then you wonder what's lost that the artist learned in the process of learning how to create that thing. Right?
[57:42]
Paul Raitzer
Yeah. I got home last night from a trip, and my son couldn't stop talking about this thing. He was coding in class. Now he's in sixth grade, and they were doing this in design class, and he's taking, like, a couple of code camps, and he has way more knowledge of coding than I do at this point. But, like, to listen to him explain it and, like, then this morning, he gets up and he's like, can I show you? Can I show you? Can I show you? And he's, like, showing me these, like, sprites he built for this game. And then, like, this whole thing he coded where, like, These little monsters show up. I don't, I don't even understand it, how he did it. But like, that's, that's the joy of creation is like, he learned how to do it. He didn't just give a text prompt and like created the monsters. Oh, great, great game. He wouldn't have the same passion for it, he wouldn't have the same fulfillment from it. He wouldn't have the same inspiration to learn how to do more code. And that is why I think about this all the time. It's like, I just, I don't know. Like, I don't, I don't know what it means for them in two years, five years, you know, by the time they get out into the professional world, nine years, 10 years, like, so weird.
[58:41]
Mike Kaput
Our next rapid fire topic concerns something called Model Context Protocol, or mcp. So in November of last year, Anthropic announced it was open sourcing Model Context. The Model Context Protocol, mcp. They define this as, quote, a new standard for connecting AI assistants to the systems where data lives, including content repositories, business tools, and development environments. Now, in recent months, talk about MCP has been gaining traction. It's happening more and more in AI circles. So we at least wanted to introduce the concept and talk through it a little bit. One way to think of MCP is like a USB C connector, but for AI data access. So today's AI assistants are smart, but they're often stuck in silos. They don't know what's in your files, your code base, your company wiki, unless someone builds a custom integration to access those data sources. MCP is kind of trying to change that by creating a universal standard for connecting AI models to external tools that might be Google Drive, Slack, GitHub, or Postgres. So no more one off connectors. Basically just a way to plug in and go. Now, because of that, MCP is gaining a bunch of traction. It has support from both OpenAI and Microsoft. It's open source, so hundreds of connectors are already live. And basically the idea behind all this is simple. Give AI systems a consistent way to fetch fresh, relevant context from all these different sources. So it's still really early days for this, but some people think the potential for MCP is huge and that it could really enable AI assistants to use your actual knowledge and other data sources to do even better work. So Paul, why is this getting so much attention in certain AI circles?
[60:39]
Paul Raitzer
Dude, I tried to avoid talking about this topic for weeks. I don't know, like three or four weeks ago. This like, took over my Twitter feed with all these AI people and I was like, man, sounds important. But God, it I, it's hurt my brain to like, think about it. So I just kept leaving off of the list and I finally told my class, like, all right, man, we, we finally gotta like, just talk about this. I still honestly, like, I, I'm. This is an abstract one for me. Like, usually there's AI topics that just like my, my brain generally does pretty good job of like, understanding the context. This is one I struggle with still, to be honest with you.
[61:16]
Mike Kaput
Damn.
[61:17]
Paul Raitzer
So ironically, last night laying in bed and I'm checking LinkedIn and Dharma Shah, my friend and founder and CTO at HubSpot, he put on LinkedIn. So I'm just going to read this because this is, it'll do better than I think I would do. Trying to add context. He said, Someday soon each of us will have our MCP moment. It won't be quite as powerful as the chatgpt moment we had, but it will open our eyes to what's possible now. For example, right now I have Claude Desktop configured to interact with several MCP servers from different companies. This configuration gives the learning the large language model hundreds of tools that it can decide to use based on what I enter for a prompt. I can have the large language model use agents on agent AI, which Dharmesh created, access CRM data in HubSpot, read write to a specific directory in my local file system and read write messages to Slack Access. My Google Calendar and Gmail. Possibilities are endless. The beauty of MCP is that it's an open standard that defines how MCP clients in this case CLAUDE can talk to arbitrary servers that provide lots of different kinds of capabilities. They don't need to be custom coded to talk to certain APIs or servers. He says. Here's an example. Prompt, look, quote, look up OpenAI in the HubSpot CRM and Slack the details to at Dharmesh, including how long ago I had the last interaction. Then he says, I could have done something much more complicated and had a dozen different systems. But you get the idea. Once you see it work, it will be magical. The setup is a bit tricky now, but that'll get easier real soon. My guess is when OpenAI adds support for MCP to ChatGPT, things will be smoother.
[62:55]
Mike Kaput
Yeah.
[62:55]
Paul Raitzer
So yeah, I think it again, it fits in the context of my guess is like three months from now this we talk about this again on an episode and now it's much more tangible and the average person's able to do something who isn't, you know, Dharmesh, the CTO of HubSpot. Um, I think it's a very technical thing right now. I don't, I don't think that the average, maybe listener to our show who isn't, you know, consider themselves technical AI leader is probably going to be doing anything with this, but it seems like it's a conversation that's going to start coming up within your company if you're working with it and starting to do more advanced things with your language models.
[63:30]
Mike Kaput
All right, Paul, I'm going to go through some AI product and funding updates real quick and then we're going to wrap up with our listener Question question segment. So couple product and funding update announcements. First up, OpenAI is rolling out its new internal knowledge feature for ChatGPT team users. You may have seen a notification about this in your account, with Enterprise Access coming later this summer. So this update allows ChatGPT to access and retrieve relevant information from Google Drive Docs like docs, slides, PDFs, Word files to answer user queries using internal company data. So admins can enable this feature through either a lightweight self service setup or a more robust admin managed configuration that syncs access organization wide. Next up, Replit, the coding startup known for its kind of vibe coding ethos, is reportedly in talks to raise $200 million in fresh funding at a3.3 billion valuation, which is nearly triple its last known valuation. Their recent momentum comes from its full stacked AI agent, which was launched last fall, and that can not only write code but deploy software end to end. So this kind of puts it in the same category as GitHub, copilot or cursor, but with a deeper focus on autonomous agents. We talked about the other week, CEO Amjad Massad has gone as far as to say you no longer need to code in a world where you can simply describe the app that you want. Runway, one of the pioneers of AI generated video, just raised 308 million in funding, more than doubling its valuation to over 3 billion. Now they have an interesting creative ambition. Over at Runway, CEO Chris Valenzuela wants to shrink the filmmaking timeline, turning AI into a kind of digital film crew. He envisions kind of the future pace of film production to something like Saturday Night Live, where you turn ideas into a full production within a single week. They're already working with major studios like Lionsgate as well as Amazon. Now they have backing from General Atlantic, Softbank and Nvidia, betting that all this AI video stuff is not just a gimmick, it may be the future of content creation and filmmaking. And then last up, Sesame AI, the voice focused startup founded by Oculus Co creator Brendan Iribe, is reportedly finalizing a 200 million funding round led by Sequoia and Spark Capital that values the company at over a billion dollars. Now, Sesame only emerged from stealth in February, but it has quickly gained traction for its really lifelike voice assistant since they've been backed previously by Andreessen Horowitz and are entering a heating up AI voice market alongside companies like Elevenlabs and major model companies like OpenAI that have voice capabilities.
[66:32]
Paul Raitzer
In addition to the Runway funding, they Also on Monday, March 31st announced Gen4, which is their new series of state of art AI models for media generation and world consistency they said is a significant step forward for fidelity, dynamic motion and controllability. They also rolled out an image to video capability to all paid and enterprise customers. They say that Gen 4 is a new standard video generation marked by improvements over Gen 3 Alpha. Yeah, so like I think I have like a thousand credits in Runway. I don't know if they fire, but I've been paying for a Runway license for like three years.
[67:13]
Mike Kaput
Yeah.
[67:13]
Paul Raitzer
And I think I've generated a grand total of like five videos in there. I should probably go in and see if I have any credits I can, I can use for this one. So yeah, Runway is again a major player, but it's getting really, really competitive. They're going to have some major challenges ahead. There was another one Higs Field AI, I think it was, was tweeting all week long, sort of like sub tweeting Runway that they've made some improvements. So I. The video space is going to be wildly competitive this year and it'll be interest to see if Runway, you know, sticks it out. They were definitely there early, but it's gotten very competitive.
[67:46]
Mike Kaput
Yeah. And that Hollywood angle will be interesting to see how much they actually go down the the road of using these tools in lieu of kind of regular film production.
[67:55]
Paul Raitzer
Well, and I think James Cameron, Titanic. Titanic fame. He's a major investor now in Stability.
[68:01]
Mike Kaput
Stability.
[68:02]
Paul Raitzer
Y. Yeah. So there I'm sure going to be trying to push that as well.
[68:08]
Mike Kaput
Okay. Our last segment is a recurring one that we are getting lots of positive feedback on, which is listener questions. So we take questions from podcast listeners, also audience members across our other various courses, webinars, et cetera, and we try to pick out ones that are relevant and useful to answer for the audience. And this one is particularly important this week Given our topics, the question, Paul, is how do you prepare for AGI? Short of having serious discussion of a meaningful ubi, Universal Basic Income, basically giving people money when nobody has a job due to AGI or a new economic system, how do you actually prepare? I thought that last part was important here because it's like, okay, what do we actually start thinking about and doing about this? Right?
[68:56]
Paul Raitzer
Oh, it was the most loaded question we could have possibly picked. This is like a full episode.
[69:00]
Mike Kaput
This one's.
[69:01]
Paul Raitzer
Yeah, yeah, I, I mean, so UBI is the lazy person's answer to this. It's what everybody's, you know, kind of throws out there with no actual plan of how that would work. Some people refer back to like, the pandemic and how the government just sent some checks and people, you know, spent the money, whatever. Like, there's just no precedent for it, honestly. And there's, you know, OpenAI or Sam Altman led a UBI study for like seven years where they gave people like a couple thousand dollars a month. And I, there's no way to, to possibly project this out. Like, if UBI was even a possible solution, what's the psychological impact of that? Right? It's like, okay, great, you're, you're, I don't have to make pay my mortgage anymore, and you're giving me $10,000 a month for everybody, you know, in the country or whatever. But, like, you have no job or meaning in your life anymore. You're just going to collect a check and just do whatever you want. It's like, okay, well, we got some problems psychologically as a society. So I just feel like any time that UBI is thrown out as, like, well, we could just do ubi, it's like, okay, let's, now let's play the domino effect here. Let's go 10 layers deeper of what does that mean if you do UBI in a country.
[70:06]
Mike Kaput
Right?
[70:06]
Paul Raitzer
So I have no idea. Like, I, I, I don't like right now. My approach to how to prepare for AGI is to stay informed. It's to try and project out the improvements in the models. It's to read the reports of other people who are trying to look to the future like we talked about in today's episode. It's, I would say I'm, I'm very much taking the information gathering and processing approach to try and understand it. And my hope is that by being on the frontier of understanding it, we have the best chance of figuring out what to do about it. Do I have confidence the labs are going to be super helpful in this process. Not really. I think that they're mainly just going to build the tech and let us figure it out. Do I think the government's going to figure it out? Uh, no. I, I don't have great confidence the government's gonna figure it out. Um, so I, I honestly don't know. I wish I could give people some like really comforting answer to this question, but my only answer is we have no idea. And the thing you can do is focus on the next step you can take to educate yourself and to be in a position to make informed decisions when the time comes. Because otherwise it's really, really hard to like play this out without getting overwhelmed by it. So I generally just process the information and then I, I say, okay, tomorrow though, what can I do about this? And I try and stay very focused on an understanding of the long term, but an action oriented short term of just taking the next logical step.
[71:39]
Mike Kaput
Well, give yourself a little credit. I know you said you didn't have an answer, but it's a pretty good answer.
[71:46]
Paul Raitzer
Isn'T the answer? That's the one.
[71:49]
Mike Kaput
All right, Paul, that's another wrapped packed week in AI. Thank you so much as always for breaking everything down in ways we can all understand. Just a quick reminder for folks if you haven't checked out the Marketing AI Institute newsletter, it rounds up all of this week's news, including the stuff we weren't able to cover on this episode. So go to marketingai institute.com newsletter and we will be seeing you next week I believe. Paul, thanks again.
[72:17]
Paul Raitzer
Yeah, and keep an eye out for those announcements from Microsoft and Google. And if Microsoft and Google are announcing something, assume OpenAI is going to try and steal the show. So I would expect we're in for a wild seven days in the world of AI. April tends to be a very, very busy time in the model company world, so buckle up for a crazy spring. Thanks for listening to the AI show. Visit marketingaiinstitute.com to continue your AI learning journey and join more than 60,000 professionals and business leaders who have subscribed to the weekly newsletter, downloaded the AI blueprints, attended virtual and in person events, taken our online AI courses and engaged in the Slack community. Until next time, stay curious and explore AI.
[73:09]
Jim Stroud
How much do you understand the future of finance? I'm Jim roos, a top 10 banking influencer and host of the podcast Banking Transformed where we dive deeply into the rapidly evolving world of banking and financial technology. Join me as I interview industry experts, thought leaders and innovators as they unravel the latest banking trends. Disruptions and game changing technologies reshaping the world of finance redefine your understanding of the banking ecosystem. Subscribe now to Banking Transformed, available wherever you get your podcasts and now available on YouTube. The Jim Stroud Podcast explores the discoveries and trends forming the future of our lives. Brain to brain communication, robot bosses, microchip implants for workers, and artificial intelligence replacing human workers are all happening now. If you want to know what's happening next, subscribe now to the Jim Stroud Podcast.