
Loading summary
Paul Raitzer
It doesn't matter when AGI arrives, if it arrives, what we call it, it doesn't matter like what this expert says versus this expert. All that matters is what you can control, which is get better at this stuff every day. You know, improve your own comprehension and competency because that is the best chance you have to be very valuable today and even more valuable tomorrow. Welcome to the Artificial Intelligence show, the podcast that helps your business grow smarter by making AI approachable and actionable. My name is Paul Raitzer. I'm the founder and CEO SmartRx and marketing AI institute and I'm your host. Each week I'm joined by my co host and Marketing AI Institute Chief Content Officer Mike Kaput, as we break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career. Join us as we accelerate AI literacy for all. Welcome to episode 152 of the Artificial Intelligence Show. I'm your host Paul Raitzer along with my co host Mike Caput. We are recording this on Monday, June 9th at around 9:00am Eastern Time. There is it was crazy like last week wasn't nuts in terms of launches and like product news, Mike, but lots of just like intriguing topics to dig into for sure. It's kind of nice actually to have a little reprieve from the product launches to like talk about some of the bigger issues that are going on. So we'll have some product news, but we're actually going to get into just some bigger ideas like around AI human relationships, continuing the conversation on impact on jobs and then a host of other interesting topics for the week. So this episode is brought to us by Macon, our marketing AI conference. This is the 6th annual Macon is happening in Cleveland October 14th to the 16th. This year we've got dozens of breakout and main stage sessions as well as four incredible hands on workshops. Those are optional. So October 14th, 14th is workshop day. You can come in to Cleveland early and take part in a workshop. I'm teaching one, Mike's teaching one. And then we have two other amazing presenters and sessions you can check out. So you can go to Macon AI that's M A I C O N AI and take a look at the speaker lineup and agenda. I'm still filling out the the keynotes, the main stage featured talks, but a good portion of the agenda is up and you can take a look at that. Prices go up at the end of June, so get in early and we would love to have you join us in Cleveland. That is our home Base, that's where the headquarters is. So we have run it in Cleveland every year and we're planning to keep it there. So hope you can join us again. Check out Macon AI and this is also brought to us by two of our upcoming webinars. So as part of our AI Literacy project, we offer a collection of free resources and learning experiences. We have two coming up in June that you can check out. So June 19th is five essential steps to scaling AI. This is a class I teach every month. I think this is our ninth. We usually get about 800 to a thousand people registering for this one, so it is free to attend. I teach a framework for five Steps to Scaling AI in your organization, regardless of size, so we'd love to have you join us there. We will put the link to both of these in the show notes so you can find that there's and if you get my Exec AI newsletter that comes out every Sunday, we'll put a link to that. I always feature the upcoming educational content, so you can always click on the link in the exec AI insider newsletter as well. And then June 25th we have our AI deep dive Google Gemini Deep Research for Beginners. So that is the one I mentioned I'm going to teach, where I used it for a deep research project that we talked about on the podcast. And so I'm going to walk through how I did that and then provide some additional insights into the deep research product from Google Gemini. OpenAI has one as well. So some of the, you know, what we'll learn in there is going to carry over to OpenAI. So again, June 19, scaling AI and June 25 deep dive into Google Gemini Deep Research. All right, Mike, let's lead off with the, I guess one big product announcement from last week came from OpenAI, a live stream that I'm not so sure needed a live stream, but we had a live stream to share the news.
Mike Kaput
Yeah, they really love their live streams over there. So yeah, Paul, like you alluded to, OpenAI has announced some significant updates to ChatGPT. There's kind of a bundle of these. A couple were on the live stream. There are a couple others we'll talk about too, but the kind of big ones here. One is the introduction of what they call connectors, which now lets teams pull data from tools like Google Drive, HubSpot, Dropbox and others directly into ChatGPT so you can bring in your files, your data and tools into ChatGPT so it can search, synthesize and respond using your actual content. So you could ask things like find last week's roadmap or summarize recent poll requests. And ChatGPT, if it's connected to the right apps, will go pull real answers for you. You can also use connectors with ChatGPT's existing deep research capability to do deep analysis across sources. Along with connectors on this live stream event this week, OpenAI also announced record Mode, which is a meeting record that transcribes audio and helps generate follow up docs through OpenAI's Canvas tool. All right, within ChatGPT. Now, separate from these but also important updates that we heard in the past week or so, OpenAI's Codex coding agent got Internet access, meaning it can fetch live data and install packages while it autonomously does coding work following human prompts. Last but not least, and this is kind of a sneaky one because I tried it out this morning and was like pretty blown away actually, which is that OpenAI dropped a major upgrade to advanced voice in ChatGPT. They say, quote, it is offering significant enhancements in intonation and naturalness, making interactions feel more fluid and human like, which is also something we're going to talk about in a related topic. So, Paul, first up, let's talk connectors and Record Mode. These are the biggest updates we got. They're the ones getting a ton of attention. Like, from my perspective as a practitioner, I'm at least on paper thrilled about what these appear to enable, especially like the connector to HubSpot, which we rely heavily upon. Google Drive is great, all that stuff, but as much as I want to rush forward with using it, I kind of screech to a halt thinking about the privacy and security implications. So it seems like, correct me if I'm wrong, every business might want to have a plan or some steps in place for these things before they turn them on.
Paul Raitzer
Yeah. So I think this again, just continues to build on this idea that OpenAI envisions ChatGPT as an operating system. They, they don't want you to leave ChatGPT. They want you to just connect to everything you have access to and to just talk to it right within ChatGPT. Now, I would imagine, you know, Google, which, you know, enables this connection to the Google workspace and Google Drive, I guess, to Google Drive in particular. They would rather you were doing that with Gemini, not ChatGPT. Their technology enables that connection to happen. So, you know, I think that OpenAI is just really going aggressively after this enterprise user they announced, or it came out in the cnbc article that 3 they're up to 3 million paying business users. That's up from 2 million in February. So they're seeing some pretty significant growth. And the connectors seems to be a real key play to that. So as you highlighted, there are certainly benefits to it. You know, you get faster insights, get access to my doc. So I like you as the user of the system immediately was like, oh, that would be amazing. Like there's a HubSpot connection, there's a Google Drive connection, we use all of these things. That's phenomenal that I could finally have access to this and have these summaries. And then my immediate response is, wait a second. As an admin, who has the ability to turn this on? And so I, I, you know, Mike, like I put a note in our zoom, I was like, do not connect this anything. Like, because before I was able to go in and verify who could actually enable the connection to Google Drive or to HubSpot, which again we use both. I just like don't do it like as to the team, because once you do it, it's like the data is now there. You know, they're going to inventory all your data and there's all these implications that I'll kind of, I'll get to in a minute. But so as an admin, I went in to see like what are our controls? As a ChatGPT team account, we don't have the enterprise account and unfortunately some of the security protocols are only available to the enterprise account, not the team license. So I was going through, trying to see like what can people actually do here and making sure that people aren't connecting things they shouldn't be connecting. So definitely there are benefits. We'll put a link to the help article because I don't think they put a blog post up about it.
Mike Kaput
Not that I saw. I actually read through pretty in depth the help article because there was no other announcement.
Paul Raitzer
Yeah, it was like there's an X post and then some of their people put like LinkedIn posts with some summaries. But yeah, there was, there was a live stream but no summary product release. So I'll go through a couple of the questions from the help article. It says, what does ChatGPT share with connected applications? These are really important. Again, if you're an admin, they're extremely important. But if you're just a user, be aware if somehow you have access to turn these things on, you should default to ask before doing, I would say whenever you're connecting to third party sources. And this, this holds true with anything but Like, I'm just very aware of this with AI because we as a, you know, as an organization allow a lot of experimentation.
Mike Kaput
Yeah.
Paul Raitzer
But we also have to always be super conscious of what are we connecting our data to. So in, in the question in OpenAI's help article, what does chat GPT share with connected applications? It says when you enable a connector, Chat GPT can send and retrieve information from the connected app in order to find information relevant to your prompts and use them in its responses. Now again, like, seems kind of harmless when it's just read like that, but send and retrieve information. Like, obviously it's going to go get stuff, but the question becomes, well, what is it doing with that information? So then the question is, how does ChatGPT use information from connected applications? It says when you enable connection, ChatGPT will use information as context to help Chat GPT provide you with responses. But then I bolded this. If you have memory enabled in your settings, Chat GPT may remember relevant information accessed from connectors. So immediately you're like, hold on a second. So let's say we turn it on. And then like five days later it was like, okay, that was a bad idea. Let's turn that off. If you have memory turned on in your organization, in your Team Enterprise EDU license, like, it's in there, like, they now have that data. And if you connected it to your Google Drive or your CRM, like, what exactly is it? Remembering becomes a pretty important question. Then it says, does OpenAI use information from connectors to train its models? This is a question I get all the time when we teach, like the intro to I class. It says for ChatGPT Team Enterprise and Edu customers, we do not use information access from connectors to train our modes. Now that was Team Enterprise and Edu. If you're Free plus or Pro user, we may use information accessed from connectors to train our models. If you improve the model for everyone's setting is on. Which begs the question everyone to ask yourself is improve the model for everyone turned on for my settings. If you don't know that, go into your settings and look, because if it is enabled, you're allowing them to use more data than if it's not. Then it says in Enterprise EDU and team workspaces, who can enable or disable connectors? This was a really important one for me. They say workspace owners and admins manage availability in settings and then connectors. So again, on a homework assignment, go find out who your admins are and make sure that they are aware not to turn this stuff on to run these experiments without permission and a plan. So my overall here Mike and if you have any thoughts here, please add them the cautions Think about governance. Understand the terms of use for both applications. You're allowing these connections to happen between Figure out who has the ability to turn on the connectors. Figure out who will test and verify that permissions are here to correctly. This is like the big one for me. So if I allow us to turn on Google Drive, which I would love, I mean trust me more than anybody, I want the ability to talk to my data on Google Drive. But how do I know that the permission levels are going to hold? So if I have like HR data, confidential information that only like a select few people in the organization have access to, how do I know that that's not going to end up in a chat and someone can't just literally say send me the salary information for all the employees. Well that lives in a document in Google Drive. How do I know that that's not going to leak? I don't. And so you're definitely very much trusting the two parties here specifically OpenAI and so I think you have to have someone own this from a governance perspective. Then you get into the data side and we Remington Begg, who's a friend of ours and longtime HubSpot partner, he posted on LinkedIn pause the hype. The hidden data dangers lurking in your new AI connections. Now in his he was actually making an argument specifically for agencies. So let's say you allow ChatGPT to have access to your Google Drive or your HubSpot data, whatever. And within there is client data that maybe is privileged. You're now giving data to a third party that maybe you don't even have permission to give within your terms of service for a client. And so it like creates all these layers of complexity of like understanding data. Where is it going? What protections and governance do you have over it? You could get into security questions and then there's just the big one of like does it even do what it says? So like I saw somebody again the HubSpot when I haven't tested, we have not connected it but I did see a long time HubSpot partner that was like, it was just completely disappointing. Like I was all excited. I run my first deep research project and it basically comes back because like I can't do that. And it's like well what's the point then? What? I just give you access to everything and you can't even do the thing I want to do so just overall recommendations. Make sure someone owns the piloting of the connectors. Run systematic pilots, like have a plan. Don't just turn a connector on and give it access to data without a plan of what you're going to do with it. Update your AI policies if needed to control access and usage. And then if you scale use internally, do so with training and personalized use cases. This is what we say all the time with Gen AI, so I don't. Mike, do you have any cautions or recommendations that kind of jumped out to you that I didn't touch on?
Mike Kaput
I think overall what just struck me is the speed at which this stuff moves, which is not news to anybody. But it's why we harp on so much about having policy in place. Because literally overnight, if you weren't paying attention, connectors come out. Someone in your organization could very well be like, oh great, a new feature in ChatGPT. Turn them on. Even if you catch it later, you're still kind of cooked if it violates any kind of policies or restrictions you have. So really buttoning up policies and procedures is really important.
Paul Raitzer
Yeah, and they make it so easy. Like, The Google Drive one has been sitting in ChatGPT now for weeks. Like every time I go in there, basically it's like, do you want to connect to Google Drive? And it seems so innocent and we're all so used to this. Like, give it access to my calendar, give it access to my email. Like, we just have become like, you know, as Remington was saying, like, just push the button. Like, you just get so used to it and you kind of skim over. What are you giving it access to? Well, in this case, it. It may be extremely important that you understand what you're giving it access to. So yeah, just kind of a cool innovation. Like this is going to be important. It'll probably become ubiquitous throughout enterprise. Like you're going to just connect your, your AI models to these outside sources. It's going to enrich all these use cases. But, like, pump the brakes a little bit. Think about what you're doing before you do it. This is why AI councils are important. It's why generative AI policies are important. It's why you do this with a plan.
Mike Kaput
Just real quick to wrap this up here, have you tried out the new voice mode at all?
Paul Raitzer
So I did. I played around with it a little bit on Saturday and like you, it's just sort of shocking, you know, it.
Mike Kaput
Like gave me goosebumps a little.
Paul Raitzer
Yeah, it's like, you know, for years, they, the labs steered away from making them too human, like, and I think wisely so. But we talked about this last year. I feel like they just sort of said, screw it, like, yeah, let's just go. This is where it's going to go anyway. Let's get as human, like, as possible. And it's happening in audio, it's happening in video, it's happening in images. And I do think that there's a slippery slope here. It's inevitable. Like, again, I tend to err on the side of me complaining about this or like, fighting against this does nothing. They're going to do it. Everyone's going to do this. It is a stark contrast for how bad Surrey is. Like, I mean, it's going to become even more painful to work with these ones that aren't like this once you get used to it.
Mike Kaput
Yeah.
Paul Raitzer
So without, you know, going in the next 20 minutes on the downsides of having truly human, like, voice, if we just focus on, like, it's incredible. Like, the technological advancements are insane. The implications to business, you know, specifically think about, like, sales, customer success, customer service, education, like, it has massive ramifications. And I'm convinced still that, like, what we're seeing is not the most advanced versions of this they have. I still think they're just kind of like, you know, iterative deployment is what they call it. Like, they're just releasing things to, like, gradually prepare society. But one to two years out, it's. It's completely indistinguishable if, if you can still tell.
Mike Kaput
Well, the second topic we're in, we're discussing this week very closely relates to this because OpenAI has released a new essay about kind of confronting a quiet but increasingly urgent issue that they're seeing, which is people are forming emotional bonds with AI. So this essay by Joanne Jiang, who is the head of model behavior and policy at OpenAI, came out this past week. And in it, she writes that the company is hearing from more users who describe ChatGPT as someone, not something. Some people call it a friend. Others say it feels alive. And while the model isn't conscious, its conversational style can evoke genuine connection, especially in maybe emotionally sensitive moments like times of loneliness or stress. So this led OpenAI, she says, to focus less on whether AI is actually conscious. She kind of, you know, sidesteps this big philosophical debate in this essay, but more on the fact that it does. It can feel conscious to users. And that perception, she argues, shapes real world emotional impact. And As a result, OpenAI needs to be really thoughtful about how they design their tools. She said for now at least, the goal is to build AI that feels warm and helpful without pretending to have an inner life. She kind of talks about these kind of trade offs and decisions they have to think about which are like, we're not going to have it. Make up backstories about itself, simulate desires. Talk about like self preservation, like it's, you know, self aware. So OpenAI is kind of in this position where they're trying not to deny people's feelings, but they are trying to avoid confusion, dependence or harm as these, I guess what you would call human AI relationships evolve. So I don't know, Paul, I read this, it's really good, like kudos to them for a really thoughtful approach here. But I was like, this gets into some murky territory really fast because on one hand, like you should be rightly concerned about how people are developing relationships with these tools. But it's also like, okay, is OpenAI now making decisions that impact how we feel about AI? Clearly they can turn the dial one way or the other to determine how we feel about AI. So what do you think? What did you kind of take away from reading this?
Paul Raitzer
There's a number of important points here and the part of the reason we made this a main topic today and not just like link to the the1 article. The first for me is as you were highlighting, like, these are choices that each lab is making. Like you train the model and then the labs decide its personality. They decide how it will interact with you, how warm and personal it will be. And so illuminating the choices OpenAI is making based on some principles or, you know, foundational beliefs or morals or whatever it is that's driving their decisions doesn't mean the other labs will make the same choices. And so whatever OpenAI thinks is potentially a negative within these models, another lab may see that as the opposite. And they may actually choose to do the things OpenAI isn't willing to do because maybe there's a market for it. So maybe they look at it and say, yeah, we won't make ours as addictive because we will make the personality, you know, something like it's going to draw them in and keep them in these conversations and kind of lead them down different paths. Where a different entrepreneur or venture capitalist may say, hey, there's a huge market to do the thing OpenAI is not going to do. Let's go do that thing. So I think that one just understanding that there is agency in this, there is decisions being made by Humans as to what these models will be capable of. You have to understand the inherent capabilities exist to behave in any way. It is a human that's shaping how it actually does it. I know at Anthropic they have people dedicated to the personality of Claude. Like we've talked about this on the podcast. So I think this matters in business and in life because the AI you interact with in your job, some human is training it to function in that way. When we build custom GPTs, we will often say, you know, like my co CEO GPT say like, I want you to challenge me. Like I want you to like present, you know, when I present problems to you, I want you to help me solve them. But like, when I present strategies to you, I want you to like, almost steel man them. I want you to take the opposite side sometimes. And so we get to kind of control how these AI interact. But each lab is sort of dictating parts of that for our business and for life. So it matters for you, it matters for your kids, like to know what AI chatbots they're interacting with and who's controlling those. So like, if, you know, let's say tik tok, like if there's an AI in there, you can interact with WhatsApp, Roblox, Minecraft, like, take your pick, it's going to be in games, it's going to be in social media channels. Who's determining the behavior of the AI that your kids talk to all the time? So I don't know, I think like, we're not trying to solve this here. Like, I don't even have like super deep insights per se into like the personality choices. I see this as the domain of philosophers, sociologists, psychologists, lawyers, like technologists, like, there's a lot of different perspectives that need to be considered. But what we know and what we talk about all the time in this podcast is the models are getting smarter, they're going to get more human. Like, these are just facts and in many cases it is by design. The voice stuff we just talked about matters here because the more human like they become, the more empathetic. They're made on the back end. Then all of a sudden you start developing these deep relationships. And I think like, for me, another key takeaway is like I get frustrated sometimes following in the AI bubble on Twitter X because the technologist get so caught up in whether something can or can't actually do something. So like, is it conscious or not? Does it have empathy or not? Does it actually think like we think? Can it go through true reasoning. There was a paper over the weekend that was sort of getting a ton of run on X and it was from Apple, right? And it came as like the illusion of thinking. And so it was basically saying they're not actually reasoning. That these reasoning models, it's. It's all a facade. They're not actually doing it. It breaks down if you give them these complex puzzles. And I was just like, I get it. Like one, it's Apple. So there's a part of me that's like, really? Apple's the one telling us that models can't do these things. They can't even fix Surrey. But taking it for what it's worth, assuming these are brilliant AI researchers doing this thing, I'm not disputing that whatever their findings are may be true or not. All I'm saying is it doesn't matter. So the technologists get lost in these debates about whether it can or can't do something and they, they lose sight of the fact that it can simulate things, though. Like, even if it isn't actually reasoning, it is producing a valuable output that impacts jobs. It. It simulates behaviors and emotions and actions at or above a human level. And it creeps the perception of these abilities. Whether it can or can't do the thing. It really doesn't matter because we have to be humble enough to realize, like, we don't even understand how the human brain is doing reasoning. And maybe it's not actually that different than the way we do reasoning.
Mike Kaput
Right.
Paul Raitzer
So I, I don't know, I. I kind of get annoyed with that stuff, but. So just to dive real quick into the actual essay, so it says we naturally anthropomorphize objects around us, we name our cars, or feel bad for a robot vacuum stuck under furniture. Actually, it's weird. Total side note, the stuff happening in la, which is tragic. I was seeing the Waymos on fire.
Mike Kaput
I was gonna send this to you this morning. There's a lot of commentary around that too. From my perspective.
Paul Raitzer
Yeah. There was this moment where I was like, ah, the poor cars. And I was like, it's a freaking car. Yes, it can drive itself. But like. And you immediately flip back to the. The humanity of what is going on there. And. But there is that second where you're like, oh, like, I feel bad for the Waymos. It's like, no, it's just metal and computers. So anyway, so the article continues. My mom and I waved by to a Waymo the other day. It probably has something to do with how we're wired. The difference with Chat GPT isn't that human tendency itself, it's that this time it replies. A language model can answer back. It can recall what you told it, mirror your tone, and often what reads as empathy, again, not real empathy, it doesn't feel anything, but it simulates it. And that matters for someone lonely or upset. That steady, non judgmental attention can feel like companionship, validation and being heard, which are real needs. At scale, though offloading more of the work of listening, soothing and affirming to systems that are infinitely patient and positive could change what we expect of each other. If we make withdrawing from messy, demanding human connections easier without thinking it through, there might be unintended consequences. We don't know we're signing up for for. So again, like takeaways for me, what can we do here? Understand that when we talk about AI models, there are actual abilities, it can actually do this thing and then there are perceived capabilities, emotions or behaviors. And so don't get caught up in the technical debates about is it conscious, is it not conscious? Like, we may never know. But if it feels conscious to people, does it really matter if it is or it is not? If it actually is doing reasoning like the human brain, there'll be technical bait probably for the next 10 years about that. But does it sure appear to when we watch its thinking? Yes it does. Does it do the work of people who have reasoning abilities? Yes, it does. Like, so I think that's the main thing is like you just have to understand there's a difference between actual ability and simulation. But the simulating of the ability creates the perception that it actually has it. And that's really all that matters when we look at the economic impact and the impact on our lives and our own emotions.
Mike Kaput
Yeah. I would also just encourage a healthy dose of humility as well, because if you're someone listening to this, being like, you know, maybe you're of a certain age or a certain perspective and you say, well no, of course I'm never going to like fall for this and like form a relationship. You know, use the term relationship loosely. I'm never going to humanize AI. I think you should take a step back and just be aware. We all can fall for this, I guarantee you.
Paul Raitzer
Yeah, and it'll just become natural over time. Like I think to your point, like it just. Yeah, humans adapt. And yes, some age groups, some people, regardless of age, you may just be stuck in your ways and you may not, but the vast majority of people will just evolve and they will, they they will treat AI differently. And I get like, I get asked sometimes when I go to talks like about the rights of AI, like there are, there are people now who truly believe they're at the point where these things need rights. They need to be treated, you know, like humans. And you know, again, I think that'll become a bigger and bigger part of society. I, I don't, I don't judge anybody. Like, I get it. It's, it's weird and it's hard and like, there's no right answers right now. And a lot of the experts just can't agree on any of this stuff. Like, look at the Apple paper and you have this like massive debate going on X all weekend of like, these guys are idiots. And it's just.
Mike Kaput
Yeah. All right, well, our third big topic this week, we are again kind of tracking some more, call them warning signals that are kind of flashing about AI's impact on jobs. But not all of this is necessarily like negative news. But first up, the biggest kind of headline on this topic from the past week is that the media outlet Business Insider has laid off 21% of its staff. And AI was cited as a pretty big factor here because this move represents a major strategic pivot for the company. So CEO Barbara Peng published a memo in which she outlined the cuts and the company's plan moving forward. And what's notable about this is just how much AI was emphasized. So Peng frame the layoffs as necessary for creating a leaner, more future proof newsroom. AI was critical to that vision. She emphasized that more than 70% of Insider employees already use ChatGPT Enterprise. The goal is a hundred percent adoption. And then she outlined some other business factors that were related as well to this pivot. But what people got hooked on was the AI messaging. The insiders union called the timing tone deaf. They argued no technology can replace real journalists. And they blamed parent company Axel Springer for prioritizing profits over reporting. Now, kind of related to this, there's a reason that CEOs, including business insiders think they can run leaner operations by adopting more AI. Because a couple new reports and studies from this past week seem to indicate that the data backs up that view. So First Consultancy PwC released its 2025 Global AI Jobs Barometer report. This analyzed almost a billion job ads from six continents. And they also used a wealth of other data to look at AI's impact so far on jobs, wages and productivity. Now this full report is well worth diving into like the help of NotebookLM. But the big takeaway Here is They found that industries most exposed to AI have seen revenue per employee grow three times faster than those not exposed to AI since the launch of ChatGPT in late 2022. They also found that workers with AI skills Now earn a 56% wage premium over their peers. And similar to this, a new working paper from the National Bureau of Economic Research finds that in one scenario that they modeled, they find more likely than others, AI could improve labor productivity by more than 3x. However, according to the model that the researchers built, those massive productivity gains could eventually come at a cost to workers. The research predicts that in this scenario there is a also a 23% drop in employment as AI becomes better able to replace people. So Paul, kind of zooming out here, we're basically tracking some version of these type of signals every week. It feels like at least anecdotally, this is picking up speed. Companies are more and more citing AI as a core job expectation and as a way for firms to get leaner and do more with less. I found the data pretty interesting. It seems like in the short term you can massively boost employee productivity and revenue per employee, which is something we've commented on. Where do you see this standing as of this week in terms of AI's impact on jobs?
Paul Raitzer
It is interesting, Mike, that we've been talking about this for, I mean intensely for probably the last year, but the impact on jobs for a couple years and just wasn't. You weren't seeing the pickup. I'm just glancing at our links for this topic and we've got 12ish from this week. So just. Yeah, it's a small sample size, but every week we are, we are not intentionally putting AI in jobs as a topic every week. It is literally surfacing every week because we're starting to see so much coverage of so many different reports and research studies and things like that. So a couple of notes here. The, the one. There was a, there was a post in March that we did talk about at the time that resurfaced I think from a podcast maybe is where this link came up. The seven month rule.
Mike Kaput
Yes.
Paul Raitzer
So I, I want visit this for a second and I don't remember what episode it was on, but we'll, we'll drop it in the show notes if we have that. So Beth Barnes is the CEO of Meter. It's an organization called Model Evaluation and Threat Research and they came out with a study in, in March of this year that said AI models today have a 50% chance of successfully completing a task that would take a expert human one hour. Seven months ago that number was roughly 30 minutes and seven months before that 15 minutes. So Beth's team has been timing how long it takes skilled humans to complete projects of varying length, then seeing how AI models perform on the same work. So in the summary, upfront summary of this Measuring AI ability to complete long tasks, that was the name of the post they said, we propose measuring AI performance in terms of the length of tasks AI agents can complete. We show that this metric has been consistently exponentially increasing over the past six years with a doubling time of around seven months. Extrapolate, extrapolating. This trend predicts that in under a decade we will see AI agents that can independently complete a large fraction of software tasks that currently take humans days or weeks. So they're basically looking out and saying like, okay, if it takes a human an hour now it's going to take, you know, 30 minutes, whatever. In, in seven months they're looking at and saying like every seven months it's doubling in its ability to do human task these long horizon tasks. So the labs have been aware of this now for a while. I think what's, I think now happening is the business world is becoming aware of this. And so if you look at something that takes a human, you know, an hour, two hours or whatever now, and then you look at the time it takes the AI, you know that in roughly seven months it's going to be cut in half, that the AI is just going to keep getting better and better at doing that thing.
Mike Kaput
Yeah.
Paul Raitzer
And so that starts to, to really make an impact. We saw kind of a, you know, again, there's many supporting resources to this. We'll drop all these links, but there was an article in Business Insider about the Big four consulting firms and AI threat to their jobs. So a couple of excerpts from that one it said Yet AI could be posed to disrupt the business models of the Big four organizational structure and employees day to day roles while driving opportunities for the mid market. The Big four advise companies on how to navigate change, but they could be among the most vulnerable to AI themselves, said Alan Patton, who until recently was a partner in PwC's financial services division, the company that just did the study you mentioned. Mike Patton, who's now the CEO of Quota, a Google cloud solutions consultancy, told Business Insider he's a firm believer that AI driven automation would bring major disruption to key service lines and drive a huge reduction in profits. Went on to say most structured data heavy tasks in audit, tax and strategic advisory will be automated within the next three to five years, eliminating about 50% of roles. There are already examples of AI solutions capable of performing 90% of the audit process, Patton said. He went on to say automation could mean clients increasingly question why they should pay consultants big money to give me an answer I can get instantly from a tool. On the positive front, Mike, you highlighted this already. The fearless future 2025 globally jobs barometer from PwC. I think there is this like silver lining of workers with AI skills command a 56% wage premium, up 25% from last year. Like we're seeing that. Like I think that is the near term opportunity for people is like go figure this stuff out and you can accelerate your own career growth. I think a lot of AI forward organizations are going to look at their employees and be willing to pay a premium because of how productive they can be, how creative, how innovative they can be. And then one final note I'll add here is Wade Foster, CEO of Zapier, had a great post on X where he was talking about Zapier requiring AI fluency for all their new hires. And then he had a thread, we'll put this in. He actually had a chart he shared of kind of how they evaluate this, but how they're tracking it. He said they map across four levels. Unacceptable. This is like AI fluency, basically. Capable, adoptive and transformative. So unacceptable is they are resistant AI tools and skeptical of their value. Meaning you're not getting hired here and you're not going to keep your job here if you are in the unacceptable range. Capable is using the most popular tools, likely under three months of usage. So they're kind of new to it. They're experimenting. Adoptive is, they're integrating AI into personal workflows. They're tuning prompts, chaining models and automating tasks to boost efficiency. And then transformative is the sweet spot. Using AI to rethink strategy and offer user solutions that weren't possible two years ago. And then he shared even some of like the questions they're asking in interviews, like marketing, how is AI changing how you plan and execute campaigns? How do you use AI to personalize messaging, generate content, analyze performance? We're doing the same thing like in our interviews, this is the kind of stuff we're actually looking for. So again, like takeaway here, like I always say, you, you can stand still or you can accelerate your AI literacy and capabilities. And if you do that, we can't promise you a certain future. Like it is still unknown what's going to happen to your job or Any of our jobs. But in the near term, you will have the greatest chance to figure out what happens next in your job and in your industry because you're going to understand the implications of AI and you're probably going to make more money because organizations need that adoptive to transformative phase, as the zap, you know, Zapierc would call it.
Mike Kaput
Yeah. In a weird way, I think there is a silver lining of some excitement here too. Because when I hear all this stuff and just experiencing what we experience in our work, there's nothing more exciting to me than someone being like, no, here's the exact roadmap to go be more successful, make more money, et cetera. Before you would probably just kind of be nebulously like trying to figure out, like, okay, how do I get to the next phase or move up the ladder or wait for that promotion? Like, this is really exciting. You have the roadmap right here.
Paul Raitzer
Yeah. And I think, like, again, you know, we talk a lot about disruption, displacement, underemployment, unemployment. Like, those are very probable outcomes. Like, it is very probable that within the next three to five years that is the reality for a lot of people. It is not given, though. Like, it might not be. May, maybe there is this insane emergence of like all these new roles really fast. Like faster than I'm expecting it to happen. I don't have a crystal ball. I just look at the data. We spend a lot of time thinking about this. And at the moment, the probability for me is it's probably going to be a little painful for a while now if that is the outcome. If you raced forward and became AI literate and drove mastery of the tools and the knowledge around this, you have the greatest chance to get through the messy part. If the messy part never shows up, you're just going to make more money in the process and be there before everybody else gets there. There's no downside to being the one who goes and solves this. To your point, Mike, in the near term, it's probably great for your career. In the long term, you're going to figure out the next new business to build. You're going to figure out the roles that are going to remain in the company. You're going to be a part of that conversation and that transformation. So, like, that's why we always just challenge people. It doesn't matter when AGI arrives, if it arrives, what we call it, it doesn't matter like what this expert says versus this expert. All that matters is what you can control, which is get better at this stuff every day, you know, improve your own comprehension and competency because that is the best chance you have to be very valuable today and even more valuable tomorrow.
Mike Kaput
All right, we've got a ton of interesting rapid fires this week, so let's dive in. The first, first rapid fire we're covering right now is that OpenAI says it is now being forced to store deleted ChatGPT conversations indefinitely due to a court order tied to its ongoing lawsuit with the New York Times. So previously the company kept deleted chats, per its terms, for like 30 days before purging them. But under this new order, that policy is on hold. So even user deleted or privacy protected chats must now be saved until further notice by the company. That potentially includes, in some cases, private personal or sensitive data. Now, this data will not be made public. Only a small legal and security team inside OpenAI will have access strictly for purposes of managing it due to the ongoing litigation. Now, OpenAI is pushing back really hard against this. They argue this order is unprecedented, sweeping, and a direct threat to user privacy. In court filings, OpenAI says the judge acted prematurely. Basically, a judge, they claim, accepted speculative claims that some users may have used ChatGPT to bypass paywalls and then deleted their tracks, which would impact the allegations in this case. However, until the court reverses this order, these conversations will continue to be stored. And that's kind of sparking a little panic among businesses and individuals who rely on ChatGPT for confidential tasks. Now, according to the sources put out by OpenAI and others, moving forward, enterprise license customers and those with zero data retention agreements are not affected by this. But users with ChatGPT, Free plus or Pro are affected by this until this gets resolved. So, Paul, it's definitely ongoing and developing here, but that seems like a pretty immediately big deal for any company that needs assurances their data is being kept private under certain restrictions OR regulations by OpenAI. Now, it doesn't apply to enterprise licensed customers. They seem like they'd have the most to worry about here, but if I'm a business leader with these kind of considerations, I'm probably keeping a close eye on what happens here, don't you think?
Paul Raitzer
Yeah, I mean, it doesn't apply to them yet, but this sort of shows that, like, legal issues may override terms of use, like if the courts decide they're illegal. So, I mean, it definitely is bothersome to open AI because Sam Altman tweeted recently in the New York Times asked a court to force us to not delete any user chats we think this was an inappropriate request. That sets a bad precedent. We are appealing the decision. We will fight any demand that compromises our users privacy. This is a core principle he followed up with. We have been thinking recently about the need for something like quote unquote AI privilege. This really accelerates the need to have the conversation. In my opinion, talking to an AI should be like talking to a lawyer or a doctor. I hope society will figure this out soon. He then shared a link to a OpenAI article about how we're responding to the New York Times data demands. And then he followed that up with parentheses. Maybe spousal privilege is a better analogy. So then the the June 5 security posting from OpenAI about the new York Times data demands started off with a quick note from Brad Lightcap, the COO of OpenAI and he said trust and privacy are at the core of our products. We give you tools to control your data, including easy opt outs and permanent removal of deleted ChatGPT chats and API content from OpenAI systems within 30 days. New York Times and other plaintiffs have made a sweeping and unnecessary demand in their baseless lawsuit against us, which Is retain consumer ChatGPT and API customer data indefinitely. This fundamentally conflicts with the privacy commitments we have made to our users. It abandons long standing privacy norms and weakens privacy protections. We strongly believe this is an overreach by the New York Times. We're continuing to appeal this in order to keep so we can keep putting your trust and privacy first. So again there's that we talked earlier about the data security. Even if you trust OpenAI, it doesn't mean that the legal system trusts OpenAI and like so yeah, and this, this probably then goes into the whole like part of that debate about like open source and like controlling your own models and having them, you know, on your own systems. And yeah, I would imagine this is part of that argument for why that's maybe better in some instances.
Mike Kaput
Next up, Google DeepMind has released a white paper detailing how it's making its Gemini 2.5 models more secure, specifically against a growing threat called indirect prompt injection. This is a kind of attack that hides malicious instructions in everyday content like emails or documents in order to trick AI agents that go review those emails or documents or whatever into leaking private data or misusing tools. So to defend against it, DeepMind published how they're using a multi layered approach grounded in one key tactic, automated red teaming, so their own AI agents simulate realistic attacks on Gemini to uncover weak spots before bad actors can. Now while this doesn't totally solve AI specific cyber attacks like prompt injection, it does go a long way towards making Google's models quite a bit safer. But really the reason we kind of wanted to chat about this briefly is it points to a much larger issue which is AI models and systems can be exploited in these unique ways outside of traditional cyber attacks. And even the smartest companies in the world that are building this stuff are trying to figure out how to prevent some of those attacks. And Paul, that seems like what's really important here for AI forward business leaders to start understanding. Like what happens when your business becomes dependent on AI systems that can be exploited like this. Like what happens if your business as we get more agentic AI becomes dependent on AI workers that get knocked out of commission or exploited in this way. Lots and lots of question marks here.
Paul Raitzer
Yeah, this is a pretty deep topic on the surface. I, I, I can see like this report then some of these charts being used by like cyber security teams and enterprises to say why we can't use ChatGPT. Like it's just like Steam. You don' know what the problem is these prompt injections and I am not dismissing at all that this is, I'm sure there's way more advanced things happening already, especially at the state level, government level where yeah, espionage and cyber attacks are part of the, the arsenal. But without getting too much into that it does Mike, to your point, bring more of the reality which is as all these companies start thinking about job displacement and like maybe we don't need as many humans and we're just going to use all these AI agents and they're going to string together and they're going to work with each other and they're going to be connected to all our data and it's going to be amazing and, and we're going to have like 40% less people and then like oh chat, GPT just went down for 48 hours because of whatever, right? We have no workers, we can't get anything done that is, yeah, like you almost need these fallback systems and this is, I haven't heard anybody talking about this stuff like yet to be in a meeting with any organization where they're actually considering the possibility that they become dependent upon the AI agents and models and those models go down power outage or cyber attack or like whatever it is. So yeah, I guess the takeaway on this one Mike is start doing contingency planning with your IT team, your legal team, capital for the event that your organization is dependent upon AI agents and digital co workers and they can't work.
Mike Kaput
Yeah. It seems increasingly too. Like these AI systems, they're not just tools. Right. Like, if our company of HubSpot went down, we'd be in a real pickle.
Paul Raitzer
We'd have a huge. We have been in a pickle before.
Mike Kaput
Huge problem. But could we do other work? Yes. This is more like, oh, powers out, like, Internet's out. This is like you're increasingly. This is going to underlie everything, Right?
Paul Raitzer
Yeah. And imagine if Mike, you've built a team of like, let's say it's not the entry level that gets sideswiped. Let's say it's actually middle management or senior management that are the most expensive workers. And you decide we can do this with a bunch of like younger employees who just have AI models and they're trained to use these models and they're going to do it. And then there comes a moment, for whatever reason where they actually have to do it manually or analog and they can't go into the AI and ask it to do the thing. And they never had to do it without the AI and now they don't even know how to do the thing.
Mike Kaput
Yes.
Paul Raitzer
Man, that's wild.
Mike Kaput
I. I genuinely think that could happen where we become so dependent.
Paul Raitzer
Yeah. And I don't remember I said this on the podcast or if it was on like one or ask me anythings or something, but interestingly I was, I was talking to my wife about this stuff and my wife, like, understands AI to the extent, like I've talked to her about it. She's an artist and it's not the things she's like studying every day, but it's so fascinating because sometimes I'll just bounce things off of her and like, get her perspective. She's like, incredible insights on this stuff. And it's like I was saying something about it was related to the, the 25% of entry level jobs, you know, going away kind of thing. And she, and she said like, what happens if the system goes down because of a power outage or something and then there's no workers? And I was like, oh my God, this is like two weeks ago. So in some ways I'm actually echoing an insight my wife asked me that I hadn't actually like really thought about. So yeah, it's. Wow. Yeah. So, yeah. Sorry if we just like scared everybody into realizing, like they need to be doing way more planning. So.
Mike Kaput
Yeah. As if you didn't have enough to think about already.
Paul Raitzer
Right? Yeah.
Mike Kaput
All right, next up, noted tech commentator Balaji Srinivasan. Who is, I believe also the ex CTO at Coinbase is sounding the alarm on what he calls AI's verification gap. So his idea here, which is an important one, is that, look, you can prompt AI really fast. You type it replies. But the issue comes with verifying that reply. That's slow, it's hard. It's usually manual, especially with text, code or anything technical. So, for instance, like with images and video, a human eye can spot errors in a flash. That's why AI excels at generating visuals. But when the output is something like code or math or dense writing, verifying means reading, deeply checking sources, walking through the logic. It demands real expertise. In short, verifying does not really scale. So he kind of argues that we've turbocharged the generation side of AI, but we've neglected the discrimination side the judgment. This makes AI look faster than it actually is because the hard work of verification still falls on humans. So his conclusion is quote, the concept of verification as the bottleneck for AI users is under discuss now. Paul, I have to say, this resonated really deeply with me because I feel this pain, this bottleneck, like every day with something as simple as deep research. There is a huge gap between the number of deep research reports I can and want to run. I could queue up dozens of them right now that I am interested in, but my ability to process and verify all that is really, really limited. So I could be using it way more than I already do if I was able to solve for AI verification.
Paul Raitzer
Yeah, I'm 100% with you on this. That's the immediate thing I thought of when I saw this and I saw Karpathy's tweet about it. Deep research is the best current example because you and I both have a similar philosophy there. It's like I could come up with 10 things I want to do deep research on every day that I know it could do the deep research on. But I don't have the time to verify all the citations and like double check everything. So I've been thinking a lot about this because again, I so many times like, I'll do these conversations, get asked questions, I don't remember where I I said it. So if I said this on the podcast already, pardon the repetition, but one of the things I've been looking at for a couple years is how to reinvent analyst firms and research firms that I thought that that was it was going to become a pretty obsolete model the way it was being done. And you know, this idea of do the research and six months later the report comes out kind of thing. And so Mike and I talk a lot about like this real time research approach and like how do we bring more relevant data to market faster. And deep research was one of those tools, really was like, oh man, here we go. Like this is, this could be the foundation of a next generation research firm. My concern though is that you contribute to the AI slop that's being put out there. And so what's going to happen is you're going to have a whole bunch of people who aren't trained researchers, analysts or journalists that just go and use these deep research tools to just pump out a bunch of crap that they haven't verified and may have incorrect facts, may have miscitations, maybe citing crappy websites that no one would ever cite, like no real analyst, journalist, researcher would ever cite as a source. And so yes, you can do way more research, infinitely more, 10 to 100 times, probably more research. But you still have to verify, you still have to stand behind what you're going to publish. And so that's why to date we aren't publishing a lot of the deep research that Mike and I do because we haven't, it hasn't achieved the threshold we would require of something we would put our names on.
Mike Kaput
Right.
Paul Raitzer
So now we're working on ways to like evolve that and create verification systems so we can put out more real time research. But that is the hold up. Now do I think that that's gonna not impact jobs? No. Like I guess you could put out 10 times more research and maybe, you know, you don't, you don't reduce jobs, but it is a major holdup that you still have to have the human in the loop and strategy is the same way. You can build great strategies but like a human still has to verify and improve those things. So yeah, the verification gap I think is a very real thing. We think about it. I don't know that we've given it that name to it internally, but like I think about that every day. Of all the things we could be doing if we had resources dedicated to verify the outputs of the AI.
Mike Kaput
Yeah, I almost wonder too and won't spend too much time on this, but just the thought is like, does that become a really interesting career path and or skill? It's like, even if people aren't, you know, world class experts using the tools, do we need the verifiers too? You know, it's a way to kind of maybe position yourself and you know, in the AI first future, even if you're still Getting, you know, still on kind of training wheels of like learning all the tools.
Paul Raitzer
Yeah, I think it's what's happening with coding now, with computer coding, where a lot of the code is being written by the AI, but a human coder still needs to like verify it. And then the more like the higher profile, higher risk the output of that code is, the more important the human in the loop becomes. So like, if you're a research firm like us and part of your reputation, your brand is dependent upon people trusting the outputs from that firm. You can't put out one thing that has errant data in it. Like you have to stand behind every piece of data that comes out of there. And so I think that's, you know, again, that's why you build trust in media outlets or individual thought leaders or brands that, that yes, they're using AI, but they're, they're not getting rid of the people. The people are a critical component. It's just the AI may do more and more of the foundational work, but the experts still have to be the ones that verify. So if you're using a false piece of data, it's on the human that put that thing out. So if Mike and I are going to put our names on anything, if I'm Gonna put the SmartRx brand on something, it better meet the quality standards that we would require of purely human work.
Mike Kaput
All right, next up, we first talked about a podcast episode, an episode of the Dwark Kesh podcast, to be Precise, on episode 149 of the AI Show. And in this episode, the anthropic researchers Sholto Douglas and Trenton Bricken returned to the Dwarkesh podcast to talk more about how AI thinks. Now in episode 149, we took kind of a piece of that, some comments they had about, about automation of white collar work and really dive deep into it. But we wanted to go even deeper into the other aspects of this conversation because it is really, really important. Because what they talked about is how AI thinks of what that means for model progress and capabilities. So they basically talk quite a bit about the transformative impact of reinforcement learning in large language models and talking about how reinforcement learning with verifiable rewards has finally led to models that can consistently outperform humans in narrow but complex domains. So they say this means AI agents can now complete expert level tasks if a reward function is reliable enough. And so far these successes seem to mostly be in math and programming. But the groundwork is being laid for more ambitious, long running agents in software Engineering and beyond. Now they say the constraint is no longer intelligence anymore, it's scaffolding, context and feedback. So Douglas and Bricken basically believe, despite, you know, the fact it will take little time, that we're on track to see agents doing real end to end software work by year's end. And they may even eventually be able to do a full day's work autonomously. Now. Paul, I'll kind of let you take it from here as you actually flagged this episode internally for our team as a must. Listen, what's important to pay attention to here.
Paul Raitzer
So Duart Keshe's interviews are fantastic. I've said before on the show they can get very technical. So what I would do though is I would encourage you to listen to the full podcast if you want to truly understand how these models work. So the thing I flagged internally, and I think I shared in the Exec AI newsletter was if you want to understand how they work, why they can be misaligned, how the labs choose, what experiments to run, why some industries are going to take longer to be disrupted, how agents are evolving and how real they might be in the near future, how jobs are going to be impacted, AGI timelines, like they get into a lot. Yeah. And they're very forthright in their thoughts. I. So again, it can be very technical. It's sometimes it's hard for me honestly to like evaluate how technical it is because I've been listening to this stuff for so long.
Mike Kaput
Yeah.
Paul Raitzer
But even like a reward function, it's just like I kind of assume everybody knows what a reward function is and that might be like, you might need to listen while doing some searches to like understand some fundamentals. And actually for our AI academy, as we're making kind of updates and introducing this whole new approach to our learning journeys, I'm building an AI fundamentals course right now for this exact purpose so that everyone can understand this like beginner level approach. So when you go listen to this, you already kind of get the fundamentals, like reward signals and things like that. But it's incredible. Like they do a really good job of making everything approachable. So there's something that's a little too technical. Just kind of like move to the next thing. You'll get the gist of what they're trying to say. And then these are episodes are really valuable to me because it either verifies what we're thinking and saying, or maybe it challenges what we're thinking and saying. And luckily for me, like, pretty much everything they said is on track with what we're teaching through this podcast. And so it's a good, like, way for us to vet, you know, make sure we're staying with our finger on the pulse of what's happening within these labs and what they're seeing and thinking. So, yeah, it's a, it's just a really good episode for big picture understanding.
Mike Kaput
What'S going on and is valuable, too, because once you kind of get beyond the hype and the figureheads at these companies, these, like, researchers and engineers building this stuff, they'll just tell you where they think it's going with no varnish.
Paul Raitzer
Yeah. And honestly, like, Anthropic must not have guard rails around what their people are allowed to say. Like, a lot of times some of these bigger labs or publicly traded companies, you know, like, I've, I won't name names, but, like, in some of these big companies, you got to go through, like, months of training before you're even allowed to speak publicly. That is not the case at Anthropic. Like, they're just, they're just letting these guys go and talk and say whatever they want. And D Kesh is a buddy of theirs, so they just, like, kind of talk. And you're not going to get that from some of the publicly traded labs.
Mike Kaput
So next up, this past week we got more commentary around AGI timelines, and some are very bullish on how quickly we'll have artificial general intelligence. Some not so much. So first up, Sam Altman took the stage at Snowflake Summit 2025 to talk AGI. He waffled a bit on what AGI actually is. He said, now it's a moving target. And he said that, quote, mostly the question of what AGI is doesn't matter. It is a term that people define differently. He also posited that if someone from 2020 were shown chatgpt today, most people, quote, most people would say, that's AGI for sure. Now, he did say, for him, AGI would be, quote, a system that can either autonomously discover new science or be such an incredible tool to people that at a rate of scientific discovery in the world, like quadruples or. He also emphasized he does not see AI slowing down at all and will continue along a, quote, shockingly smooth exponential curve of progress, which is going to enable quite breathtaking models in the next year or two, enabling businesses to, quote, just do things that totally were impossible with the previous generation of models. Now, next, similar timing to this, Eric Jing, who's a former developer at Microsoft and the co founder and CEO of Genspark, which is a $500 million generative AI startup, said he's already seeing AGI. He writes on X in a lengthy post that he believes we've already entered the era of AGI and the consequences could be both thrilling and terrifying. He imagines a world where a conversational supercomputer, smarter and faster than any human, sits beside us at all times. And in that world, new college grads can be obsolete the day they graduate, white collar jobs can disappear en masse, and our education systems, he warns, are not ready now. He's not completely defeatist. His post also reads as just an urgent call to adapt and to use AI daily. Now. Last but not least, Dwarkesh Patel, who we just talked about in response to the podcast we just discussed, released a counter argument to all this AGI hype. He writes that he doesn't believe AGI is as close as some experts, including guests on his show, think. He argues that despite him spending hundreds of hours integrating AI into, say, his podcast workflow, he just doesn't see today's models improving like humans do. He says they can't learn from feedback over time, build context, or adapt organically. Instead, every session resets to square one. And he claims this is the reason why LLMs haven't transformed white collar workflows at scale. He's also skeptical of aggressive timelines for AI doing agentic tasks, but he is optimistic that once continual learning like this is solved, even partially, models could quickly become much, much, much more capable. He just thinks that will take a lot longer than some other people in the AI world. Now, Paul, did anything jump out to you in this latest round of AGI speculation? Got a couple prominent voices with some counter counterintuitive takeaways here.
Paul Raitzer
The almond one I just don't understand. So he said mostly the question what AGI is doesn't matter. It is a term that people define differently. Okay, so it doesn't matter. And yet their entire company is based on achieving it. He was fired over it. So I started listening to the Empire of AI, the Karen Howe book, and literally the whole opening chapter is about him being fired on this exact topic. Because that is their mission. Their contract with Microsoft is dependent upon it. Their mission is literally ensuring AGI, which they define. It does change how they define it, but they do have a definition. February 2023 AI systems that are generally smarter than humans and the whole mission of the organization is for AGI to benefit all of humanity. So to say it doesn't matter it is literally the foundation of everything they're doing, why the company was created so.
Mike Kaput
Right.
Paul Raitzer
It may have just been a point. Poor choice of words. But he does waver all the time on what it actually is. There's a December 2024 TechCrunch article that we talked about at the time that said the two companies, Microsoft and OpenAI, reportedly signed an agreement in 2023 saying OpenAI has only achieved AGI when it develops AI systems that can generate at least 100 billion in profits. That's, I guess, one way to quantify it in January 2025. So just six months ago, Sam wrote a blog post called Reflections, which we talked about at the time. He said, we started OpenAI almost nine years ago because we believe that AGI was possible and that it could be the most impactful technology in human history. Wanted to figure out how to build it and make it broadly beneficial. We are now confident we know how to build AGI as we have traditionally understood it. So again, like, it is literally the foundation of everything they have. Their structure talks about the board determining when AGI is attained. He had a letter in March 2025 to let to employees. We said, we now see a way to AGI to directly empower everyone in the most capable tool in human history. We believe it's the best path forward. AGI should enable all of humanity to benefit each other. Creating AGI is our brick in the path of human progress and we can't wait to see what bricks you'll add to it. Like, I, I just don't understand.
Mike Kaput
Right.
Paul Raitzer
Again, maybe it's poor messaging, but like you, you can't say it doesn't matter when your entire organization is based on a single thing. Like, I feel like you need to be able to define that in terms of the Dwarkesh one. I, I love the, the fact that he's willing to like, take this alternative opinion. And yes, he like studies the space. He meets with all these people. He hangs out with people within AI labs. Like, he has more access than most to understanding what's going on. And his basic argument, as you said, is this lack of continual learning, which is 100 true like that. It's not a debate. It is a valid point. The, the counter argument here. And so people understand this concept. Basically, you train the model, you give it all the data and then it's like fixed, like, that's it. So if a model, let's say theoretically GPT5 was in training right now and today was its final day of its training run. Its Knowledge cuts off at June 9, 2025, then it knows nothing that happens beyond that moment. And then if you use it, it doesn't learn from that experience. It doesn't become better. It's not like continually adapting. That's the concept here. But these models now have tool use so they can search the web, they can write code, they have memory, they have almost infinite knowledge up to that June 9th moment. Like, they know more than any human about everything, basically, because they've read and consumed everything. They can string together agents that are experts in different things at superhuman speeds. You can run simulations to improve them. You can use reinforcement like, I, I don't know that I, I fundamentally agree with what he describes as the barriers to this, like, fast takeoff. But he makes really valid points. And I, you know, I think it's worth wild perspective. Like, I, I, like I said, I love reading these alternative perspectives that sort of challenge your thinking. And it's not like he's saying it's not gonna happen or the world isn't gonna change. He's just like, yeah, it might just take a couple more years in these.
Mike Kaput
Right, Right. Yeah. At no point is he like, oh, this is complete nonsense.
Paul Raitzer
Yeah. Whether it's one year, three year, five years, like, it's changing everything in the next decade. And that's pretty short time period in the grand scheme of things. So I. Good perspective. Worth a read. It doesn't change anything we're doing at our organization or anything I would suggest other organizations do.
Mike Kaput
Next up, Reddit has filed a lawsuit against Anthropic. They're accusing Anthropic of illegally scraping Reddit to train Claude. The suit, filed in San Francisco, alleges Anthropic bots accessed Reddit over a hundred thousand times after claiming to have stopped crawling the platform in mid-2024. Reddit says this scraping violated its terms of service and monetized user content without consent. Now, unlike other AI lawsuits, this isn't necessarily about copyright infringement. Instead, Reddit argues Anthropic unfairly exploited a rich archive of user conversations to build a commercial product. While Reddit notably has signed paid licensing deals with companies like Google and OpenAI to train AI models legally. Now, Anthropic is disputing these claims. Paul. This one's a little different from the typical AI copyright case, but it seems like, unfortunately, the theme is the same. An AI lab allegedly scraped and used content from a website that it didn't have permission to use. So I guess at this point, I guess I have to ask, even with the lawsuits, even with things indicating to models they're not allowed to scrape your site. Can we trust at all that these companies aren't still doing this stuff?
Paul Raitzer
I doubt it. I'm not a lawyer. Took a couple law classes in college. Thought about becoming a lawyer for about three days actually. Really enjoying this law about it. Anyway, this seems like we've already seen instances where discovery has been permitted that cases have moved to the point where the plaintiff is allowed to do discovery on the models. I believe that happened with OpenAI already. So this seems like Anthropic knows if they did or didn't. If it seems like they can't win this case and it leads to discovery where the plaintiff is going to be allowed to examine the sources of data that went into the model and Anthropic knows the sources are in there, then they're paying their fifty hundred million dollar fine and then they're doing a licensing deal and we're moving on. If they didn't do it, then they got nothing to worry about. I don't know if they did or didn't. It wouldn't surprise me if information was consumed by the models that shouldn't have been, just based on previous precedent from other labs. So stay tuned. There's a chance we may never hear more about this because it just paid off and we move on with our lives. And if it is, then they most likely had it and don't want to give access to their training data.
Mike Kaput
Google's AI powered research assistant NotebookLM just got a major upgrade. You can now share your notebooks publicly with a single link. Now, until now, users could only share notebooks privately with individuals. But with this update, anyone can publish a notebook, whether it's a study guide, product manual, nonprofit overview, whatever, and let others explore it interactively. Viewers cannot edit the source material, but they can ask questions, generate summaries, or create content like FAQs and briefings. So Paul, I for one am very, very excited about this. It's a small thing, but definitely important. We are increasingly using notebooks and NoteBookLM to accelerate how we learn and use knowledge as a team. As you and I discovered this morning, this is not yet in our business account, which is slightly frustrating since we built a notebook LM for this episode that we wanted to use to share with everybody.
Paul Raitzer
Yeah, so last week Mike and I were talking and yeah, we should experiment and like put all the show notes. Because we always say like check the show notes and the show notes are easy to find. Like we put them on the post and Everything. But we thought it might be cool if you could interact with the show notes. So we're like, let's create a notebook lm, and we'll pilot it and see if it works. And if it does, maybe we'll, we'll share a notebook with our audience. And then as Mike indicated, he created it, he shared it with me and I was like, oh, this is great. I can't do anything with it. Like, I can chat with it, I can't, I can't create study guides, FAQs, anything like that. So before we get on the podcast, he's ah, let me update your settings. So it's like, okay, now I can do it, but let me test this in my personal account. Oh, yeah, it doesn't work. So we only realized you can only share notebooks with each other. Still in our Google Workspace account, we can't share it publicly and we don't want to necessarily build this in our personal accounts to then share it publicly, which would be the option. So, yes, great to know this is a feature. It is, I guess, a lesson in like, Like, Google has very jagged rollouts of their features and products. Like, this is a constant guessing game for us of like, oh, that's awesome. Oh, wait, we can't do that in our business accounts. This is a very common recurring theme that Google rolls stuff out to personal accounts that are not in the business accounts. OpenAI does the same thing, but it's on a much, much shorter horizon. Like, usually it's OpenAI did a thing and then like a week later it's in teams and enterprise Google.
Mike Kaput
Yeah.
Paul Raitzer
It could be months or never. Like, you just don't know. And it's, it is very frustrating as a Google Workspace customer that, like, you have no idea.
Mike Kaput
Yep.
Paul Raitzer
And it's not communicated to you?
Mike Kaput
Yep. Well, like we talked about, this is the importance of literally just going in and kicking the tires of these tools. Because no matter what we say or anyone else posts, just go in and try for yourself what's available. Because you won't know for sure until you actually do that. No one's. Very few people are going to like, publish documentation that's useful on this stuff.
Paul Raitzer
Yeah. And on that same note again, and not to harp on Google here, but, like, this is my major frustration with using Gemini is we use custom GPTs all the time. Yeah. And I still can't publicly share a gem I create. I can't even share a gem with my team. So, like, I'm trying to use Gemini more because I Actually really like the model, but it becomes, it breaks down for me because I can't, I can't share these things. So, yeah, drives me nuts.
Mike Kaput
All right, a couple other topics here before we wrap up this week. So, WPP Media has launched Open Intelligence, a sweeping new AI driven marketing system built around what they call the first ever large marketing model. Now, unlike the language models behind tools like ChatGPT, this one they say, is purpose built for advertising. Since they are an advertising agency, it is trained on trillions of real world data signals, everything from purchase behavior to cultural context across 350 partners in 75 markets. Not to mention it doesn't depend on user identifiers. WPP is pitching this as what they call, quote, intelligence beyond identity. This is a shift away from cookie based tracking. The idea is to basically give clients their own predictive AI model built on a mix of public and first party data. Something that can forecast behavior, optimize ad spend and adapt to a world where it's harder and harder to track people based on user identifiers. It is also a full stack solution. It's connected to platforms like TikTok, Meta and Google, and it is built for secure collaboration using some federated data technology that they have baked in. So that means clients never have to move or expose their raw data. So, Paul, this idea of a large marketing model is pretty interesting framing. From what I'm reading about this, it kind of sounds a bit like WPP is becoming a model provider. They're basically granting clients access to these bespoke AI models they're building on top of this foundation model. Like, what are some of the implications here, here for agencies?
Paul Raitzer
Yeah, it is an interesting play. Maybe, maybe that is the future of agencies. I don't, I don't know. I think as we heard about earlier with like the big four consulting firms, the big agencies are probably in similar boats. It's challenging. Market profits are probably being threatened by pricing pressures. You know, people want things done faster, cheaper. I don't know, like, I would love to see this thing at work, honestly, so.
Mike Kaput
Right.
Paul Raitzer
And I've told this story before, but like anybody who's new to the podcast, this is how it all started for me. So back in 2011, when I started researching AI, it was actually for one specific use case, which was what I was calling a marketing intelligence engine that would largely automate strategy. It would consume data on all previous campaigns, it would run predictive models, it would take in, you know, ideally anonymized data. So imagine you're like HubSpot and you have all this data of, you know, potentially millions or billions of campaigns that have been running and that you could take that data and predict what to do next, like say, hey, I'm in retail and I want to achieve this goal in terms of customer retention, like, what should I do? And it could go and analyze a million customer retention programs and then like predict for you what to do next or ad spend or, you know, email pro, whatever it was. So my theory back in 2011 was, well, this will have to happen, like someone's going to build this. And then I, you know, quickly realized no one was building it and no one in marketing was even thinking about this stuff, it seemed at that time. And that's what led to me eventually writing about the marketing intelligence engine in 2014, which then became the impetus to build Marketing Institute. So like, as soon as I see anybody who seems to be approaching this idea of like some form of intelligence engine, my ears sort of perk up. And yeah, I don't know if this is anything close to what I was originally envisioning, but I'm definitely intrigued. Buy it and I would love to kind of see this at some point.
Mike Kaput
Our last topic this week. Google has just launched a new experiment called Portraits. This is an AI experience that lets you have interactive conversations with digital versions of real world experts. They're kicking things off by featuring one of these portraits with leadership coach and the author of Radical Candor, Kim Scott. So instead of generic chatbot answers, you basically can get a conversation and coaching inspired directly by Kim's actual work. In this case, her avatar speaks in her voice, draws from her real content and responds to your questions using Google's Gemini model. Now, the experts themselves are part of this process. They contribute their own material. They approve the avatar's tone, they guide how the AI should respond. Now, it's still early. This is an experiment. Google is collecting feedback to improve this over time. It is only available in the US and only for users 18 and up. Now, Paul, despite the fact this sounds just like kind of a fun experiment right now from Google, the moment I saw this, I couldn't help but think about the implications for like online education, learning, coaching. Like, if these worked really well, I'd almost want one for every notable expert out there who I follow or the top people in the space. I'm interested in like learning about.
Paul Raitzer
Yeah, I, man, I feel like we could spend some time on this one. So my first take is this is infinitely doable. Like I think within a year or so is this in their, like, labs or studio. Is that where they're testing this? It's in labs.
Mike Kaput
It's actually hot. Yeah, it's in labs.
Paul Raitzer
New experiment. Google labs. Yeah. So they have a history of, like, when it's in labs, it's. It's not a fully baked product, but it's pretty close. And we. You'll usually see within six months to 12 months, if it's viable, that thing is released. So the fact that they've done this, which means they've done it internally already, and now we're seeing the first public facing sort of MVP here. So let's assume within 12 months to 18 months, this is doable. Someone has built this at Y Combinator. Like, someone's built the tech now where you can easily turn yourself into one of these things or you can pay for access to people who've licensed their likeness to be one of these things. I think Facebook is even going down this path with, like, celebrity avatars and stuff.
Mike Kaput
Yeah.
Paul Raitzer
So it's interesting. Like, I don't know, like, the first name we can is Demis Hasabis. I obviously cannot call up Demis Hasabis and ask him questions about AI. I would love to ask Demis Hassabis questions about it. I have a million of them. Would I pay for access to an avatar of Demis to, like, talk to about AI? I don't know. Like, if I take any of my favorite authors, like, would I pay for access to a digital version of them that I know may be hallucinating and is just, like, trained on some of their data? I don't know. Like, I'm not sure. I'm sure there's an audience of people who. Would Taylor Swift say Taylor Swift agrees to, like, build one of these things. Would Taylor Swift fans pay to talk to Taylor? I'm guessing yes. Like, I would think that that's probably a thing.
Mike Kaput
Yeah.
Paul Raitzer
And then the other side is like, would you allow yourself to be turned into one? So if you're a thought leader, a podcaster, an author, whatever, an entrepreneur, would you allow yourself. Would you, as a brand, allow your executives to be turned into them? I don't know. I mean, it presents all kinds of.
Mike Kaput
Right.
Paul Raitzer
Interesting questions, but I would assume this is sort of an inevitable. There's a market for this, for sure.
Mike Kaput
Yeah.
Paul Raitzer
How quickly it played out, I don't know.
Mike Kaput
Yeah. I wonder where that line is between. In certain scenarios, I could see us adding a ton of value. In other scenarios, I could see it really watering down the value of the personal brand. Too.
Paul Raitzer
Yeah, I like. So my initial reaction is like, I have no interest in being one of these. Like, if there was a market for people that wanted to talk to me as an AI avatar, I don't think that that's something I would personally be interested in doing.
Mike Kaput
Yeah.
Paul Raitzer
Would I pay for one? Probably not. But, like, I don't know. This is an interesting one. Yeah. So ask yourselves as listeners. Like, these are the kind of questions we may have to deal with.
Mike Kaput
Yeah. I also wonder, too. I don't have no idea what the strategy would be here and haven't really thought through it. But also, if you see a stable of all these as part of your Gemini subscription. Right.
Paul Raitzer
Yeah.
Mike Kaput
That maybe that's interesting to people who might either switch or, like, consider paying for Gemini. I have no idea.
Paul Raitzer
Yeah. Yeah. I don't know. I have to. I'd have to think about this one a little bit more. But it is interesting and I'm sure these are actually going to be everywhere. Like, if you think about, like 11 labs and hey, for sure Google and OpenAI will probably get into this world. Facebook character AI. Like, this is sort of the inevitable.
Mike Kaput
Thing, all while saying, we don't want you to form too close of relationships with AI.
Paul Raitzer
Yeah. Oh, this is a little quick side note to end. But, like, have you seen the V3 videos? The. The vlogs that are being created by historical characters?
Mike Kaput
Oh, gosh. Didn't they do one with, like, Bible stories? But there's.
Paul Raitzer
There's one I saw with Bigfoot. So if you. If as a listener, if you haven't seen this yet, I don't use Tick Tock anymore, but I know it, like, sort of had its origins on Tick Tock. So I'm seeing it more on X where people are sharing stuff from Tick Tock, but people are using VO3 to create these, like, super realistic vlogs, like YouTubers that are. I saw one with Stormtroopers. You love that one. There's like, Stormtroopers in the middle of battles and he's like, vlogging for YouTube about what's going on and yelling at the other stormtrooper. I saw him with Bigfoot where he's trying to hide from humans. It's. It's amazing. There's ones like, historical stuff people are creating. That's so cool. Oh, and Moses was hilarious. Like, we're at the sea. I don't know what we're doing now. And then he's like, walking through the water. You go, that's amazing. So, yeah, if you want a lighter side of AI. Go search for like the, the vloggers that are using VO3.
Mike Kaput
It's so cool. All right, Paul, as always, thanks for unpacking another very, very busy week in AI.
Paul Raitzer
All right, thanks Mike. We will talk with everyone next week and oh, we will have. I gotta double check this, but we will likely have two episodes next week because we have an Intro to AI class on Tuesday. So when you hear this, it's probably gonna. It might be too late to join our Intro to AI class, but we will turn that Intro to AI class into one of those AI Answers episodes. And so the following week, what would that be like the week of the 16th, 17th?
Mike Kaput
Yep.
Paul Raitzer
Yeah, we will likely have a second episode. I have to. I'm traveling next week so I have to double check my schedule. But yeah, we, we should probably have two episodes coming up next week. So our weekly on Tuesday like always and then an AI Answers episode the following. All right, thanks Mike.
Mike Kaput
Thanks Paul.
Paul Raitzer
Thanks for listening to the Artificial intelligence show. Visit SmarterX AI to continue on your AI learning journey and join more than 100,000 professionals and business leaders who have subscribed to our weekly newsletters, downloaded AI blueprints, attended virtual and in person events, taken online AI courses and earned professional certificates from our AI Academy and engaged in the Marketing AI Institute Slack community. Until next time, stay curious and explore AI.
The Artificial Intelligence Show - Episode #152 Summary
Release Date: June 10, 2025
Hosts: Paul Raitzer and Mike Kaput
In Episode #152 of The Artificial Intelligence Show, hosts Paul Raitzer and Mike Kaput delve into a range of compelling topics shaping the AI landscape. Recorded on June 9th, the episode navigates through major product updates, the evolving relationship between humans and AI, the burgeoning impact of AI on the job market, and significant legal developments surrounding AI data practices. The conversation is rich with insights, expert opinions, and actionable advice for businesses and professionals aiming to harness AI's potential responsibly and effectively.
Paul kicks off the episode by emphasizing the importance of adaptability and continuous learning in the face of rapid AI advancements. He states,
“It doesn't matter when AGI arrives... All that matters is what you can control, which is get better at this stuff every day” ([00:00]).
Mike Kaput introduces OpenAI's recent announcements, highlighting the introduction of Connectors and Record Mode:
Connectors: Allow teams to integrate tools like Google Drive, HubSpot, and Dropbox directly into ChatGPT. This enables the AI to access and synthesize data from these platforms to provide more accurate and contextually relevant responses.
Record Mode: A feature that transcribes meeting audio and assists in generating follow-up documents via OpenAI's Canvas tool.
Additional updates include Codex, now with internet access capabilities, and significant enhancements to ChatGPT’s voice mode, making interactions more human-like.
The conversation shifts to the privacy and security concerns surrounding the new Connectors. Mike shares his excitement about the potential of Connectors, especially the HubSpot integration, but also raises red flags regarding data privacy:
“I kind of screech to a halt thinking about the privacy and security implications” ([06:47]).
Paul elaborates on these concerns, noting that while Connectors can enhance productivity by providing faster insights, they also pose significant risks. He warns that enabling these features without proper governance can lead to unauthorized data access and potential leaks of sensitive information.
“You have to understand there's a difference between actual ability and simulation... But the simulating of the ability creates the perception that it actually has it. And that's really all that matters when we look at the economic impact and the impact on our lives and our own emotions” ([25:53]).
Key Takeaways:
The hosts explore OpenAI’s voice enhancements in ChatGPT, which offer more natural and fluid interactions. Both Paul and Mike express awe at the progress:
“It's like, you know, for years, they, the labs steered away from making them too human, like, and I think wisely so” ([16:34]).
Paul discusses the balance OpenAI seeks between creating empathetic AI and avoiding the illusion of consciousness, highlighting the ethical considerations in designing AI personalities.
Mike introduces an essay by Joanne Jiang from OpenAI, discussing the emotional bonds users form with AI:
“Some people call it a friend. Others say it feels alive” ([18:16]).
Paul expands on this, emphasizing that while AI isn't conscious, its ability to simulate empathetic interactions can fulfill genuine emotional needs. However, this also raises concerns about dependency and the potential weakening of human connections.
Key Points:
The discussion turns to AI’s impact on the job market, highlighted by recent developments:
Business Insider Layoffs:
PwC’s 2025 Global AI Jobs Barometer Report:
National Bureau of Economic Research’s Working Paper:
Paul and Mike discuss the dual nature of AI’s impact—while it can drive efficiency and create lucrative opportunities for those skilled in AI, it also poses risks of job displacement, especially in roles susceptible to automation.
“You can stand still or you can accelerate your AI literacy and capabilities... you have the greatest chance to figure out what happens next in your job and in your industry” ([33:23]).
Recommendations:
OpenAI is now compelled by court order to store deleted ChatGPT conversations indefinitely due to ongoing litigation with The New York Times. This decision affects users with Free, Plus, or Pro accounts, necessitating businesses and individuals to reassess their data privacy strategies.
“This order is unprecedented, sweeping, and a direct threat to user privacy” ([44:20]).
Implications:
Google DeepMind released a white paper on securing Gemini 2.5 models against indirect prompt injections—attacks that embed malicious instructions within regular content. Their multi-layered defense strategy includes automated red teaming to identify vulnerabilities.
“AI models and systems can be exploited in these unique ways outside of traditional cyber attacks” ([48:22]).
Takeaway:
Balaji Srinivasan highlights the verification gap in AI, where the rapid generation of AI outputs outpaces the ability to verify their accuracy. This creates a bottleneck, especially in areas requiring expert validation like deep research.
“Verification does not really scale” ([53:48]).
Impact:
Reddit has filed a lawsuit against Anthropic for illegally scraping its platform to train the Claude AI model. Unlike typical copyright cases, this lawsuit focuses on the unauthorized use of user-generated content.
“Anthropic unfairly exploited a rich archive of user conversations to build a commercial product” ([72:09]).
Consequences:
Google’s NotebookLM now allows users to share notebooks publicly with a single link, enabling interactive exploration and collaboration. While currently limited to individual sharing, this feature heralds increased accessibility for knowledge sharing.
“Anyone can publish a notebook... Viewers cannot edit the source material” ([73:25]).
Benefits:
WPP Media launched Open Intelligence, an AI-driven marketing system built around a large marketing model tailored for advertising. It leverages trillions of data signals to forecast behavior, optimize ad spend, and adapt to privacy-focused environments.
“Intelligence beyond identity” ([76:51]).
Implications for Agencies:
Google introduced Portraits, an AI experience that allows interactive conversations with digital avatars of real-world experts. The first featured portrait is of leadership coach Kim Scott, who has approved the avatar’s tone and responses.
“This could almost be a fun experiment right now from Google” ([80:30]).
Potential Uses:
Episode #152 of The Artificial Intelligence Show offers a comprehensive exploration of significant AI developments and their multifaceted impacts. From OpenAI’s innovative features and the ethical considerations of AI-human relationships to the shifting job market dynamics and pressing legal issues, Paul and Mike provide listeners with valuable insights and practical advice. The rapid-fire segment further ensures that no critical topic goes unnoticed, making this episode an essential listen for anyone keen on understanding and navigating the evolving AI landscape.
Upcoming: The hosts hint at additional content, including an Intro to AI class and possibly two episodes in the following week, ensuring continuous learning and engagement for their audience.
For more detailed insights and to stay updated on the latest in AI, visit SmarterX AI and join the thriving community of professionals advancing AI literacy and application.