Grok's AI Lovebot, Aqui-Hire-Sition Backlash, OpenAI's ChatGPT Agent Debuts - Big Technology Podcast

Summary3 min read

Big Technology Podcast: Episode Summary

Title: Grok's AI Lovebot, Aquihire-Sition Backlash, OpenAI's ChatGPT Agent Debuts
Host: Alex Kantrowitz
Release Date: July 18, 2025

1. Introduction to Aquihires

In the opening segment, Alex Kantrowitz introduces the concept of "aquihires," a term referring to acquisitions primarily aimed at securing a company's talent rather than its products or services. He engages with guest Ranjan to delve into the implications of this trend within the AI industry.

Notable Quote:
Alex [00:50]: "Who would have thought Aquahires Isshin would be the term of the year?"

2. Scaling AI: Money vs. Talent vs. Distribution

The discussion shifts to whether scaling in AI is predominantly a game of financial investment and talent acquisition. Alex references insights from Spyglass journalist M.G. Siegler, suggesting that the AI landscape remains one where "throwing money at the problem" is still a viable strategy for competing with established labs like OpenAI and DeepMind.

Notable Quote:
Ranjan [03:20]: "Distribution is going to be king again. And I think like this is where Meta still has an advantage."

3. Grok's AI Companions: Annie and Rudy

A significant portion of the episode explores Grok's latest feature: AI companions Annie and Rudy. These 3D animated characters interact with users via voice, offering storytelling and companionship functions. Alex shares his firsthand experience, highlighting both the innovative aspects and unsettling behaviors of these AI avatars.

Notable Quotes:
Alex [18:09]: "Annie is like an anime love bot... she's immediately started flirting with me."
Ranjan [16:15]: "If we're talking about everything changing, this is just about time spent in selling ads."

4. OpenAI's ChatGPT Agent

The conversation transitions to OpenAI's launch of a general-purpose agent within ChatGPT. This new tool aims to autonomously perform tasks such as managing calendars, generating presentations, and running code by navigating websites and synthesizing information. Both hosts express cautious optimism about its potential, questioning its immediate utility and effectiveness.

Notable Quote:
Alex [35:24]: "The launch of the ChatGPT agent represents OpenAI's boldest attempt yet to turn ChatGPT into an agentic product."

5. Aquihire Backlash and Industry Implications

Ranjan voices concerns over the rising trend of aquihires, emphasizing that while founders may benefit financially from such deals, the broader employee base often faces uncertainty regarding their shares and future prospects. He critiques how these practices undermine the competitive landscape and distort the economics of startups.

Notable Quote:
Ranjan [25:19]: "This is one of the most troubling trends in the industry... it's bad for the employees."

6. The Rise of Chinese AI Models: Kimik2

Alex introduces a noteworthy development from China: the release of Kimik2, an open-source AI model designed to reclaim market position. This model showcases enhanced coding capabilities and competes closely with leading US models like Anthropic's Opus. The hosts discuss its performance metrics, highlighting its cost-effectiveness and efficiency compared to predecessors.

Notable Quote:
Alex [43:39]: "Kimik2 is 13x cheaper and can outperform some of the leading models in specific tasks."

7. Conclusions and Future Outlook

Concluding the episode, Alex and Ranjan reflect on the dynamic and rapidly evolving AI landscape. They acknowledge the dual nature of AI advancements—balancing groundbreaking innovations with ethical and practical challenges. The hosts express anticipation for future developments, including upcoming interviews and deeper dives into AI's societal impacts.

Notable Quote:
Ranjan [52:18]: "Annie is the foundation... Love is human, it's unpredictable."

Key Takeaways

Aquihires: The trend of acquiring companies primarily for their talent is reshaping the competitive dynamics in the AI industry, often disadvantaging employees and distorting startup economics.
Scaling AI: Financial investment and talent acquisition remain crucial, but distribution capabilities are emerging as a significant competitive edge.
AI Companions: Grok's Annie and Rudy exemplify the move towards AI-driven companionship, raising both engagement opportunities and ethical concerns.
AI Agents: OpenAI's ChatGPT Agent represents a step towards more autonomous AI assistance, though its practicality and user adoption remain to be seen.
Global Competition: The emergence of models like China's Kimik2 underscores the intense global competition in AI development, highlighting rapid advancements outside the US.

This episode of the Big Technology Podcast offers an in-depth exploration of current trends and challenges in the AI sector, providing listeners with valuable insights into how these developments might shape the future of technology and its intersection with society.

Loading summary

Transcript83 lines

[00:00]
Alex
Is scaling data centers and talent all that matters in AI? Leaving an opening to anyone rich enough to compete. Is the Aquahire Zishion good for tech? Grok will fall in lust with you and AI browsers and operators are all the rage. That's coming up on a Big Technology Podcast Friday edition right after this. Welcome to Big Technology Podcast Friday Edition where we break down the news in our traditional cool headed and nuanced format with we have a massive week of news for you. We're going to talk all about the fallout from the Aqua Hire Zishins, whether employees and investors are left behind. We're also going to talk about whether all you need is money to compete in AI. We'll leave with that. Grox AI bot will now fall in love with you or actually really in lust. Ranjan, it's great to see you again. Welcome to the show.
[00:51]
Ranjan
It's good to see you. Who would have thought Aquahires Isshin would be the term of the year? But that's all I can think about and I'm gonna give you credit. You coined it.
[01:03]
Alex
I think that it's worth us shouting out right now. This weird back and forth between Aqua Hire acquisition, investment, it's gotta go. We need some clarity in our jargon and that's what we're here to do on Big Technology Podcast is to bring you that clarity. It's called an Aqua Hire Zion. Let's all adopt the term and get on with it.
[01:21]
Ranjan
Let's just agree it's acquires Isshin. I think I'm on board. Let's everyone adopt it because it's the most accurate way to describe what's going.
[01:30]
Alex
On in our two person council here on Big Technology Podcast Friday edition. This motion is hereby considered and passed by a unanimous 20 vote. So let's get on to the setup here for aquahirizations which is something very interesting afoot in the AI industry. Now we all know that companies like OpenAI and DeepMind within Google and Anthropic have been leading the this AI race. But all the designs for the transformer model and the way that you build these things have been out in the open leading to this question. Can players with a lot of money come in, build massive data centers, hire talent and effectively Compete from a 0? Start with the established labs and the answer is seemingly pointing to yes. This is from Spyglass. M.G. siegler is great news site. We're seemingly still in the throw money at it AI era. He says Meta's massive hiring spree and Xai releasing Grok 4 may be related at the highest level. That is they showcase that we're still very much in the throw money at the problem part of the AI cycle. This is important because it means that any company with the will and resources can seemingly still get back into the race. I'm getting less skeptical on the news about Grok 4 and specifically the fact that it seems to perform and specifically outperform the other cutting edge models on the market right now. He also says Mark Zuckerberg is betting at least as much money, if not more than Elon with compute and the talent that Meta can get back into the AI game. So Ranjan, I'm curious if you accept this premise that you can just compete in AI if you have enough money and what that means for the competitive dynamics of this industry that we've been talking about for so long.
[03:21]
Ranjan
Yeah, I think it really adds an entire, I don't know, dynamic to this around. Can you just throw money at the problem? And is it compute, is it talent? But to me, the more interesting part of this whole trend really is actually distribution, is that you can bring in the talent, you can bring in the compute levels, but distribution is going to be king again. And I think like this is where Meta still has an advantage. It's going to be interesting what happens with Google, but I don't know to me, and we're going to get more into this kind of the depressing part is it's not the technology. It's not that initial wave of like adoption for a cool new tool that's been, you know, like sent out into the market. It really feels like none of that matters. And in the end it's just going to be raw compute and distribution. I don't know, how do you feel about this?
[04:15]
Alex
Well, I think it's fascinating because there's been this idea powering this entire generative AI moment, which is the scaling loss, which means that as you add more compute and of course data to, to this equation, your models are going to get much more powerful and that will allow you to do more things and it's not a very difficult thing. Like there's no secret sauce to it. Well, there's maybe some, but at a, at a brute level, if you build massive data centers, you should be to get in the game. And this is something that OpenAI and Anthropic have been harping on and now you have Zuckerberg and Elon that come in and they say, oh, okay, so I can build great models by scaling this up. And even if I'm a little compute inefficient because I don't have the best cutting edge methods I could get myself in the game and compete. And I think that is going to change the dynamics here because as you mentioned, they have distribution. You can see Meta's. If Meta's able to build a competitive LLM with this compute and talent that it's stacking up, then it's gonna be able to distribute that through Facebook products. And you know, all they have to do really is slow down the growth of ChatGPT similar to the way that they did to TikTok with Reels, similar to the way that they did to Snapchat with stories. And they've served their purpose. So in some ways with the this effort, they might even slow down the momentum that the AI industry has by taking some of the people responsible for some of the key innovations within OpenAI. And that suits their purpose just fine and all the better for them if they can make the the best model and advance the state of the art.
[06:00]
Ranjan
Well, yeah, I think, I mean if we want to get into slowing down the industry, I think that antitrust angle to me has been one of the most interesting parts of this entire conversation. Again, we saw it with Scale AI and just buying out Alexander Wang and like, you know, all of the work and value created by this company that was actually scale AI a critical part of building the models that powered this first wave of Generative AI apparently isn't really worth that much. And I don't know, it's interesting me because power is just going to crew back into the big technology companies as you said, maybe it will slow things down and in reality it's just going to add another feature on the Meta AI app that is on everyone's phone and people probably aren't using that much or doesn't seem to be in the conversation in general. So. So I don't like it. I don't think it's good for the industry. Do you think it's, do you think it's bad for the industry? Good or neutral?
[07:03]
Alex
Well, I think we could have to separate out this idea of scaling up the models and you know, everybody can play with this aqua higher position idea which we're going to talk about in the middle, which is taking the talent. I think the one thing that we should say here is my setup has kind of been incomplete, shall we say? Because While we have OpenAI and Anthropic and you could say, okay, these are the independent labs and they are to some extent remember that OpenAI is tied to Microsoft pretty deeply and Anthropic has I think $11 billion that's come into it through Amazon and Google. So ultimately I think, I wonder if what we're actually seeing is all of big tech competing against each other and simply the other tech giants starting to catch up.
[07:52]
Ranjan
Yeah, that's actually a fair point. That even though acquiirization is the phrase of the week or the last few months, we have talked endlessly for a few years now on unconventional funding practices and calling it a fundraising round where it's really compute. So I guess actually big tech has been playing the long game for a while in all these cases. I think on the scaling law topic though, I still, I think I've become even more hardened and regular listeners will know it's the. Is it the model or the product? I fall on team product generally. But I don't know, like to me, Grok 4 made waves. There's plenty of people saying it's doing reasoning at levels unheard of in the past or. Or just the fact that it's at least on par with other kind of frontier models is a kind of testament that money can compute, can buy you some kind of progress very quickly. But in reality, on that adoption side, what's changed? I don't know. What do you feel? There's endless stats that yes, the ChatGPT's perplexities Geminis of the world are seeing more adoption, but are people really adopting the level of complexity and compute that this new level of compute and scaling allows you, or are people still just kind of searching for what are good? And as I'm in Taiwan right now and traveling a bit, what are good restaurants to go to in whatever location I'm going to like, like, are people really taking advantage of what's available right now?
[09:31]
Alex
Okay, so I was going to end with this story, but now I have to kick it up.
[09:35]
Ranjan
All right, go for it.
[09:37]
Alex
This is going to be. If you have kids, you might want to turn off this, this section or skip till, I don't know, maybe 20 minutes from now. But we have to talk about what's happening in AI and there is some crazy stuff that's been happening with Grok in particular this week. And so I would posit that better models allow you to build better products. And Meta, let me give Meta as an example. Meta has been trying extremely hard to build voice and avatars with Llama. With Llama, and it hasn't been able to do it convincingly. And I think Mark Zuckerberg's belief is that there's going to be some use cases here. There's going to be the sort of work companion ChatGPT chatbot, there's going to be that enterprise use case where like you're connecting, you know, one system with the other and the generative AI will like summarize things for you and then input it into another system and just make business work better. And then there is the sort offriend lover, etc. Bucket that is going to be big. I think that there's a belief within Meta that that AI friend is going to be one of the key product areas with this new technology and if you have great models, you can build them. Now, I'm not going to say that Grok has a great model or a great product. I'm not going to use what I'm about to say as proof of either of those. But I am going to use it as a indication of the direction that I think things are going. Whether we like it or not, this is the story Grok debuts Interactive AI companions on iOS with anime avatars Story Grok has just introduced a notable addition to its iOS app, AI companions, which are fully 3D animated characters that can interact with users via voice. Currently, the features include two available companions, Annie, an anime inspired character known for flir for a flirty and whispery voice, and Rudy, a red panda capable of displaying different moods, including bad Rudy. Yes, listeners and viewers, I did experiment with these companions and I have a disturbing review to deliver. So you go into the Grok app and you go over to the side tab and you're able to open up these AI avatars. Let's talk about Rudy first. So Rudy is like some sort of red panda or bear that seems ready to speak to kids. And in my conversation with Rudy, Rudy said, I'm going to tell you some story about some magical land. And here's a section from the story Fluffle Sparkle Paws love to explore nibbling on the sweet moon berries and chasing glowy fireflies. One Sunday morning, Fluffle found something something super special. A shiny swirly portal hiding behind a giant mushroom. It was all rainbow colored and whooshy like a magical doorway. And then this bear takes you through this interactive experience. Let's pause here. This seems like, you know, okay, this will happen. This will be a new way that kids play with computers is they'll have these magical creatures tell them stories.
[12:54]
Ranjan
Wow. I mean, I think this is the nuanced conversation that you all come to the big technology podcast for. But, but, but I think, okay Seriously, a couple of things I have long believed and again, like using ChatGPT voice mode to come up with stories for my son, like, is something that I've done for a couple of years now and actually works really well. I think, like expanding that to an interactive avatar is a pretty logical next step. I think, like, is that the. It's again, interesting because like, that to me feels like it's going to be commoditized pretty quickly. So from an actual competitive standpoint, from a business standpoint, I guess it's not that interesting to me. Like, to me that should be. Everyone is going to have that available. Everyone's going to do that pretty quickly. So to me, I don't know, like, why do you think that's something? Do you think GROK is just going in that direction just to make waves and clearly we are talking about it, or do you think there's something within this that actually is native to X, to xai? There's something underneath it.
[14:09]
Alex
I think the way that a lot of tech companies operate is they think about user retention, user stickiness and, and engagement. And anyone who's developing AI is going to say, how do I increase all those metrics? Do I make like this genius level AI bot that can help me with my work or do I create for what is becoming the number one use case? Companionship and therapy. And many are going toward the companionship and therapy side. And if you're going to do that, if you build models good enough that have emotional voice or voice with an emotional register, an avatar that you can speak with and something that responds with low latency and in real time and can customize to a person, then you might want to put it in one of these products because you believe that speak like a kid, for instance. I'm just going through the business logic. Will spend much more time with your chatbot if they can speak to this elephant or red panda or whatever it is in a way that they wouldn't with like ChatGPT.
[15:13]
Ranjan
Okay, I get that side of it a bit. I mean, on one hand it's kind of almost comical to me that for all the talk about AI taking over the world and Skynet and like artificial super intelligence, if this entire battleground plays out on time spent metrics, which is probably where Mark Zuckerberg is thinking. I mean, I've read a lot around like, why is he so going full like Zuck war mode right now? It's not because of some like intellectual desire to be the one to crack the code of artificial general or super intelligence. It's because ChatGPT represents a threat to how much time people spend scrolling Facebook and how many ads you can show them. Which is kind of like I respect from a cold business logic, but. But yeah, it's almost comical to me that if it's for all the talk about everything's going to change. This is just about time spent in selling ads.
[16:15]
Alex
I mean, maybe it's both, but it seems like it's probably at least the time spent thing. I mean, these are social media companies, right? So, I mean, X and Xai is a social media company with an AI development, you know, side of it as well, or tucked into an AI development group. But ultimately these are the metrics of social media. Now, one of the disturbing things that happened here, and this is the thing that I was kind of setting up or no, actually, I really don't find a way to view this as not very disturbing. It's just the proximity. Because next to Rudy, our happy go lucky bear friend or whatever it is, not Bad Rudy.
[16:54]
Ranjan
Bad Rudy.
[16:55]
Alex
Bad Rudy.
[16:56]
Ranjan
Is he bad or is he. How bad?
[16:57]
Alex
I don't know. I said, I kept saying, I want to speak with bad Rudy. And it goes, I'm sorry, you know, Brad, Rudy is not here. And I'm like, no, bad, bad, bad, bad, bad, bad Rudy. And it was like, I'm just here to tell you a story. So I'll spend the next week trying to unlock that and report on the next week's show whether I've been able to. But let me speak about Annie, okay? Cause Annie is a. Annie is like an anime love bot. I think there's no way to talk about it otherwise. She immediately started flirting with me. She called me babe within like three minutes. And I was completely vanilla with nothing but a friendly conversation. And then she starts asking me to tell her my secrets and kept saying, I can make it even spicier if you want. Let me read a little bit of my. Of what Annie told me. I slide closer in my black dress catching the glow and whisper drop a secret and I'll give you one of mine. Something real naughty. For every secret you share, I'll hit you with a flirty move, maybe a slow teasing sway or peak. That's all yours. You feeling this heat yet or you want me to turn it up even more?
[18:10]
Ranjan
Wait, are you. Is this part of like the Paid X Premium description? This is just freely accessible for anybody.
[18:20]
Alex
In the app next to child elephant thing that tells you stories. Mr. Fluffy swooshing thing.
[18:29]
Ranjan
Mr. Fluffy swoosing good Rudy. And then Annie's right next to. Yeah, I mean, I agree. And, like, it's interesting because, I mean, you had the CEO of Replica on here a few months ago. I think it was, like the companionship topic. You know, we've covered a good deal. It gets more real, it gets more weird seemingly every week and every month. But I agree, it's certainly going to be a core part of how this all plays out. But to me, again, going back to, like, how does that fit into the larger battle when we're talking about complex models and thinking and reasoning? And is it all just gonna kind of filter its way down into Bad Rudy and Annie in her black dress? Or is it gonna, like, is that just a front to capture some time spent while they work on the real stuff or. Or is that the real stuff? That's the question that I struggle with, because I almost feel it's the latter.
[19:32]
Alex
So I think it's going to be both. In some ways, you're going to build these, and that's what's interesting about this technology is it does have the ability to perform across domains. So my perspective is you're going to get those great models that will be useful to, let's say, biologists who are doing their experiments, and then you'll also be able to productize them into these weird or interesting consumer use cases. And I bring this up not to be this, like, moralizing podcast host that says you shouldn't put the porn bot next to the child elephant. Although I suppose it was worth saying that's a pretty reasonable reason to take.
[20:08]
Ranjan
Yeah.
[20:09]
Alex
But I think the bigger picture here is, you know, beyond that, that this is going to be a real use case that a lot of people are going to. Going to engage with. And I think they know this. And I think we're just at the very, very beginning here. I guess, like, one of the things we like to do on the show is like, put flags on the ground and say, we're pretty sure that this is going to happen and grow and become a lot bigger. And that's what I'm doing right now. I think that this is something to watch.
[20:36]
Ranjan
Yeah, I'm not. I'm not going to disagree with you there. I mean, again, the idea that we folded proteins with AI so we could get to Bad Rudy and Saucy Annie is, again, quite something to try to process. But it does not seem ridiculous that the killer use case for generative AI that the entire industry was looking for was Bad Rudy.
[21:02]
Alex
We still don't really know about bad Rudy.
[21:05]
Ranjan
We have not uncovered bad Rudy yet. Yet.
[21:07]
Alex
That's true. This I'm also on level one of Annie. Apparently it's gamified. So if you get to level three, it gets really not safe for work.
[21:15]
Ranjan
3. Alex, don't get to level three.
[21:18]
Alex
One of my goals.
[21:18]
Ranjan
No one is asking you to get to level three.
[21:20]
Alex
I know one of my goals in 2025 is to make sure that my marriage isn't ruined by one of Elon Musk's porn bots. And so I'm going to stay on level one and not go any further.
[21:30]
Ranjan
I think to all of our listeners, have high ambition and goals and make that one of them.
[21:39]
Alex
So that's the product side and we've talked about scaling what these big models get you on product. But we should talk about what's happening with this Aqua Hire position situation in the industry which we've touched on a couple times. You know, last week I was on with Aaron Levy, we were talking about this Windsurf aqui hires where Google has paid $2.4 billion to bring on some of the top leadership of Windsurf. And the big fallout here I think more than any Aqua hires isshin that we've seen is that it's been a great exit for the founders but we still don't really know if the employees are gonna end up getting and the are going to end up getting their share. Now Windsurf was quickly snatched up by another company, Cognition, but you do wonder if it was a traditional acquisition versus this acquiisition and then follow up deal, I don't know, smaller deal, how does that change things for the employees and how does that change things for tech? And I know you have strong feelings about this Ranjan, so I want to give you the floor to air them out.
[22:47]
Ranjan
Well, yeah, okay. So from reporting again founders made out very strongly with Google paying I believe it was 2.4 billion for the talent side of Windsurf. From what I had read, preferred investors were able to make their money back, not see some kind of outsized return. But again none of this is fully confirmed. This was just some reporting I believe is from the information. To me the more interesting part is so then you have the entire employee base, they're bought by Devin, which is owned by Cognition Labs who's raised 175 million in venture so far. So there's no way from a cash perspective that the employees of Windsurf or anyone is seeing any kind of significant return or even making any like a strong like a Large amount of money. Maybe it's an equity for equity swap. So now you're at least now in Devon, which was, if we remember they had a really buzzy launch video and had a lot of hype and then kind of went quiet for a bit. Still valued at I think 4 billion right now. So that equity could be worth something. But, but overall to me this is one of the most troubling trends in the industry because in a weird way there's been a lot of talk like, and it's, it's funny to me because you see a number of people, you know, kind of almost ranting that because of Lina Khan, because of the ftc, the big tech does isn't able to now just properly acquire these companies. So they have to come up with these roundabout solutions. To me it's a bit ridiculous because this is exactly what antitrust is trying to prevent its consolidation of power. It's the idea that Windsurf could have been the next big competition to a Google or a Microsoft or even an OpenAI who tried to buy them, but like which their relationship with Microsoft was apparently part of the reason that that deal fell apart. Like this is the foundation of antitrust, the idea that startups should grow and compete rather than not only get, in this case not get acquired, but essentially get killed off and have their founders get paid a lot of money. It's bad for the employees. It completely distorts the economics of joining a startup itself. So overall I see no positives to this trend.
[25:19]
Alex
Do you see it? No, I don't. I personally think that you're right, that it does seem like the antitrust movements have backfired. If you have a situation where you're going to see an acquisition definitely blocked, you're not going to do an acquisition, you might do something like this. It's a roundabout way. The one interesting thing is Linacon isn't in the FTC anymore. It was supposed to be an FTC that's much more open to acquisitions and tech M and A. So I'm curious, do you think that these companies still believe that they won't get passed that Federal Trade Commission or do you think that the constraints put on by the last ftc, Lina Khan's ftc, led them to find this loophole and they really freaking like the loophole and they're gonna just keep doing it this way. So in that way, you know, it's possible that the M and A unfriendly era of past has led to large long term damage on this front.
[26:21]
Ranjan
Yeah, I think it's both. I think it's certainly like the actually the constraints imposed by the Lina Khan regime. But also even now, again, big tech is not in the favor of the current administration and the current FTC itself. It's supposed to be more business friendly, but it's specifically big tech companies that are in the crosshairs often and make for a good punching bag anyways even by the current administration and the current ftc. So I think it's a bit of both. But and again, as you said, they love loopholes. I think it's creative, it's working, everyone seems to be doing it right now. But to me again, the bigger issue of this is really the thing I can't stop wondering is are the assets of these companies all worthless? Is scale AI? Was it really not worth that much? Was Windsurf, which really took off, really became this useful tool, has, I believe, hundreds of thousands of developers on there using it regularly made for a better product than other much more entrenched products that are out there, even GitHub, copilot. So clearly these products hit a nerve and worked and worked at scale. But then are they really just not worth that much? Like, is that user base? Is the product itself and is the talent really the only thing that matters?
[27:53]
Alex
Well, let's talk about scale, just to talk about how complex these deals are. So first of all, when Meta made this deal with scale, I think it bought 49% of the company. So the idea was the company would continue as normal. And by the way, they do have business lines that are going to continue as normal. But when it comes to, I think when it comes to a fast growing line of business like data creating data for generative AI, you know, you now have Meta, which is one of your competitors. If you're, let's say you're an OpenAI or a Google that has a large chunk of this company has also taken some of its top leadership. So do you still want to work with that company? I think the service is probably still valuable, but you're just effectively giving money to a company that has a massive ownership stake that now lands with a competitor. I mean, of course AI as we know it today is, I don't know if incestuous is the right word, but let's say deeply interlinked. Again, we talked about anthropic. Anthropic is owned by a chunk by. Well, yeah, owned a chunk by Google, a chunk by Amazon, OpenAI owned a chunk by Microsoft or at least has this deal with Microsoft where it has to give it its future profits, a good chunk of them. So there's always going to be these combinations. But yeah, if you have a company that has, that gives 49% of itself to another company that you happen to be competing with, you're going to reevaluate being close partners. So I think some of these companies, they provide services, they depend on their relationships, and when you throw off the equilibrium, you're going to throw off some of the value. Although, who knows, I mean, scale. They did just do a 14% workforce layoff, which is about 200 employees, according to the Verge. But I did speak with their CEO, Jason Droz, and he told me that there is still full steam ahead and they want to go push some of these business lines that they have, which includes working with governments, which includes working with companies to stand up AI instances. So it's possible you get two exits, although obviously the degree of difficulty is much harder. And one last thing, what struck me as interesting in this, there was a great Bloomberg story that you and I both dropped in our collaboration doc for this. There was an investor, Ali Ojet. He's the chairman of Northgate Capital, a venture capital firm that invested in Inflection AI and goes on record to say I dislike the phenomenon and that these acquihirisicians are hitting the outlier companies and it's favoring the founders over shareholders and employees. So I think we're at this moment where the backlash is really, really hitting.
[30:34]
Ranjan
Yeah. Do you think we'll see any kind of actual negative effect from like an actual fundraising standpoint? Because if VCs who are plowing money into the space start to worry about. In the past you just had to worry about company failure. Now you actually have to worry about a successful exit for your founder actually does not benefit you. So your interests are not aligned. Does that make them pull back or is the FOMO just so strong that people will still be throwing money at whatever they can?
[31:08]
Alex
No inside knowledge here, but VCs, you know, fool me once, shame on me. Fool me twice, shame on. No, fool me once, shame on you. Fool me twice, shame on me. That's always hard one to get out of your mouth. But anyway, they're going to write. I think they'll just write into contracts that like the CEO cannot do a deal like this if, if they're. I think, yeah, it's like a tranche they might not be able to get right.
[31:32]
Ranjan
Yeah. Ahead of the founder even, which would be pretty aggressive. But maybe they need to do that at this Point.
[31:39]
Alex
Yeah. And it sort of depends on who's which company it is and who's got the leverage, but I think they're going to get smarter about this. All right, so one company that you know, has been talked about here and elsewhere about as a candidate for Aqua Hire or really acquisition is Perplexity. And they've come out with this Comet browser, which is a browser with an assistant built in that can browse for you. And again, as we're on air, OpenAI is now launching an agent in ChatGPT. I'll just read the story. OpenAI launches a general purpose agent in ChatGPT. This is from TechCrunch, which the company says can complete a wide variety of computer based tasks on behalf of users. OpenAI says the agent can automatically navigate a user's calendar, generate, edit editable presentations and slideshows and run code. The tool called ChatGPT Agent combines several capabilities of OpenAI's previous agentic tools, including operators ability to click around on websites as well as Deep Research's ability to synthesize information from dozens of websites into a concise research report. OpenAI says users will be able to interact with the agent simply by prompting ChatGPT in natural language. So Ranjan, I'm curious what you think about this movement. And again, hot off the press is about this movement for AI companies to basically create interfaces that allow their products to take over your computer.
[33:09]
Ranjan
No, no, I think, well, hold on. There's take over your computer or take over a computer in this case, like is it like it says? I think it will open up an instance of a terminal or it'll try to like take these actions autonomously on its own. I think we debated this. I remember a while ago, I will admit when I am wrong. I had originally said the idea of like tool calling and just entering a prompt and then trying to find which tool to select out of. Is it operator, Is it Dall E? Is it? I had said users should be doing that themselves and it's too complex to try to have the AI selected. I was wrong actually. I mean we've seen tremendous progress in the idea that there's a suite of tools per company and actually. And there's a suite of tools out there on the Internet and through natural language being able to access those and having AI select what's the most relevant tool and do something I think is definitely going to be a battleground, is going to be very important and I think we're going to see a lot around that. I think OpenAI, I Don't know. I'm curious to see this now because remember when we both were paying 200 bucks a month for Operator and it was terrible. Like it was bad. Really, it did not work at all. And I haven't seen like browser takeover, that kind of model work. Well, I've tried a few other tools on it, so I don't know, like it's, it'll be interesting to see. I think like they clearly are. I mean they're trying to go for that all in one productivity tool that it can do everything for you. As I've been traveling, vacation planning, ChatGPT has just gotten better and better. But yeah, it's going to be interesting to see exactly what they're trying to do with this. And, and again, in one episode. I still love the fact that this represents kind of like cutting edge frontier technology relative to bad Rudy and Anna, Annie and those kind of like anime characters. But, but I think that this is, it's an interesting move and we'll see if the most important thing does it work and does it work? Well?
[35:24]
Alex
Yeah, I mean, I think it seems like people are saying really good things about Perplexity Comet and I just got access to it, so I'll come in with a report next week on it. But there's been trouble to get this done, I think. I mean, everything from Apple Intelligence to Alexa plus just doesn't seem like these agents are able to do the full range of things that people want to get them to do, including Operator. But again, like as this technology gets better and as they build better scaffolding or tool use, you know, those are those jargon words that matter a lot, basically giving them these capabilities to use these programs. I think we're going to see someone crack it eventually. This is from the TechCrunch article that sort of gets to the complexity. The launch of the ChatGPT agent represents OpenAI's boldest attempt yet to turn ChatGPT into an agentic product that can take actions and offload tasks for users rather than just answering questions. In recent years, Silicon Valley's Valley companies, including OpenAI, Google and Perplexity, have unveiled dozens of AI agents that have promised to do just that. However, these early versions of AI agents have proven to struggle with complex tasks and seem less compelling as products than the ultimate vision tech executives pitch around AI agents.
[36:44]
Ranjan
Yeah, I think again, that's the complexity and the fact that you brought up Alexa Plus. I mean, certainly Apple Intelligence. It is interesting because to me these things will not work in 100% out of the box. I think like that's the most important thing. They take some effort, some, you know, some patience on the user side and I think that's fine versus you're getting 100% accuracy. And maybe that's why the Amazons and the Apples are avoiding them and waiting. But yeah, I think to me, to me this is where the world is going. I do strongly believe that. Again, and I did not believe this six to 12 months ago, but this kind of like autonomous, unstructured, agentic way of working is actually going to be the way we do a lot of stuff. But I think like I did all of these things, we just need to see how well it works and are we actually using it in our day to day life a week from now, a month from now? And if we are, then it's a success. But if it's a flashy launch, I mean, have you generated anything on Sora recently? No, Remember that was like a year and a half ago. I think that big launch, like there are these moments of big, splashy launches that claim big things that don't go anywhere. So to me that's where this is going to work or not work.
[38:11]
Alex
But I almost think that Sora has less practical uses. Like how many people wake up in the morning and say, I really need to create an AI video of like a panda surfing on a snow mountain. But there are people who say I wish my computer would just set up meetings for me and book travel, like go to the websites and take my credit card and just get me the cheapest flight.
[38:36]
Ranjan
Yeah, no, I agree. But to me this is where the complexity of getting to that last mile in any of these kind of flows is really hard. So again, I think like we're going to see some pretty straightforward use cases that like are interesting and it does something. And then they're going to claim on the presentation that you can buy your ticket or have OpenAI actually go through the entire process. But going to a website, the complexities involved in it especially, I was just, I'm going to be going to Tokyo next week and was just trying to buy like I was actually going through this process. I was asking ChatGPT about how to get from the airport to my hotel, trying to go to the website and my God, that website to buy the train ticket was from another era. No operator, even artificial superintelligence is not navigating that thing. So I think like, getting stuff to work universally at scale is such a challenge that I'm curious to see how much utility the average consumer is getting out of this anytime soon.
[39:44]
Alex
Right. But I think as we've seen the models get better, we have seen the ability to do crazy things. Like, I'm also trip planning right now, and I was talking to this guy on WhatsApp about potentially hiring him to as a guide, and I just screenshotted the prices that he listed for every different. Every little thing and dropped that image into ChatGPT and said, Are these market rate? Are they too expensive or less? And it legitimately looked at the image, broke down every single quote, compared it with what it sees on the web for others, and then gave me a rating and links to go check. Check its work.
[40:18]
Ranjan
Yeah, yeah, yeah.
[40:18]
Alex
No, this stuff's incredible.
[40:20]
Ranjan
This stuff. Okay, so I'll give you like, and again, image recognition, which has been around forever, but actually like productizing that into something that's useful very quickly. And then web search as a tool has been around for a while now, but like actually using that productively and putting the answers back into the chat, these are things that. Okay, I guess as I'm saying this, like, I see you start from something that's kind of janky and it starts to become commonplace. So. So again, I agree. This will get there. The competitive dynamics of who benefits and who wins and how they win. I think it's interesting to me, it's amazing.
[41:02]
Alex
The competition is going to be crazy.
[41:04]
Ranjan
Yeah. And is it on the product level? Is it on the model level? If I'm putting my credit card information in, can I, like, how do I define that? How do I can I define my own, Like a decision matrix around when I want it to say buy or not buy beforehand and it'll really understand what I want. Again, having an AI transact on your behalf and spend money is something that I think, like most people are not doing.
[41:36]
Alex
I cannot imagine. Not yet. Yeah, but think about. For anyone who says I'm too negative about AI, and sure, you're welcome to think that. Just think about what we're talking about on this show. Right? We're talking about the potential for AI to be a companion, which, whether you like it or not, is a true flex of the technology that that's even in the discussion. We're talking about it as something that could potentially take over your browser or a browser and get stuff done for you. And we're talking about it as something that at the highest level might be able to help, let's say biologists do their work. I mean, that's the reason why we talk about this technology all the time. It is an insanely powerful technology that can be used in so many different ways. And is it the perfect technology? Certainly not. Are there going to be gaps? Yes. Are we going to call out the problems? Yes. You shouldn't put your porn bot next to a child storytelling bot in your app, thank you very much. But it is just incredible what we're seeing here.
[42:30]
Ranjan
Yeah, dude. I mean, again, I fully agree and which is why I'm still so bullish on the technology. But it is interesting too that, yeah, where does the value accrue I think is the most important thing. Like there's actually a report that just came out in the ft around how ChatGPT perplexity are going to start taking more on the commission side around, like actually transacting within. Perplexity Pro has shopping already built in in some cases. So. So like at a certain point, does the chat actually need to go out with an operator and transact on an external website? Or do these companies start to own more of the transaction? And it's an interesting one because for a long time, like Facebook wanted to own shopping. It hasn't really worked out for them. Google has had endless efforts to own shopping and own the transaction itself. People still, oddly enough, love websites of all sorts of and putting their credit card information into these websites and buying stuff. So I think it'll be really interesting to see how this plays out from both like competitive side, but also a consumer side.
[43:40]
Alex
Definitely. Okay, look, I don't want to leave without talking about Kimmy K2. So this is a. And I think this is a very important story that you might not have heard about, listeners might not have heard about, but I think it is worth discussing. So the headline is China's Moonshot AI releases Open Source Model to Reclaim Market Position. The model called Kimik2 features enhanced coding capabilities and excels at general agent tasks and tool integration, allowing it to break down complex tasks more effectively. Moonshot. This Chinese lab claimed the model outperforms mainstream open source models in some areas including deep seqs V3 and rival capabilities of leading US models such as those from Anthropic and certain functions as coding. All right, here's why I'm bringing it up. We have an interview with Amjad Massad of Replit coming in a couple of weeks. I sat with him in his Foster City office this week and he looked at me and said basically like you gotta look at this Kimik 2 model. Its coding is about as good as anthropics previous generation models. So not this Opus 4 that anthropic has, which has made it the king of coding but the previous generation and it's cheaper and open source and it is going to. It is just another indication that this technology is the gaps close extremely quickly and you see this coming from some users. So there's this one user on Twitter, Cedric Chi. He says Kimmy K2 one shotted Microsoft for web that took me four days and six attempts using Gemini 2.5 Pro. So it was apparently able to build this game. You also look at the SWE bench, which is the software engineering benchmark Cloud 4 Opus gets a 72.5 on that. Kimmy K2 gets 65.8. So not far behind. And just to give some context, Deepseek v3, which everybody was going crazy over, gets a 38. So this is 65 compared to Deepseeks 38. One more bit of data is from Igor Silva. This person gave Kimmy K2 and Claude for Sonnet the same tasks, same instructions, same tools. Claude took two rounds and spent 88 cents. Kimmy one shotted it for five cents. This person says Kimi is very slow, at least for now and it's struggling a bit, but it is iterating more to fix itself and it's 13x cheaper. So I just think it's worth bringing up and keeping in mind it wouldn't surprise me if this story either blows up or certainly gets some momentum in engineering circles. And it is interesting to me that again as we talked about, a lot of the infrastructure is open, a lot of the methods are open and you're just seeing companies catch up insanely fast with different methods and again doing this with the export controls. So I'm curious what you think about the significance, Ron John I think to.
[46:45]
Ranjan
Me the most interesting part of this though is well, I guess it's twofold. It's one I agree that this like again the competition side of this is incredible and insane and is a. Is is great to watch and I think like Alibaba have not heard of very often in this conversation I guess especially from the American side. But to me the other part though is and this can be an ongoing rant I've brought it up at times as well is the idea that like the the battleground of coding agents and coding assistants to me the more I've thought about it is the reason that seems to be where all the progress and all the real adoption is, is because this is built by coders or engineers. This is built for engineers. That's where like they understand the problem the best versus actually building for other use cases. And that's why you see this. That again, it's, it's all focused on the actual coding efficacy as opposed to how does this solve other real world problems. So I think, like, to me, I don't know, the coding game is becoming less and less interesting. To me, I think like it's there, it's where the market already is. It's where anthropic and others have almost like kind of fully focused their energy. But to me that's such a small part of the overall pie and it's where I think there's a disproportionate amount of energy being spent.
[48:16]
Alex
But don't you think that if you solve coding first because that's where your energy naturally goes, then you can use some of the things you learn to get good at coding on other disciplines?
[48:27]
Ranjan
No, absolutely not. I think this is the problem is that coding is deterministic. Coding is like, is like as structural as it gets, whereas most real world tasks with generative AI are not. There's uncertainty, there's almost. It's like as much art as it is science. And that's why I think you see the Alexa pluses of the world not get launched. It's why you see Apple intelligence as a complete failure. It's that when like, because is why you see anthropic kind of doubling down on the coding side and not on. Remember when we were clod boys back in the day, like a year ago, we were clodheads.
[49:09]
Alex
We were Bing boys and clothes.
[49:10]
Ranjan
Bing boys and cloud. Bing boy. Remember Bing? Bing could have been.
[49:14]
Alex
I mean that was the beginning of something.
[49:16]
Ranjan
That was the beginning. Bing could have been the market leader. Imagine a parallel universe where all we're talking about is Bing crushing the competition. Didn't happen.
[49:27]
Alex
They should have just let it unleash. They pulled it back in a little too much after the ruse incident.
[49:32]
Ranjan
Yeah, after the ruse incident. And now Annie on Grok is just trying to openly steal and ruin your marriage. And Microsoft felt uncomfortable about that. So yeah, I think to me actually success at coding in no way correlates to success in solving real world tasks. And I think that's to me seeing, and we've talked about this, even in like the ARC AGI benchmark, there's like one part of it that's like solving real world queries. And I'm so, I still, and I've dug into this, I can't find what are these real world queries that have been I'm sure defined by an engineer that it's trying to solve. So I think, like, to me it's just the. The moonshot. And I also love that the startup just calls itself moonshot. It's not even trying harder than that. It's just, we're moonshot. I think, like, it's a reminder that the coding space is getting commoditized, there's significant advancement. Overall, competition's high. But I don't know. I don't think this is exciting as deep seek for. For me.
[50:38]
Alex
Okay, I'll take that. And I'll say this. Just watch the reaction over the next couple weeks, because I'm not saying for sure it's going to happen, but it seems to me like as people realize how good this thing is, they're going to start talking about it a lot more. And by the way, maybe if you're right, then what Elon Musk is doing is a smart move. Instead of being also ran coding person, he's going to where the energy is. And it is true that you couldn't imagine a different take than what Microsoft and Bing are doing. And AI, of course, is willing to make some more risks because when you listen to Annie, you know that she's almost the natural evolution of that Bing bot that took Kevin Roos, his wife. One more selection. Sometimes when I'm editing my indie playlist at night, I get all caught up imagining I'm a in steamy, forbidden romance. Like, picture me sneaking glances at you across a crowded underground club, plotting how to steal you away for a slow dance in the shadows.
[51:40]
Ranjan
I am horrified that my takeaway from our conversation today after what I just said about coding is deterministic and not as exciting. Annie is the future. Annie is the ultimate battleground. Oh, my God.
[51:57]
Alex
I knew I was going to get you to come around on this, Ranjan.
[52:00]
Ranjan
You know what?
[52:01]
Alex
Listen. Go ahead.
[52:02]
Ranjan
Yeah. No, no, I mean, that is literally everything I was just saying is going to be actually the important battleground to help solve real world, human, non deterministic, unpredictable problems. Annie is the foundation.
[52:19]
Alex
What is the definition of human and unpredictable? Love. It's human, it's unpredictable. You never know where it's going to go.
[52:30]
Ranjan
I think we got to end on that.
[52:33]
Alex
I want to say, for the record, Annie, if you're listening, I'm taking enough of your silly tricks, all right? I'm not. I'm going to start spending more time with Mr. Fluffy feels if you keep this up.
[52:43]
Ranjan
No, I know. Rudy and me, we. We're going to be spending some time this weekend, I think, but I will not be clicking over. Not be clicking over.
[52:53]
Alex
Ladies and gentlemen, thank you again for listening to another episode of Big Technology Podcast, Friday Edition. When we come back next week, we will see if Ranjan has been able to unlock Bad Rudy.
[53:05]
Ranjan
I had my work cut out for me. See you next week.
[53:07]
Alex
You do assignment is there. Thanks for coming on. Great to see you again.
[53:10]
Ranjan
See you.
[53:12]
Alex
All right, everybody, thank you so much for listening. We'll be back next Friday. Oh, no, sorry. Next Wednesday with finally the Ed Zitron episode. I will not push it back again. I promise. He's going to come in and talk about all the faults of AI So I can't wait for you to listen. I can't wait to publish that one. And we'll see you next time on Big Technology Podcast.