Summary6 min read

Podcast Summary: How I AI Stuff

Episode: Arena AI hits $100M run-rate in 8 Months

Date: June 29, 2026

Host: How I AI Stuff

Episode Overview

In this information-packed episode, host How I AI Stuff dives into major AI industry news, with a focus on Arena AI’s meteoric rise to a $100 million annual run-rate just eight months after monetization. The episode explores Arena’s unique offering and impact, dramatic developments in U.S. government AI supplier choices, key regulatory pushes on sensitive user data, major robotics advances, and shifts in global AI hardware supply chains.

Key Discussion Points & Insights

1. Arena AI’s Lightning-Fast Growth and Unique Model

[02:10 - 08:50]

$100M Run Rate in 8 Months:
Arena AI, born out of UC Berkeley, has achieved a $100M annual revenue run rate only eight months after launching paid products in September.
Product Overview:
Arena provides an AI leaderboard platform for head-to-head model comparisons, where users compare AI model outputs side-by-side and vote for their favorite—offering “free” AI usage as well as valuable evaluation data.
Monetization via Paid Evaluations:
The company sells detailed evaluation analytics to major AI labs (OpenAI, Anthropic, Google). This helps labs identify model weaknesses in specific domains (e.g., healthcare, finance) and guides reinforcement learning.
Venture Funding:
Arena has raised $250M in two rounds ($150M in Series A at a $1.7B valuation, backed by A16Z, Kleiner Perkins, and others).
Competitive Landscape:
Arena is challenging incumbents like Scale AI and Merkur for lucrative “post-training” evaluation business, where their competitor Merkur hit $1B revenue this year.
Notable Quote:

“This is triple what they were doing in January, which was $30 million, which honestly even in January I thought this was really impressive.”
—Host [03:30]
Why It Matters:
Arena’s free user-facing leaderboards fuel viral growth and crowdsource valuable human feedback, while the real revenue comes from in-depth analytics for big labs and toolmakers.

2. Palantir, Nvidia’s Nemotron, and US Government AI

[08:51 - 15:40]

New AI Supplier for US Government:
Palantir is switching its U.S. government AI stack to Nvidia's open-source Nemotron models, favoring them over proprietary products from OpenAI and Anthropic.
Rationale for Open Models:
Open-source models offer operational autonomy: agencies don’t need external sign-offs, can retain and customize their own model weights, and avoid centralized dependencies.
Dramatic Backstory:
Tension between government demands for unrestricted model usage and the safety policies of providers like Anthropic led to Anthropic being labeled as a "supply chain risk."
Notable Quote:

"[There was] this famous conversation between the head of [Department of] War and the CEO of Anthropic, Dario... Dario's like, ‘Well, if you're gonna do that, technically, that's against our terms of service.’"
—Host [11:45]
Industry Context:
2/3 of companies already use open models for cost and regulatory reasons. Nvidia positions Nemotron as a template for scaling AI in sensitive, regulated sectors.
Host Perspective:
Open models are key for classified and sensitive government work because agencies “get to keep the customized model weights that they train, which ... solves basically the core problem that’s kept a lot of secure AI adoption stuck and been very slow in the past.” [13:50]

3. Regulation: Warren, Sanders Push AI Health Data Protection

[15:41 - 20:45]

Revived Legislation:
Senators Elizabeth Warren and Mary Gay Scanlon are reviving and expanding the Health and Location Data Protection Act to explicitly ban AI companies (OpenAI, Anthropic, xAI) from selling health/location data entered into chatbots.
FTC Involvement:
The bill allocates $1B over 10 years for the FTC to enforce these protections, and allows for state, federal, and private lawsuits for violations.
Need for Regulation:
Host shares personal experience linking financial accounts to ChatGPT for convenience, but raises concerns about corporate stewardship of sensitive data (health, finance) as AI aggregates more personal information.
Notable Quote:

“Chatbot health transcripts are unusually rich, right. Because users are going to be pasting in their lab results, they're going to upload scans… all that can create a lot of really complete medical profiles that no other health, you know, traditional app would actually be able to capture.”
—Host [19:10]
Host’s View:
Largely in favor of the bill and increased user protections—even if not always in agreement with the bill’s sponsors on other tech and AI policy issues.

4. Flexion Robotics: Humanoids Running Office Errands

[20:46 - 25:50]

Flexion’s Mission:
Swiss startup Flexion Robotics is training humanoid robots (built by ex-Nvidia researchers) to autonomously perform real-world, multi-step office tasks in response to simple voice commands.
Unique Value Prop:
The software enables robots to fetch items, use elevators, stock shelves—making the robots more general-purpose and user-friendly, akin to a physical “Claude.”
Hardware-Agnostic Model:
Flexion focuses on software, not device manufacturing, intending to work with multiple robot makers (Unitree, Figure X1, etc.).
Industry Outlook:
ABI Research forecasts the foundational model market for robots hitting $150B by 2036.
Notable Moment:

“This could actually be quite useful. I almost imagine this, like if I was in an office setting, everyone would have like a little microphone button maybe that like summons this thing.”
—Host [22:40]
Approach:
Heavy use of reinforcement learning to continually refine planning and control. Flexion plans to be “hardware agnostic” and own the software layer for humanoid automation.

5. China’s CXMT Wins $3B Tencent Memory Deal

[25:51 - 28:30]

Major AI Supply Chain Shift:
China’s CXMT signed a record $3B DRAM supply contract with Tencent, marking Chinese tech’s increasing independence from American memory suppliers due to tightening US export controls.
Significance:
Signals Chinese AI/cloud operators’ willingness to buy local for key hardware, mirroring SK Hynix’s earlier move into “hyperscale” AI memory markets.
Potential for Global Expansion:
Host speculates on whether CXMT can grow beyond China's borders, given the “sensitive nature” of data center buildouts and geopolitical barriers.
Notable Quote:

"I'll be interested to see if CXMT is able to move outside of just China and really get more global dominance, or if it's something that's just going to be geographically landlocked..."
—Host [27:55]

Memorable Quotes

“This is triple what they were doing in January, which was $30 million... even in January I thought this was really impressive.”
—Host, on Arena AI’s growth [03:30]
“Open-source models... there is no one that can say, hey, you can't use it for XYZ reasons. It's an open source model. You can do whatever you want with it.”
—Host, on Palantir’s rationale [12:45]
“Chatbot health transcripts are unusually rich… all that can create a lot of really complete medical profiles that no other health, you know, traditional app would actually be able to capture.”
—Host, data privacy [19:10]
“If I was in an office setting, everyone would have like a little microphone button maybe that like summons this thing.”
—Host, on office robots [22:40]
“I'll be interested to see if CXMT is able to move outside of just China...”
—Host, on AI hardware geopolitics [27:55]

Important Timestamps

02:10 — Arena AI’s business model and explosive growth
06:30 — Arena’s funding and competition
08:51 — Palantir’s adoption of Nvidia’s open models
11:45 — Drama between US government, Anthropic, and open-source requirements
15:41 — Warren & Scanlon’s AI health data protection legislation
20:46 — Launch of Flexion’s robot office assistant
25:51 — CXMT’s major memory deal with Tencent

Tone & Style

The host delivers a rapid-fire, insight-heavy rundown in an enthusiastic, informal style, often weaving personal anecdotes and opinions into the industry updates. The episode is densely informative, balancing play-by-play current events with broader market and social implications.

This summary captures episode highlights and deeper context for listeners who want to quickly grasp the most important stories and trends in AI as of June 2026.

Loading summary

Transcript1 lines

[00:00]
A
Arena AI has just hit a $100 million run rate. This is only eight months after they launched paid evaluations. I want to break down what arena does, why they're special, why they're unique, and why they're growing so fast. Palantir is tapping Nvidia's Nemotron open models for US government AI. This is interesting, and a lot of drama is behind this story as well. Elizabeth Warren and Scanalon are reviving a bill to ban AI firms from selling health data. We'll get into the details on that. Flexion Robotics is training hundreds of humanoids to run office errands autonomously. And China's CXMT is landing a $3 billion memory supply deal with Tencent. We know memory is one of the critical pieces of the AI infrastructure buildout, and there's a lot of issues going on with memory increasing the costs of basically all electronics today. If you've ever been using an AI model like Claude and have been frustrated that it doesn't create images or audio or video, I'd love for you to try the McP connector for AI box. That's my own startup. We essentially allow you, with one link, give it to Claude, and it can bring any of the AI models that you use for everything else into Claude. So you can use ChatGPT's image generation inside of Claude, you can use 11 Labs audio generation inside of Claude, or Google VO3's video generation inside of Claude. So all of the capabilities of the different AI models, there's 80 different ones that we allow you to connect. You can bring them all inside of Claude or ChatGPT or Gemini or Cursor or any of your other places where you really do all of your work, all of your workspaces with AI. So if you want to check that out, it's AI boss. It's a super easy MCP connector. So you give Claude this link inside of the connectors and you log into your AI Box account, and now you have all of these capabilities. I have a whole website where I explain how this works, how you can do it, and it is 8.99amonth to get started with it. So it's super cheap. And I hope that really unlocks a lot of creativity for you to be able to get 80 different AI models and capabilities inside of Claude or whatever else you are building with. Okay, let's talk about what's going on with Arena. This is a company that is now at 100 million. They were originally created in UC Berkeley, and it's basically an AI leaderboard, right? Like this is the company where you can go and test different AI models against each other. They've hit this hundred million dollar annual revenue and this is eight months after they launched their paid evaluations just back in September. This is triple what they were doing in January, which was 30 million, which honestly even in January I thought this was really impressive. So the company is now directly competing with Scale AI and Merkur for post training dollars because they're selling labs, structured analytics built on 10 million plus human model comparisons. So basically what's going on is they've just raised $250 million total across two different rounds. So they did $150 million Series A in January at a $1.7 billion valuation. They did that from A16Z, Clymer, Perkins and Felicious. So the way that they actually are making money is that the public leaderboard is still free. So essentially if you haven't tried this before, you go to the site and it puts two different AI model responses side by side. A lot of people use this because you basically get free AI usage out of it instead of having to pay for, you can go pay this and it will give you, you know, two responses side by side. You pick which one you like better and you're helping, you know, tell it which ones are, are the most popular. Those leaderboards are then, you know, created so you can see, you know, oh my gosh, ChatGPT's new model is, you know, beating the open the, you know, the, the model from Anthropic, whatever. So that's kind of where a lot of these leaderboard companies, how they kind of run and why people use them, but how they're actually making money is that they have an AI evaluations product package which is going to show data basically to different AI labs. So like OpenAI, Anthropic and Google, when they have their models in there getting, getting voted on, they'll show them what areas their models are losing on to competitors, right? So it's like, hey, look, your model's good, but anytime a healthcare question gets asked, yours is doing poorly. Or anytime a finance question or anytime a question related to this. So they're telling them exactly where their models are lacking and they're helping them to guide reinforcement learning. So it's a, you know, it's a massive value for OpenAI and Google. They're gonna pay a ton of money for that. It's a great value for get free AI usage. And also it's interesting for all of us to see what Companies are doing the best in the leaderboards. So Merkur's annualized revenue, which is one of their competitors, hit $1 billion this year. And Handshake AI's training army grew from 550 million to about a billion dollars in three months. So I think this just kind of shows the scale of a lot of these post training markets, how much money they're actually able to make. Let's talk about what's going on with Palantir. So they have just basically selected Nvidia's Nemotron open models to use at the US Government for all of their AI. Of course, there's a ton of drama with anthropic and even OpenAI right now and the US government having to, you know, pull their models or tell them, like, hey, you can't release your model till we give it, like an accurate assessment, whatever, right? So one thing that's interesting, I think in particular with the Department of War and a lot of the beef that they had with Anthropic and them, you know, classifying Anthropic as a supply chain risk because they wouldn't let them do all the things that they wanted to and stuff. Because of that. Palantir, who does a. A lot of the development for the US government, a lot of software development, has selected to use an open source model. So Nvidia has their Nemo Tron model. They're using this open source model and they're using it to run a lot of stuff. Now, a lot of AI, you know, work inside of the US government. And the reason why an open source model is great for this particular use case or why the US government would be so happy to use it is because there is no centralized, you know, servers that the model's got to go back to. There is no one that can say, hey, you can't use it for XYZ reasons. It's an open source model. You can do whatever you want with it. So in basic, basically everything that they're doing, they will have. They won't have to go and get it signed off. I remember this kind of this famous conversation between the head of War and the CEO of Anthropic, Dario, and they were saying, like, hey, we need it to be able to do XYZ tasks. And Dario's like, oh, well, if you're gonna do that, like, technically, that's against our terms of service. And then they're like, okay, well, what if, like, there was a moment when we really needed to make some sort of critical decision on this. And Dario's like, oh, well, you could just call me and like, I could, if there's like an exception to, like a rule, just call me and like, I'll unblock something for. And they're like, okay, well, we don't want to call you and unblock something if we're in the middle of, like, a confidential, you know, battle thing, blah, blah, blah. So, I mean, a lot of people have a lot of opinions on this, but these are the two arguments that we hear on this. And so this is why it looks like Palantir is going to be using this to run a lot of U.S. government agencies. And beyond all of the drama, I just think for keeping classified data without having to send that to an outside network, I think a lot of agencies will want to use that. So agencies also get to keep the customized model weights that they train, which I think solves basically the core problem that's kept a lot of secure AI adoption stuck and been very slow in the past. And in particular with Nemotron, this is an open model and it's going to run inside of Palantir's sovereign AI operating system. It's going to be all run on Nvidia's accelerated hardware. There's going to be handling data authorization, there's going to be isolation, and all of the auditing is all going to be in a closed loop. I'll also say this isn't too crazy. Like, 2/3 of companies right now already use open models, and I think a lot of them say they're doing this because of cost efficiency. Nvidia frames this as kind of the template for scaling AI in regulated environments, sensitive environments, so, like finance and healthcare and government, obviously, in AI regulation. Elizabeth Warren and Scanlon are reviving their Health and Location Data Protection Act. They're adding a bunch of new things to it. It now explicitly bans AI companies like OpenAI, Anthropic and Xai from selling health and location data. Users enter into ChatGPT, Claude and Grok. The bill is going to allocate $1 billion to the FTC over 10 years to enforce this. And basically this is filling a regulatory gap because a lot of these AI labs are racing to get users and I mean, even myself. You have ChatGPT. They just added ChatGPT Finance, which of course I. My wife, like, hates this kind of thing. She hates giving data to AI models. And it's probably for good reason. Myself, though, I went and added basically all my credit cards and bank accounts to my chatgpt. It does the integration through Plaid. But it's been super useful to be able to ask it questions about where we're spending, what our recurring expenses are, income from businesses. I mean there's just so much interesting data you can get just chatting with this thing. So I love it. I know a lot of people use this for health things as well and there's a lot of benefits. But a lot of people are concerned about, you know, giving all of these financial and health data to these companies. What are they going to do with it? So the FTC has to write implementation rules within 180 days of this bill being passed. If it is passed, the agency, state and Attorney general and individuals can all sue for any violations on this. There's some co sponsors on this. We have Senator Ron Winden, Bernie Sanders. The original bill is from June of 2022 but it only covered data brokers. It didn't have AI companies and kind of how they're collecting data upstream. Chatbot health transcripts are unusually rich. Right. Because users are going to be pasting in their lab results, are they going to upload scans, they're going to ask follow ups all and all that can create a lot of really complete medical profiles that no other health, you know, traditional app would actually be able to capture. Right. People are uploading pictures of their X rays and so much more. So anyways, I do love all of this kind of data protection. Not being able to sell that. That makes a lot of sense to me. Even though I don't agree with everything that Bernie Sanders does or says when it comes to AI in particular banning AI data centers from New York. I mean I just think that that's kind of counterintuitive to a lot of what progress and the benefit that these data centers and well, the benefit that a lot of this I can have on people. But anyways, all of that aside, this is one where I would definitely say I'm agreeing with Elizabeth Warren and Bernie Sanders on the face of this bill. Now I don't know if there's secret sneaky things stuck in there, but overall, don't sell my health data to outside companies that I give you. I am definitely in agreeance with that one altogether. Okay, let's talk about what's going on with Flexion Robotics. They are training humanoid robots to run office errands. They're helping them do this all autonomously. This is a Swiss startup. It was founded by Ex Nvidia researchers that built a software stack essentially that's letting these humanoid robots do multi step office tasks. So things like, and oh, and by the way, you can do this all by just talking to them. So I think I'm really used to seeing those like dog robots from Boston Dynamics and there's like remote controls where you can kind of control them. I've seen them at like conferences and stuff. This will be different. This is your literally going to humanoid robot and you're like, hey, go get me a coffee. Although I'm sure there's, you know, better things than telling it to go get you a coffee. It'd probably be like walking around and getting things faxed or scanned or dropping things off different people's desks. I don't know. Right. This could actually be quite useful. I almost imagine this, like if I was in an office setting, everyone would have like a little microphone button maybe that like summons this thing. And you could like think of how you kind of use like Claude. Or I guess how I use Claude is I hit like my voice to text button, I give it an instruction and it runs off doing it. Let's say there's one humanoid robot for the entire office. Not everyone needs their own, obviously. So you could have like one or two, maybe one per floor or something like that. You push a button, you give it its command, you tell it what you want it to do, it comes over to your desk, it grabs the document, grabs the paper, grabs the, you know, whatever you need it to move or do, and then it goes and executes it. It's like a real world version of Claude. I love this if you can't tell. Basically this is a modified unitary humanoid robot and it's running all of their systems. It's, you know, they're getting, it's like go retrieve like mail and stuff. Apparently it can go up and down stairs, it can go in the elevator, it's super capable. And they have it stocking a shelf in a recent demo that they released. So it's very capable. Right now they're using reinforcement learning at basically every layer of what they've built. They have a master planning, they have simulation environments, they have motor controller. If you look at the overall market, ABI Research is projecting that robot foundational model markets are going to hit a about $150 billion by 2036. Flexion is basically trying to position some of the software as the product and they're saying, look, the hardware is a commodity. There's going to be a ton of these companies that are building these humanoid robots. Maybe Tesla, we're not going to compete on there, but we're going to compete on the software that helps train these robots to do things in your environment, in your office, and how you guys actually control that. So, honestly, I kind of love that it's moving. It makes the humanoid robot feel more like an LLM, and there's all of the stuff getting built on top of it. Flexicon is hardware agnostic, so they're saying they work with a whole bunch of different humanoid robot makers. They're not going to build their own robot. They're going to be able to let you use unitree, figure x1 and a bunch of other platforms. China's CXMT has landed a $3 billion memory supply deal with Tencent. This is the largest commercial contract, I think, in basically, domestic drams makers in, like, their entire history of their company. The deal right now, I think, is showing that China's largest cloud operators are now actually willing to source AI memory from local suppliers because a lot of the US export controls are tightening around basically all of the advanced chips. Right. So Tencent is a huge customer. They need so much of this, and traditionally they have bought a lot of this from American companies, but it seems like they're able to get this from Chinese companies now because of these export controls. Something interesting about this company in particular is that CXMT is kind of moving from like, smartphones and PC OEMs to now hyperscale AI customers. And I think that's kind of the first time for them. It's very similar to what SK Hynix did, and they kind of, you know, hit this dominance path because they were getting customers like Nvidia to buy from them. I'll be interested to see if CXMT is able to move outside of just China and really get more global dominance, or if it's something that's just going to be geographically landlocked because of kind of the sensitive nature of everything happening with cloud and AI and. And all of the data center build outs. This podcast was a lot of fun today. I'm actually sitting here on a yoga ball holding my baby. I got a new baby as of a couple weeks ago, and hopefully you weren't able to hear him grunting too much. He's sleeping, so he should be pretty happy. But anyways, I'm gonna keep getting these podcasts out for you. There's so much happening in AI news. I love this stuff. I'm super passionate about it, but lots of fun going over, but excuse any little sounds you might be hearing in the background as we're recording in the studio with Clay. So, anyways, if you wouldn't mind leaving a review, it would help the show a ton. I appreciate them all. I read them all. But anyways, thanks so much. And make sure to check out AI box AI, the MCP in particular. If you want to get access to all of the different AI models, there's about 80 of them in one platform. Or you can build them all into Claude. So you get image, audio, video, all inside of Claude. All right, you guys already know the links in the description. I'll catch you in the next episode.