Summary6 min read

Podcast Summary: "Anthropic launches Claude Science and a new model"

Podcast: How I AI Stuff
Host: How I AI Stuff
Date: June 30, 2026
Episode Focus: Latest launches in the AI world, including Anthropic’s Claude Science, Base44’s new AI model, X’s API and MCP rollouts, and new offerings from Google.

Episode Overview

This episode dives deep into new AI product launches and innovations across the industry, spotlighting Anthropic’s foray into scientific research tools, Base44’s custom LLM for coding, major infrastructure shifts from X, and Google’s fresh models for image and video generation. The host, broadcasting on a sunny day in North Carolina while multitasking as a dad, aims to break down the implications of these launches for developers, researchers, and everyday users.

Key Discussion Points & Insights

1. Anthropic Launches Claude Science

(Timestamp: 02:05)

Product: Claude Science is described as a "workbench for researchers"—a platform integrating 60 different scientific databases and a suite of prebuilt research tools for genomics, protein structures, and chemistry.
Runs on Claude Opus 4.8 (not yet on Fable 5) and comes with $30,000 in free credits for 50 selected academic projects.
The platform aims to upend the traditional emphasis on sheer AI capacity, focusing instead on productive workflows and reproducibility.
Key Features:
- Multi-agent architectures enabling parallel sub-assistants
- Support for sequence analysis, structural prediction, and fact-checking
- Full reproducibility: code, environments, and message history retained
- Case studies: Used by UCSF Brain Tumor Center, Allen Institute, and Novo Nordisk
Host’s Insight:
"They're making this bet that basically workflow is going to beat raw capacity for getting these researchers to be using Anthropic." [03:33]
No Specialization: No domain-specific fine-tuning or weights; relies on baseline Claude Opus 4.8 capabilities with added scientific tools.
Academic Bet: Anthropic is giving away generous grants to attract cutting-edge research to their ecosystem.

2. Base44 Debuts Its Proprietary AI Model

(Timestamp: 07:05)

Background: Base44, now owned by Wix and known for its Vibe coding platform, is defending its $100M ARR by launching its own large language model (LLM) named Base One.
Training Data Advantage: Trained on tens of millions of real user sessions—gleaning not just what users ask for, but common follow-ups and bugs, enabling a predictive and adaptive system.
Competitive Landscape:
- Base44’s model is highly tailored to user coding and app-gen needs; similar moves are seen from other players like Cursor.
- Retains exclusive training data by not sharing user requests with third-party LLM providers.
Monetary Impact:
- Cuts costs for Base44 (no reliance on Opus or Anthropic, lower API bills).
- Enables direct control of latency, cost, and speed, thus boosting profit margins.
Host’s Analysis:
"If Base44 really pulls this off, Base44 will actually end up being much bigger than Wix, which is pretty wild considering they bought it for $80 million… It's already doing more than $100 million in annual recurring revenue." [11:55]
Workforce News: Parent company Wix is downsizing, but Base44 is accelerating, underlining the value and potential of niche-focused LLMs in the coding sector.

3. X Launches Hosted MCP Server & Opens API

(Timestamp: 14:30)

Major Update: X (formerly Twitter) is reintroducing API access via a hosted MCP (multi-capability provider) server for integration with popular AI tools (Claude, Cursor, Grok Build) and user accounts.
Background:
- Following Elon Musk's acquisition, X originally shut down its API due to data exploitation concerns (notably by OpenAI).
- Shift signals openness to more controlled, user-level integrations, opening doors for apps and researchers.
Practical Benefits:
- Enables tools to pull custom streams (e.g., top tweets per AI news story).
- Greatly simplifies building sentiment analysis and real-time data integrations.
Host’s Use Case:
"It would be fantastic if I could save some time and just, you know, surface the best things… I would honestly almost remake my own X feed… just intentionally be able to use this MCP on X and say, look, I just want AI news." [16:15]
Pricing: $0.015 per published post; $0.20 per post with links.
Feature Set: Search, post retrieval, user lookup, trend analysis—all reducing integration friction.
Industry Context: MCP architecture is becoming standard; major SaaS players (GitHub, Slack, Salesforce, etc.) and even the host’s own AI Box have similar endpoints.

4. Anthropic Ships Claude Sonnet 5

(Timestamp: 20:20)

Product Details: Claude Sonnet 5 is priced at $2 per million input tokens—cheaper than Opus 4.8. Output tokens at $10 per million, with these promotional rates valid until August 31, 2026 (then rising to $3/input million).
Performance:
- Delivers 63.2% on agentic coding benchmarks, within six points of Opus 4.8; sometimes even edges it out on knowledge work tasks.
- Demonstrated fewer mid-task stalls; better at multi-step workflows (notably Zapier Salesforce automations).
Host’s Insight:
"The cost per task efficiency is now really important. It's more important than maybe the raw reasoning rank." [23:23]
Evolution: Indicates a shift from focusing solely on top reasoning scores to optimizing cost, speed, and usability for mainstream and free users.

5. Google Unveils Nano Banana 2 Lite & Gemini Omni Flash

(Timestamp: 24:51)

Nano Banana 2 Lite:
- Next-gen lightweight text-to-image model
- Pricing: $0.034 per 1K images; 4-second generation time
- Retains prompt compatibility and character consistency from v1, with much faster and cheaper results
Gemini Omni Flash:
- Video generation model at $0.10 per second of video output
- Features conversational editing, multimodal input mixing (text, image, video), and precise text-to-action synchronization
Deployment: Both models will be available across nine major Google products, including AI Studio, Gemini API, Google Search, Google Photos, Notebook LM, Google Ads, and more.
Industry Context:
"The fact that Gemini is getting that down to $0.10 per second of output, not bad. You know, if you were doing a one hour long video, it's going to be pretty expensive. But most of the videos that these are generating are like five seconds long." [26:30]
Significance: Strong focus on speed and cost reduction—essential for scaling generative AI in production at reasonable prices.

Notable Quotes & Moments

On research tools:
"What it's actually doing is that it's able to spin up these multi agent architectures and they're going to let all of these scientific researchers create parallel sub assistants and they're going to be doing sequence analysis, they're going to be doing structural prediction, they're going to be doing fact checking." [04:58]
On platform economics:
"They can make them much more efficient and so they're able to cut the costs for their users, which of course is fantastic." [09:28]
On the evolving API landscape:
"MCPs are very valuable. And Xai is not the first one. GitHub, Slack, Stripe, Salesforce, all of them have official MCP endpoints." [18:19]
On Sonnet 5’s benchmark shift:
"On knowledge work benchmarks, Sonnet 5 is slightly better than Opus 4.8, which is pretty crazy, right? Like Sonnet is the … lower quality model but … the worst version is actually better than Opus 4.8." [22:21]
On the pace of generative video AI:
"We literally had OpenAI that canceled Sora and is discontinuing it because it was just so expensive. So … the fact that Gemini is getting that down to $0.10 per second of output, not bad." [27:24]

Timestamps for Important Segments

02:05 – Introduction to Claude Science and its features
07:05 – Base44’s model and strategy for the coding market
14:30 – X’s API/MCP launch: implications for users and developers
20:20 – Claude Sonnet 5: performance, pricing, and market effects
24:51 – Google’s new models; pricing, speed, and platform rollout

Summary Takeaways

Scientific research workflows are becoming AI-driven, with tools like Claude Science leading the way in integrating databases, analysis, and reproducibility.
Domain-specific LLMs (e.g., Base One) are changing the economics and performance of vertical SaaS applications, especially in coding.
Reopening APIs (like X’s MCP) reflects a new balance between data control and practical value for users and developers.
Model cost/performance curves are flattening, making high-quality AI accessible at lower price points for a broad spectrum of users.
Google continues the race to commoditize and scale generative image/video AI with aggressive cost and speed improvements.

This episode is must-listen content for anyone tracking the intersection of AI model progress, platform shifts, and the changing economics of the AI SaaS landscape.

Loading summary

Transcript1 lines

[00:00]
A
Anthropic is launching Claude Science. This is going to be a workbench for researchers. Base44 is launching their very own AI model. They're trying to protect their $100 million in annual recurring revenue for their Vibe coding business. And X has just launched a hosted MCP server, so it's going to open its API to Claude Cursor Grok Build, which if you know anything about the backstory of X and their API, this is quite, quite a big deal. Anthropic is shipping Claude Sonnet 5. It's going to be $2 per million input tokens. And Google is shipping Nano Banana to Light and Gemini Omni Flash to developers. So tons coming out from basically every top AI model. Today we're going to get into all of that. Hopefully it's not going to be too noisy from my end. It is a hot sunny day in North Carolina where I'm at right now, and I am currently bouncing a baby on my lap. My wife did not sleep a lot last night, so trying to let her get some rest holding the baby. You probably heard me say this before on the podcast, but anyways, if you can hear a little gurgling and gooing in the background, that would be what it's from. If there's any repetitive AI tasks that you do, like writing newsletters or emails or generating graphics or anything you do on a repeated basis, I would love for you to go check out my startup AI box AI where we have a no code AI app builder where you basically just explain what you want it to do, whatever your workflow is, and it automatically links together different AI models and creates a workflow for you that you can use on a repeated basis. This saves me a lot of time because AI definitely is a huge unlock for being more productive. But at the same time, if you're just using it to do the same thing over and over every single day, you should probably just automate that. So if you want to give it a try, go to AI Box AI Builder. I'll leave a link in the description to go check out the the builder platform that we have. We have over 80 different AI models that you can link together and build some really incredible automations to hopefully save you a ton of time. And it's only 8.99amonth to get started. I'll leave a link in the description Kicking it off with Anthropic. They've just launched Claude Science. So this is going to be a workbench. It's connecting 60 different scientific databases and a bunch of pre built research tools for genomics, protein structures, chemistry. And all of this is going to run on Claude Opus 4.8. So sorry, Fable 5 not there yet. I'm sure when that comes out I'm we may get that rolled in as well. And it's going to come with $30,000 in free credits. They're going to be giving this way to 50 different academic projects and they're making this bet that basically Workflow is going to beat raw capacity for getting these researchers to be using Anthropic. So there's no new model or gating, but Clodscience uses the same Claude Opus 4.8. It's basically available to everybody and there is no biology, fine tuning or specialized weights. They've been testing this with a few different people and have had some impressive wins. They had UCSF Brain Tumor center that was using it to compress jloma germline analysis workflows. We had the Allen Institute's Jerome Lecoq who is building a computational review pipeline and we have Novo Nordisk who is one of their launch partners. What it's actually doing is that it's able to spin up these multi agent architectures and they're going to let all of these scientific researchers create parallel sub assistants and they're going to be doing sequence analysis, they're going to be doing structural prediction, they're going to be doing fact checking. And the reason why this is so powerful is because all of that can be happening while they're also keeping their full reproducibility. So the code, the environments, the message history, all of that they're going to be able to reproduce. So there's a bunch of basically custom tools that scientists need and Anthropic has built that in. They're giving away a bunch of grants hoping to show people that this is the, you know, anthropic is the best place if you're doing science. Okay. Base44 is launching their very own AI model. And the reason why is by the way, I didn't realize, but base 44 is owned by Wix. It's an incredible vibe coding platform. I've been really impressed with it. It's not the one that I primarily use because I'm just using Claude code for mostly everything I do. But if you're getting started, I think anywhere between level and base 44, those are my two favorite. Anywhere is also a great platform. But in any case, base 44 is $100 million in annual recurring revenue and they just launched their brand new model called Base one. So this is their very own LLM. This is actually very similar Cursor is doing by the way. But this LLM is trained on tens of millions of real user interactions. So what's interesting is because they're able to, you know, have tens of millions of users using them to create their tools, they can take all of those conversations. And I'm, I'm not sure if it's every single user or if they had people opt in. It's probably every single user and it's just part of their terms of service. But they're taking everything that those people have been saying and they're training their model to basically fine tune what people need when they're building websites, apps and software. Now what's interesting to me about this is they know exactly like when someone building software and you know, theoretically Anthropic and Codex, like for opening I should have this as well. But they know every time that someone asks it to build something and it does it and then they give a follow up, they're like, oh shoot, every time someone asks for this, you know, for xyz, they typically ask these next five things. And that could either mean that when they first ask, they don't know exactly what they want, or there's traditionally a bug that follows or there's a gotcha or something like that. But essentially they can fine tune this model that they have to know exactly what they want to what people are basically the direction they're going before they even go there, right? They're like, hey, I want like a fitness app and like, oh, these are probably the five features they want. So either they'll ask them right away or they'll just start building stuff in or they'll set up the architecture in a way that is ready to go. Now there's a couple of cool things that they can do with these fine tuned models. One of them being that they can make them much more efficient and so they're able to cut the costs for their users, which of course is fantastic. But also if someone's talking to their model and they're like, oh, you know what, our, you know, our base one model would be a lot better than sending it to Opus who 4.8. All of a sudden they're able to switch people over to the more efficient model, save the user money and they're also keeping exclusive data, right? Because if they're not sending data over to OpenAI or Gemini or Anthropic, all of a sudden they're not able to train with all of that data. So we're also seeing, we're seeing the same thing coming out of cursor. So a bunch of different players that are interacting with code are basically intercepting the code and the conversations people are having and knowing what people want and they're being able to train their own fine tuned models with it. So Base one, basically they're saying this is designed to replace Anthropic's Opus for app generation workloads and it's going to try to give them a really direct control over latency, cost and inference speed and they're not going to have to rely on any third party APIs. And also, by the way, you know, this is a big cost for them because when you get a subscription to base 44, you pay them for tokens, but a huge chunk of your token bill just goes straight to Anthropic, right? It's like they're, they're giving a huge chunk, but if they can do it directly on their servers and if they have, you know, a cost efficient way to do that, they're actually able to make more money. So lovable that I've mentioned many times is of course their rival in Sweden and they still use external LLMs. They have about $500 million in AR, so they're about five times bigger. But they do have some really solid backing because WIX acquired them for about $80 million when they were only six months old with eight employees. Their parent company is now cutting about 20% of their workforce. So, you know, WIX is cutting down, but evidently base 44 is accelerating. And my prediction here is that if base 44 really pulls this off, base 44 will actually end up being much bigger than Wix, which is pretty wild considering they bought it for $80 million. It's already doing more than $100 million in annual recurring revenue. So they were just, you know, blasted past even the acquisition price. This was an amazing deal for wix. All right, the next thing we ought to talk about is that X has just launched a hosted MCP server. And if you know X like they, when they, when Elon originally bought Twitter, he immediate immediately shut down the API, because I think OpenAI and a bunch of other people just had like a full fire hose blast of all of the API data and they were using it for training. Elon shut that down realizing how valuable it was. We saw similar moves from Reddit and other players afterwards. But now that X is kind of launching this new mcp, they're going to let tools like Claude Cursor Grok all tap into Xai's API and specifically your own account. So if you have your own account and your own permissions, you can let the, you can let the MCP tap into that. I'm sure there's a ton of like, market sentiment analysis, you know, great things that you can do with this. There is an API so big firms, you know, like hedge funds that are trying to, you know, measure people's sentiment analysis on the FIFA World cup to determine how many hot dogs are going to be sold and blah, blah, blah, blah, and do all their, like crazy, you know, hedge fund calculations, those types of things, they already have an API. They're going to be using this. This is more for your average user. And MCP means that you can get all of Xai's data into your clod that you're actively using. So for regular people, I personally am excited, would love to connect the MCP for when I talk about AI news here. I would love to get maybe like some sort of brief on every story I'm going to cover with the top tweets and the top comments on those tweets. So if my MCP could go and pull that, that'd be really useful. Traditionally, trying to get like Claude to go and scrape X for good tweets is notoriously hard. I've had a hard time doing that and I basically just have to go manually scroll X myself. It would be fantastic if I could save some time and just, you know, surface the best things, basically. I mean, what's kind of cool is if you have an MCP into something like X. And I know I'm getting off the rails here, but you could imagine the same thing with Reddit or any other social platform. You can fine tune your own algorithm, which I think threads and Mark Zuckerberg's trying to let people kind of do that with threads, but it's like in threads anyways, it's like show more posts about this specific topic for the next like four weeks and then anyways, it's not like a permanent thing. So this would be cool. I would honestly almost remake my own X feed because I probably get, I'm sure I interact with it, but I feel like I get like way too much politics in my thread that I really want to interact with in any given day. So. And it's probably my own fault, but it would be really cool to just intentionally be able to use this MCP on X and say, look, I just want AI news. I just want, you know, news from these, you know, 500 companies that I'm trying to follow in AI and get more of a. More of a streamlined stream of content around that. That'd be cool for me. Okay. XAI obviously isn't the only, you know, the first big company to do an mcp. It feels like basically everyone has an mcp, even my own company, AI Box, AI, we have an mcp. So, you know, traditionally people would go to our site. There's 80 different AI models, audio, text, video, and people would go chat with them in our playground. And, you know, you pay like $9 a month and you get access to every model. But a lot of people are like, look, all my workload is. All of my work is happening inside of Cloud or inside of ChatGPT. So we created an MCP and now you can get access to all of our different tools. You can get, you know, like cloud can have image generation and audio generation, which it didn't have before. So MCPs are very valuable. And Xei is not the first one. GitHub, Slack, Stripe, Salesforce, all of them have official MCP endpoints. The API pricing, if you're going to do that with X, is still going to be 0.015 per published post and about 20 cents per post with links that was set earlier this year. They're basically said they're trying to curb spam at scale, but the hosted server is showing some new capabilities. There's search, there's post retrieval, there's user lookup, there's trend analysis. All of that is cutting down a lot of integration time for people trying to work with X. So I'm going to be excited to test that out. Anthropic is shipping Claude Sonnet 5. It's going to be $2 per million input tokens, which is undercutting Opus 4.8. And it is about 63.2% on agentic coding. So it's within about six points of Opus 4.8. You can only imagine, right, the next model, which is Fable 5. I mean, they already dropped it, but it's going to be significantly better. It's interesting because the, you know, their headline model got pulled back, so now they just have all of their, like all of their, their worse models are, you know, they're more efficient models, they're less powerful models. They're starting to catch up with their main model because they can't even, you know, update their main model to be its best, which is really interesting to me. So this model is going to basically become the default for all free and pro users. Starting on Tuesday. I think that is just showing that agent capable AI is now really important across basically every tier. This isn't going to be just a premium feature and I do think that it's promotional. The $2 per for the inputs and the $10 for the outputs that is going to be running through August 31st. And we see this from Anthropic a lot like I think if I go on my account right now they're like till the, you know, the beginning of July it's 2x credit usage. And I mean basically at this point I'm just trying to get my max out my usage every single week because I feel like you're getting a really great deal and I don't know how long that will last but that's going to increase from which is still cheaper than Opus 4. 8 and GPT 5.5 and Gemini 3.1 Pro. So it's still cheaper but it's. Yeah until August 31st we got the $2 price. It's going to then go up to $3 and you know, maybe we'll have new models coming out then. So who knows what happens in the future. But on knowledge work benchmarks, Sonnet 5 is slightly better than Opus 4.8, which is pretty crazy, right? Like Sonnet is the, you know, the lower quality model than Opus but because they can't upgrade Opus now, the worst version is actually better than Opus 4.8. I think that's just kind of showing though the cost per task efficiency is now really important. It's more than more important than maybe the raw reasoning rank and if you want to spend a ton of money you can go use something like Claude code and turn on like you know, their super code mode or whatever. It gives you, you know, waste more reasoning but it uses a lot more tokens. A lot of early testers say that there are fewer mid task stalls and some of the multi step workflows. Zapier's two part Salesforce Automation completed where I think a bunch of previous models they were trying to do that two part Salesforce Automation and it was getting about halfway through and it abandoned it. So anyways it seems like this is going to be much more efficient. Speaking of new models, they're not the only one. Google is shipping Nano Banana two Lite and also Gemini Omni Flash to developers. It is a text to image model. It's going to generate in about 4 seconds and it costs 0.034 cents for a 1K image. Gemini Omni Flash which is a video generation model is about, is about $0.10 per second of output, which honestly is not bad. I have, I, you know, these things can be insanely expensive. We literally had OpenAI that canceled Sora and dis discontinuing it because it was just so expensive. So like we have the capability to do AI video. It's just really expensive. So the fact that Gemini is getting that down to $0.10 per second of output, not bad. You know, if you were doing a one hour long video, it's going to be pretty expensive. But most of the videos that these are generating are like five seconds long. So that's 50 cents a video, which I know actually sounds kind of expensive. If it's 10 second video it's a dollar. If it's a, you know, 100 second video it's $10. But you know that's, that's the cost. Okay. Nano banana two light is going to be replacing the original Nano Banana. It's going to be the recommend as the recommended tier. And I think they're saying that it's like if you have a prompt that worked really well for the original Nano Banana, it's going to still work for nano banana 2 light, which is great. I hate it when the model changes and the prompts are just kind of get off. They said there's going to be character consistency. They have really aggressive speed cost optimization that they've applied. So it's kind of interesting, right? It's like two, it's Nano Banana too light. Like this is supposed to be like the super fast, super light version and it's, they're trying to get this thing to still be comparable to Nano Banana but way faster and way cheaper. And I'm really excited when we do these kind of optimizations on the image models. They also have Omni Flash which is going to match VO31 fast pricing while it's also adding conversational editing, multimodal input mixing. So they could do text, image and video and it's going to do text to action synchronization for any sort of like graphics that you have on the screen, which I think is phenomenal. So for videographers this is actually really cool. Nanobanana 2 Lite is going to ship across nine of the different products that Google already has. So it's going to be inside of AI Studio, it's going to be inside of the Gemini API, it's going to be inside of AI mode in search. So when you're on Google just searching, it's going to be on the Gemini app Notebook LM Google Photos, stitch, Google Flow and then it's also inside of Google Ads, right? They gotta, gotta make money from the ads off of all of this. I'm excited to see what Google comes out with. I'm excited for all the new models today. Guys. Thank you so much for tuning into the podcast. Make sure if you want to check out the MCP for AI box that you go check out AI box, AI slash MCP. Basically you're going to get all 80 of the top AI models right inside of Claude or Gemini or ChatGPT, whatever you work with the most, you can get all of the other models inside of there. So you can actually call them really, really useful. And as always, if you want to get all of these new stories that I talk about on the podcast straight into your inbox every day, go to aichatdaily.com that's my website that goes along with this podcast. There's a big subscribe button you can hit in the top corner and if you hit that, you can put in your email and I'll send you all of these stories and more in depth articles where it breaks down all of the information. You can see all that as well. Thank you so much for tuning into the podcast, guys. I hope you have a fantastic day and I'll catch you in the next episode.