Ep 783: Google’s Gemini 3.5, Codex goes for Goals, Alexa going after NotebookLM and 7 More AI Updates You Should be Using Today - Everyday AI Podcast – An AI and ChatGPT Podcast

Summary9 min read

Everyday AI Podcast – Ep. 783: “Google’s Gemini 3.5, Codex goes for Goals, Alexa going after NotebookLM and 7 More AI Updates You Should be Using Today”
Date: May 22, 2026
Host: Jordan Wilson

Episode Overview

In this jam-packed “AI Friday Features” edition, host Jordan Wilson breaks down the week’s most significant, useful, and sometimes confusing new AI releases from Google, Microsoft, Cursor, OpenAI, and Amazon. The focus: practical AI features you can use today to boost productivity, streamline workflows, and stay ahead of the ever-accelerating AI curve. Key topics include the new Gemini 3.5 Flash, groundbreaking Codex updates, Microsoft's ChatGPT for PowerPoint, Google's new multimodal video model Omni Flash, Alexa’s podcast creator, Cursor’s upgraded coding model Composer 2.5, and more. Throughout, Jordan delivers candid insights, benchmarks, and actionable advice for business leaders and AI practitioners.

Key Discussion Points & Insights

1. Google Gemini 3.5 Flash: A New Flagship Model

[03:20]

What’s New
- Gemini 3.5 Flash is now Google's flagship Gemini model (for now), announced at Google I/O and available immediately in the Gemini app and as the default AI in Google Search worldwide.
- It’s four times faster than comparable models and beats previous Gemini models (3.1 Pro) on coding and agentic benchmarks.
- Multimodal: Improved understanding of text, image, audio, and video.
Performance & Benchmarks
- Outperforms on key agentic benchmarks (MCP Atlas, Terminal Bench 2 GDP, Val AA):
  
  “There’s a handful of benchmarks that actually the Gemini 3.5 flash is the best in the world.” (08:05)
- Not yet the best across all benchmarks (Opus, GPT-5.5, Sonnet 4.7 may be stronger in some areas).
Pricing and Usefulness
- No longer a “cheap” Flash model; price is notably higher than previous Flash generations.
- Remains invaluable for developers, engineers, and operators handling complex, multi-step agent-based workflows.
Who Has Access
- Live in the Gemini app, Google AI mode in Search, Google AI Studio, Vertex API, and Anti Gravity 2.
Quote:

“Frontier-level coding and Long Horizon agent work at the Flash tier speed… You still have the speed of the Flash, but you don’t quite have the same price breakthrough.” (10:00)

2. ChatGPT Comes to Microsoft PowerPoint

[15:55]

What’s New
- Native add-in now brings ChatGPT directly inside PowerPoint.
- Generates slides from prompts, emails, documents; critiques presentations, suggests improvements, and anticipates audience questions.
Contextual Integration
- Integrates with Gmail, Outlook, SharePoint for richer, context-aware presentations.
- Outputs fully editable PowerPoint slides (not just images/JPEGs).
- OpenAI calls this “turning conversations into presentations in minutes.”
Access & Utility
- Available in beta for most ChatGPT plans (Business, Enterprise, Edu, Teacher, K12, Free, Go, Plus, Pro).
- Requires Microsoft PowerPoint and a connected ChatGPT account.
- Saves considerable time; maintains company templates and brand adherence.
Quote:

“This is context engineering inside of PowerPoint, powered by one of the best harnesses around inside of ChatGPT.” (18:10)

3. Google Gemini Omni Flash (“Anything-to-Anything” Video Model)

[25:55]

What’s New
- Gemini Omni Flash: New world model for “anything-to-anything” generation (primarily video for now).
- Generates and edits high-quality video from text, image, audio, or video input.
- Enables conversational editing — change video scenes, backgrounds, and more with text input.
Comparisons & Insight
- Not just “text-to-video”; uniquely excels at video in → video out editing.
- Quality is not quite on par with China’s SeedDance 2 for pure generation.
- Grounded in real-world knowledge; anticipated to support images, audio as output soon.
Access
- Rolled out to Gemini Plus, Pro, Ultra users via Gemini app and Google Flow (YouTube Shorts support coming).
Quote:

“If you just look at this as a text-to-video output and compare it to SeedDance and say, ‘this new Google Omni Flash stinks’, I think you’re missing the point… Instantly the best in the world at video-in, video-out.” (29:52)

4. Cursor Composer 2.5: A New Challenger in Coding LLMs

[35:35]

Background
- Cursor is a multi-model agentic IDE for coding, now shipping an in-house LLM, Composer 2.5, based on Moonshot AI’s Kimi K25.
Improvements
- 25x more synthetic training data than previous models; excels at long-horizon, multi-file, and iterative coding sessions.
- Nearly equals Opus 4.7 performance on key agentic coding benchmarks, at ~10% of the price.
- Input: $0.50 per million tokens, Output: $2.50 per million tokens (vs. Opus’s $5 and $25, respectively).
Impact
- A major new option for teams and developers priced out of Opus, GPT-5.5, or Gemini Pro.
- Especially useful for engineering teams running large batch/agentic workloads or budget-sensitive users.
Quote:

“You’re getting 80–90% of Opus capabilities at 10% of the cost. That is a pretty big update.” (40:20)

5. Alexa Plus: AI Podcast Creation a la NotebookLM

[43:15]

What’s New
- Alexa Plus now offers on-demand AI-generated podcast episodes.
- Two synthetic hosts discuss any topic you propose, with customizable length, focus, and tone.
- Unlike NotebookLM, doesn’t ground in your documents—just web sources.
Access & Process
- Rolling out to Alexa Plus subscribers (free with Prime in the US).
- After you propose a topic, Alexa creates an outline, lets you adjust, then notifies you when the audio episode is ready.
Critique (Jordan’s Take)
- Doesn’t see it working well; “Alexa Plus is hot garbage,” likely to hallucinate and lack accuracy compared to NotebookLM (which grounds answers to your files).
  
  “If you try to replace me out there, don’t you do it…at least not with Alexa Plus Podcast… it’s going to be abysmally bad.” (47:45)
- Amazon intentionally calling out Google’s NotebookLM by name (“no documents needed”), but undermining the value of grounding.

6. Google Anti Gravity 2.0: Agentic IDE/CLI Expansion

[50:44]

What’s New
- Anti Gravity 2.0 is Google’s “Codex competitor”: an agent-first IDE and CLI, now available as a desktop app (Mac, Windows, Linux), terminal interface, SDK, managed agents, and as a harness in Google AI Studio.
- Gemini CLI is being deprecated (open source users particularly upset).
Critique
- Host unimpressed: finds Anti Gravity 2.0 “not very good right now”, rushed to release, receives poor feedback from developers.
- Frustratingly similar UX to Codex (and even mistakenly shows a “Codex” folder in the Google demo video).
Quote:

“It looks identical to Codex… I opened and installed Anti Gravity… I literally thought, ‘Where did all my Codex chats go?’” (54:15)

7. OpenAI Codex: Weekly Update Blitz & Goal Mode Goes Desktop

[57:39]

Major Upgrades
- “App Shots”: Double-command key shortcut instantly captures your foremost app window, including actual text/context, and imports it into Codex for immediate use.
- “Goal Mode”: Now in the desktop app (not just CLI)—lets Codex autonomously pursue defined objectives for hours/days via recursive agents. Powerful, must be carefully scoped.
- “Remote Computer Use”: Codex can now control your laptop even while it’s locked—unprecedented for background autonomous agents.
- “Shareable Plugins”: Teams can now share custom skills/plugins across their organization, solving a major pain point for enterprise deployments.
Host’s Enthusiasm
- Jordan lauds Codex’s pace, focus, and usability:
  
  “The team at OpenAI continues to embarrass the entire AI industry with how good and useful Codex is and how far it is…” (58:28)
- “Codex, Codex, Codex” is the refrain—OpenAI now retaking the lead in AI innovation for 2026.
- Highlights fluid UI/UX (like double-command shortcut for app shots) and superior rate limits vs. Google/Anthropic.
Access & Caveats
- Most new features are Mac-only for now (due to easier development environment).
- Goal mode intended for experienced users with precise prompts; careless use can burn through token limits quickly.
Quote:

“Goal mode—this is your agent that's going to run for hours or even days… that other vendors keep promising. It's here, right?” (1:00:28)

Key Memorable Quotes

On AI complexity

“You have dozens of new AI features and upgrades to tackle available now, including Gemini 3.5 Flash...to make sense of them, you're going to need like a roadmap, a translator, and also dozens of hours.” (00:53)
On practical AI adoption

“We give you the lowdown on some of the most important and impactful AI tools that you can use today and LLM upgrades that you don't have to wait around for.” (02:37)
On OpenAI’s momentum

“Codex, Codex, Codex... The team at OpenAI continues to embarrass the entire AI industry with how good and useful Codex is.” (58:28)
On Google’s Anti Gravity 2.0

“I don’t find it useful. I don’t think Anti Gravity is very good right now, which is interesting to me because Google seemed to push it way more than they should have.” (53:04)

Timestamps for Major Segments

[03:20] – Google Gemini 3.5 Flash deep dive
[15:55] – ChatGPT integration in PowerPoint
[25:55] – Google Gemini Omni Flash world model
[35:35] – Cursor Composer 2.5 release and context
[43:15] – Alexa Plus’s AI Podcast feature vs. NotebookLM
[50:44] – Google Anti Gravity 2.0 (IDE/CLI confusion & critique)
[57:39] – OpenAI Codex weekly updates, Goal Mode, App Shots, team sharing

Summary Table: Updates, Features, and Applications

| Update | What’s New/Key Feature | Who Benefits Most | Notable Issues or Critiques | |--------------------------|-------------------------------------------------------------|--------------------------------------|-----------------------------| | Gemini 3.5 Flash | World-leading speed/benchmarks for coding/agentic tasks | Developers, engineering teams | Price increase, complexity | | ChatGPT for PowerPoint | Full slide creation, story critique, connected context | Knowledge workers, enterprise users | Requires Microsoft/ChatGPT | | Gemini Omni Flash | Multimodal video editing; video in → video out | Marketers, creatives, video editors | Not the best at gen-only | | Cursor Composer 2.5 | Top-tier coding model at 10% Opus price | Engineering, cost-sensitive teams | Still trails top 3 overall | | Alexa AI Podcast | Dual-host podcast generation from prompts | Alexa Plus users, casual listeners | Hallucination, poor context | | Anti Gravity 2.0 (Google)| Agentic IDE/CLI, SDK expansion; Gemini CLI deprecated | Developers needing Google ecosystem | Confusing, not polished | | Codex Goal Mode, App Shots| Autonomous desktop agent, app context snap, sharing plugins| Power users, enterprises | Mac only, rate limit watch |

Final Thoughts & Listener Takeaways

AI is moving faster than ever, with major usability leaps in both “agentic” automation and practical enterprise integration—from slide decks to autonomous coding.
The Codex platform, with its weekly pace and seamless integration, stands out for real-world productivity.
Google’s new models are powerful but fragmented and sometimes opaque (“You need a roadmap and a translator”).
Alexa’s move into AI podcasting highlights the race to create personalized audio, but grounding and context remain essential.
Cursor’s advances signal more competition in the coding model space, which may drive down costs and improve access.
Listeners are encouraged to focus on features actually available today in their plans and platforms—and to seek out tools that fit their workflow, not just the latest headline model.

For more:

Sign up for the free daily newsletter at youreverydayai.com for detailed recaps, links, and actionable guides.
Check out the “Start Here Series” for a curated AI learning path, Episode 691+.

Jordan’s Closing Advice:
“If you don't have 10 hours a day to understand it all, that’s what I do for you… Subscribe to the podcast, sign up for the newsletter. AI moves too fast to follow, but together we can break some barriers.”

Loading summary

Transcript1 lines

[00:01]
A
This is the Everyday AI show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business and everyday life. If you're a Google Gemini user, here's the good news. You have Dozens of new AI features and upgrades to tackle available now, including Gemini 3.5 Flash. But here's the bad news. In order to find them all or make sense of them, you're going to need like a roadmap, a translator, and also dozens of hours. That's not all that's new in AI this week. If you're an Opus user inside of Cursor, there's a new model you might want to take a look at. PowerPoint got a lot easier with Chat GPT. And speaking of ChatGPT, its bigger, stronger brother, Codex again released a handful of AI updates that I don't understand how they're this good and this far ahead of the pack. So if you don't have hours every single day to keep up with all of these new AI releases, because that's how much time it takes, because that's all I do well, this is the show for you. On Fridays, we bring you our new AI Friday features where we give you the lowdown on some of the most important and impactful AI tools that you can use today and LLM upgrades that you don't have to wait around for. Not coming soon. These are ones you can use today to start making your life a little bit better. All right, let's get into it. If you're new here, welcome. My name's Jordan. This is Everyday AI. It's a daily live stream, podcast and free daily newsletter helping everyday business leaders like you and me not just keep up, but how we can use all of this new AI technology to get ahead to grow our company and career. It starts here with the unedited, unscripted live stream podcast. But to be the smartest person in AI, our website is your cheat code, your everyday AI dot com. Go sign up for the free daily newsletter. We're going to be recapping the seven AI features that we're going over in today's show as well as a whole lot more. So make sure you go check that out. And hey, livestream audience, good to see you. Kimberly joining from New York City. Jose joining from Santiago. Joe is Codex Vibing in the mountains. I'm Codex Vibing in Chicago. Sounds a little bit funner, so thanks for everyone for joining. If you do have any questions as we go along, I'll try to get to them. But let's get straight into it. There's a lot to cover this week and let's start with Google 3.5 Flash. So, yeah, there was a lot we didn't get a Gemini 3.5 Pro. I think that there's a reason Google did this. I'll tell you here in a second. So here's what it is. What's new, who has access, all that good stuff. So Gemini 3 Point Flash is Google's new flagship model. Yeah, that's right. Their Flash is the flagship model for now in the Gemini 3 point family and it was announced at Google's IO. We're going to be recapping probably three or four of the the Google I O AI updates. The rest will be shared about and already shared about in our newsletter. But it's already. It is shipped, it is available now. And the crazy thing is, well, it beats Gemini 3.1 Pro on coding and agentic benchmarks, while also running about four times faster than comparable frontier models. So right now it is the default model in the Gemini app. And also so even if you're not a Google Gemini user, you're going to take advantage. This new Gemini 3.5 flash, that's because it is the new default mode in AI mode inside Google Search worldwide and it is going to be powering Google's new Gemini Spark. We're not going to be going over that today because hardly no one has access anyways. All right, so who has access to this now? Well, like I said, it's live, not just in the Gemini app, but also in AI mode and search. If you use Google's AI Studio or Vertex in the API and anti Gravity 2, which we're going to be going over today. All right, so it is an internal, sorry. The Gemini 3.5 Pro is expected next month. So Google did say that next month is going to be the Gemini 3.5 Pro release. Presumably what we think is going to happen. Right. And this is how it's kind of happened in the past around these timed big events. Usually an OpenAI or a Anthropic or someone is going to come out with a model usually in response and, you know, so kind of to dethrone Google here. I don't know if that's going to happen. Right. I don't know if we're going to see an open, you know, GPT 5, 6. We've seen rumblings of that. If we're going to get, you know, Sonnet 4. 7 or, you know, eventually an Opus 4. 8 or Opus 5, whatever it is, you know. So depending on the benchmark you look at. Yeah. Right now a flash model, Gemini 3.1 flash. So this is supposed to be the smaller, lighter, faster, cheaper version, although it's not really cheap anymore. But in some benchmarks it's the best model in the world. Although all around is it? No, it's not. But that's what we might see out of Gemini 3.5 Pro, which is going to be coming later. So here is what Google says on their kind of marketing. So they say Gemini 3.5 Flash delivers intelligence that rivals large flagship models on multiple dimensions at the speeds you have come to expect from the Flash series. It is our strongest agentic encoding model yet, outperforming Gemini 3.1 Pro on challenging coding and agentic benchmarks like Terminal Bench 2 GDP, Val AA and MCP Atlas and leading into multimodal understanding. So Google did share kind of the benchmarks, how this compares to both the old version of Flash as well as Gemini 3.1 Pro and Opus 4.7 and GPT 5.5. And you'll see here, at least on the screen that I'm sharing, there's actually a handful of benchmarks, right. I know it's not just about benchmarks, but there's a handful of benchmarks that actually the Gemini 3.5 flash is the best in the world. Right. So as an example, MCP Atlas, which is multi step workflows using MCPS toolathon, there's a couple others that this new quote unquote small fast model is the best in the world. That will obviously change once Google announces Gemini 3.5 Pro next month. And if we do get a, you know, GPT55 Sonnet 4. 7. Right. Whatever we may get from the other labs, but pretty big news. So here's why I think it's going to be useful. Well, you get Frontier, frontier level coding and Long Horizon agent work at the Flash tier speed. The, the price no longer. Right. So I think if you went back to an older version, I think it's like gemini, you know, 2.0 flash or 2.5 flash. Right. It was essentially 20 times cheaper. You don't have that anymore. So there's actually some things it's kind of going up in price on the Flash side if you look at previous Flash models. So with Gemini 3.1 flash it is getting much more expensive. So you know, no longer when you see the Flash moniker, at least out of Google Gemini, do you assume, oh, this is a cheaper model. I know a Lot of companies had really been using the Flash series for that reason because it gave you a pretty high intelligence level and a very fast output. So you still have the speed of the Flash but you don't quite have the same price breakthrough at least if you're comparing it to previous generations of Flash. So I still think it's going to be, you know, valuable for developers and engineering teams who are running anything on the agentic side, you know, operators that are just running, you know, deep research, multi step workflows, things like that. So it's, it's still a very capable model. I think probably the best, you know, overall, you know, I guess outcome for everyday people is if you use Google's AI mode, it's getting much better than it was before because this was on the previous version of Flash. All right, you know, here's someone from YouTube, funny name here, the permanent underclass 26 said the updates from Google hasn't made me want to try it over codecs. Yeah, for me I can't get out of Codex right now. And it is one of those things when you talk about, you know, speeding speed and price, right? One of the big things with speed as well, if you're just sitting there waiting. But now with Codex, when you can schedule things around the clock, some of the new updates now I think certain things on the speed side are less important for me personally and I think for a lot of other power users as well. All right, let's go next. So Next we have PowerPoint just got a little bit better. That's because there is a new Chat GPT integration. So here's what's new. This is a new native add in that puts ChatGPT directly inside Microsoft PowerPoint to help create, edit and review docs from natural language prompts. So you can pull source material from any of the connected services you have, which is the really cool thing, right? So you get to use a lot of the power of ChatGPT's kind of harness and tooling. It's not just using the model, right. You can use all of your connected services like you know, Gmail or your outlook or your SharePoint, right. So you can, can pull in all of that information, which is huge. So then you know, slides can be built from your existing emails, documents, spreadsheets, right. Without copying and pasting or having to manually go look that thing up. So it also includes a reasoning feature that critiques, drafts, flags weak storytelling and anticipates audience questions, not just slide generation. So this one, you know, just came out yesterday A lot of these are fresh, hot out the oven. So I haven't had a chance to try this one yet, but I'm actually super excited for some of those reasons. Aside from just having, you know, a new ChatGPT integration inside PowerPoint, which is really nice. Just that new reasoning feature that critiques and flags weak storytelling, anticipates audience questions, is really cool. This is a great, you know, use case or example of how it's so important to use a reasoning model day to day, right? For the most part most models are either reasoning or hybrid off the shelf. But this just goes to show you, right with this integration it's not just going to spit out a bunch of, you know, hopefully legible and and good text, but it's going to pull from your day to day, right? This is context engineering inside of PowerPoint, powered by one of the best harnesses around inside of ChatGPT. But also it, well it can kind of take a step outside of the work that you're doing in critique it a little bit. So Here is what OpenAI says about the new integration. So it says it turns conversations into presentations in minutes, so it can create and edit in plain language. So you can start from notes, docs, spreadsheets, prompts or an existing deck. You can ask ChatGPT to create slides, rewrite content, tighten the hierarchy and add a new section, or publish a draft while keeping the outputs editable. In PowerPoints you can get a clearer story from your deck. You can ask what your presentation says, where the narrative is weak, what's missing, and what an executive audience might ask. So you can use Chat GPT to summarize, restructure and turn dense slides into clearer takeaways and then you can follow along. So then you can trust the work. So OpenAI says that ChatGPT works inside of PowerPoint, reads your deck structure and helps preserve editable slide contents, review changes, check important claims and numbers and keep control of that you share. All right, so to install it you are going to want to look in the upper right hand corner, the add in the add ins, kind of toggle there and then search for Chat GPT and then you will need to connect your Chat GPT account. So this one is actually pretty cool because I know that there's still a lot of people that maybe either don't have the experience or the expertise yet and kind of vibing slides, right? So NotebookLM with Nanobanana is great for vibing slides, right? The new GPT images too great for vibing slides. People don't know this. I shared this on one of our shows. Right. You can create a codec skill that can kind of stitch all of those together so you don't have to do it like slide by slide. But here's I think you know well first, who has access. So it's available now for a lot of paid ChatGPT plans. So right now it's available in beta for ChatGPT Business Enterprise Edu, Teachers K12 Free Go Plus Pro. So essentially just about every single ChatGPT account. AI moves too fast to follow, but you're expected to keep up. Otherwise your career or company might lag behind while AI native competitors leap ahead. But you don't have 10 hours a day to understand it all. That's what I do for you when. But after 700 plus episodes of everyday AI, the most common questions I get is where do I start? That's why we created the Start Here series, an ongoing podcast series of more than a dozen episodes you can listen to in order. It covers the AI basics for beginners and sharpens the skills of AI champions pushing their companies forward. In the ongoing series, we explain complex trends in simple language that you can turn into action. There's three ways to jump in. Number one, go scroll back to the first one in episode 691. Number two, tap the link in your show notes at any time for the Start Here series. Or you can just go to start here series.com which also gives you free access to our inner circle community where you can connect with other business leaders doing the same. The Start Here series will slow down the pace of AI so you can get ahead. It does also obviously require a Microsoft account, so you have to have access to Microsoft PowerPoint, obviously. But here's why it's useful. So I mean, it removes the copy and paste from other tools, right? Kind of the. The human duct tape that I've been talking about for so long. It keeps the slide slides editable in the native PowerPoint structure, right. Rather than just producing a flat export. Because yeah, you might be looking at this and being like, wait, the new GPT Images 2 is absolutely amazing, right? Why would I, you know, not just use that? Well, that gives you a flat JPEG, right? And obviously in PowerPoint there's going to be some, you know, some things that you have to use in your, you know, your pre approved company template, right. A lot of times you just get these decks, you have to use them, right. There's all these different things. So you know, just right there, you know, being able to adhere and stick to your company's brand guidelines, but being able to kind of take a step outside of it, right? It's almost like having, you know, a junior consultant come in and they're an outside, you know, version of yourself because they're, you know, poking and prodding, but also being able to use some of the best features as inside of chat GBT as well. So I mean, who's going to find this valuable? Any, any knowledge workers, right? So if you are still kind of, you know, I know a lot of people are quote unquote forced to use PowerPoint. I know some people love PowerPoint by default. Me, I don't know why I'm still stuck on Canva, right? PowerPoint's great, but I know a lot of enterprise companies, I would say the majority of enterprise companies, when you're creating a deck, you still have to do it in PowerPoint, right? Even if you wanted to vibe something up in one of these other tools. So any knowledge worker that is living inside of PowerPoint, but also that has data scattered across any of those other ChatGPT connectors, this is going to be a huge time saver, right? This is why I think this Friday show is actually super helpful and important because chances are hardly no one saw this, right? Because OpenAI yesterday shipped so much on the Codex end, this probably slipped under the radar, right? Like be, be, be honest, live stream audience. Did anyone know about this, right? Did anyone know actually that ChatGPT came out for PowerPoint? Maybe Nissani did, right? Since she works at Microsoft. But I don't, I don't know. I didn't see anyone talk about this, right? I obviously have a lot of agents crawling everything 24 7. So when I saw this pop up yesterday, I'm like, why is no one talking about this? All right, something people are talking about. Our next AI news story. A new, well, technically not a video model from Google, more of an anything to anything model as they're calling is the new Gemini Omni Flash. So this is Google's. Let me kind of give, give everyone the visual there for our live stream audience. This is Google's new multimodal, what they're calling the world model that they announced at IO. So it generates and edits high quality video from any combination of text, image, audio and video inputs. So this combines Gemini's reasoning with DeepMind's Nano Banana, Vo and Genie lineage with claims of better physics, gravity and kinetic motion simulation versus prior video models. So this also supports conversational editing of existing footage, right? So you can upload a video and say, you know, change what's happening in this scene or change the background to make it Chicago instead of New York. Right. So pretty cool. So here's who has access and I'm going to get to my thoughts on it here in a second. So Gemini Omni Flash has already rolled out to paid users. So if you have a Google AI plus Pro or Ultra plan, it is available inside of the Gemini app as well as inside of Google Flow. And it is coming to YouTube shorts at some point, although we don't have a firm date on when. All right, so here's a couple important pieces to think about. So Google, well, they said this very directly. This is positioned as a family of models. So obviously the Flash name attached to it lets you know that there is going to be a bigger version. Whether that's called Omni Pro, we'll see. The other thing, I think people are kind of overlooking the true value of this. People are looking at this as just a model that outputs video, which is not really true, but that's all really you're seeing online is people are just demoing, right? Like, oh, here I put in this text prompt and got this video. But again, this is not a text input video output model. This isn't anything to, well, anything. Eventually it'll be an anything to anything. But right now it's kind of an anything YouTube video model. And that's actually important and an important designation to talk about because a lot of people are right now comparing Omni to Seed Dance. All right, so Seed Dance right now is the best AI video tool out there. So from a quality perspective, Google Omni Omni Flash at least is not the same quality as Seed Dance 2, right? A very popular Chinese AI video model. But I think that's. People are missing the point because this is video input video output. Not only that, but, you know, eventually my guess is it's going to be able to output world models. It's going to be able to output, I don't know, Google Street View, you know, in, you know, output CAD drawings, right? So I think if you just look at this as a text to video output and comparing it to C Dance and saying, oh, this new Google Omni Flash stinks. I think you're missing the point. But I do think one thing that it is instantly the best at in the world is video in, video, video out. And I think there's great examples of that that Google has shared already online that show if you, whether you're using something for production or if you're just, you know, kind of playing around to be able to upload a video without any real technical knowledge and change anything, right? Change the shirt I'm wearing, change the person in this video, but keep everything else the same. You know, like I said, if you. I actually talked with a, a client recently. They had a really good workflow on this using, I think it was a combination of seed, dance and Runway. But it was a lot of human hours, right? And the quality was, oh, you know, it was good, but it wasn't okay. Right. They did an expensive video shoot. They had to say, you know, they had to change some things on the back end. Right? The, the team was like, hey, this doesn't really fit our vision, even though this is how we shot it. So they had to use a lot of these tools and a lot of, you know, human hours to use these AI tools to change some things. But now it's much easier to do that piece with the new Omni model. So here is kind of what Google says about it. They said, we're introducing Gemini Omni, where Gemini's ability to reason meets the ability to create. Omni is our new model that can create anything from any input, starting with video. With Omni, you can combine images, audio, video and text as input and generate high quality videos grounded in Gemini's real world knowledge. You can also easily edit your videos through conversation. They said, today we're rolling out the first model in the Omni family, Gemini Omni Flash to the Gemini app, Google Flow and YouTube Shorts. Like I said, YouTube Shorts is not out just yet. In time, we will support output modalities like image and audio. And then they said, you know, one of the biggest things, like I said, I'll just leave with this and then we'll move on. But the biggest thing is being able to edit your video, right? Any, any video that you upload to be able to edit it conversationally and to transform the world around you. All right, so what do you guys think? Let me know. In the Spotify comments as well. I, like, I'm always at a loss. It's, it's like, I don't know if the, you know, if the average, you know, knowledge worker is really interested in these kind of things. If, you know, marketing departments, creative, you know, people like, should we be covering this more on the show? Should we stick to more on the, you know, agentic harnessing, you know, text input, output, coding side, you know, getting work artifacts done. Personally, I'm very interested in this side, right. In a previous life I did a lot of photo and video, right? I have, I don't know, five, at least five dslr, you know, high grade Canon cameras somewhere in a closet, right. I used to really be into photo and video. So for me personally, I love this kind of stuff. But you know, ultimately, when it comes to growing your business, let me know if this is something that you really care about or not. Not. All right, the next thing, maybe you care about this if you use Cursor. All right, so we don't cover Cursor a ton on the show, but I think this one is actually pretty important for what it represents. So here's our next AI feature this Friday. Cursor composer 2.5. So if you don't know anything, let me give you some background first. You know, Cursor is essentially an agentic ide, right? You can bring in a bunch of different models and you know, you can vibe code something, right? That's a very simplified way to say what cursor is, right? But think of like, you know, Codex or Claude code, but you can use any model. So it's built by cursor. But Cursor also has their own in house models. But until this new update, I don't think people were really using Composer. But with Composer 2.5, I think that could change. So this is Cursor's in house coding model that they just released. It is built on top of Moonshot AI's Kimi K25 checkpoint, but with 25 times more synthetic training data compared to Composer 2. So kind of the sweet spot is they're targeting Long Horizon agent encoding, multi file edits, terminal sessions, iterative debugging, and sub agent recovery. So here's some of the reasons why this is really important. Because for the first time, you know, we have a model from the coding and agentic side that is at least in the frontier column, right. For the most part, it's always just been, you know, gp, the latest version of GPT, the latest version of Opus, and the latest version of Gemini Pro. And then there's usually a pretty big drop off with everyone else. So Composer is not ahead of any of those three models, but it's at least in the same league. It's in the same territory now, which is big, especially when we look at the price, because on things like Terminal Bench, right, and Swee Bench, two of the more important kind of agentic coding benchmarks, it's literally neck and neck with Opus 4.7 on those, it's way behind. Right, terminal bench GPT 5.5 is in another category, but sweet bench multilingual at least. Uh, it's, it's on par with, you know, the big players. And this is really important. So this is live now. So who has access? It's live now for all paid plans inside of Cursor and they also doubled included usage for subscribers for a week after launch. Here's why it's important. All right, so you are getting a near. And I think a lot of people using Cursor are still heavy Opus users, right. Via, you know, Anthropics model. One of the reasons is, I mean, I feel fine saying this because the data backs it up. The harness of Claude is really bad. Right. When you compare it across, there's, there's actually harness benches. Claude is the worst. It's one of the actual worst harnesses or pieces of software to use for their own models. Which is why I think a lot of people have used Cursor in kind of a sweet spot. But the price for Opus is, well, compared to this, it's now doesn't make sense, right? So now with the new Composer 2.5, it is $0.50 per 1 million token inputs and $2.50 per 1 million token output. So let's compare that. That's about a tenth of the price because on the Opus 4.7side, that $0.50 input becomes $5 and that 250 output becomes 25. So it is literally a tenth of the price. But about, about, you know, depending on what metric you're looking at, it's about 90ish percent of the capabilities. Right. Just a general broad overview. If you're looking at benchmarks across the board, maybe 80 to 90%. So you're getting 80 to 90% of Opus capabilities at 10% of the cost. And this is one of the things not to go off on a side tangent. Right? So much of Anthropic's business model is people using their models via the API. Right. Their own, you know, tooling and rate limits and harnessing is fairly bad. So that's why I'm always like from a moat perspective, when you see things like this and when you see these Chinese models come out that are great at coding, but even something like cursor and composer 2.5, pretty big. So this is, you know, going to be useful for people that are running, you know, long sessions that you need to stay coherent. You know, companies that are maybe having to cut back on, you know, anthropic token budgets. Right. Companies, a lot of companies that are using Anthropic are hitting their, you know, budget allocation for the year already or are already having to see themselves cut down on token spending. Right. The token maxing of, you know, quarter one is now showing up on the quarter two budget sheets. So I think that's going to be a key factor here. So engineering teams that are running heavy background or batch agentic workloads are going to like this. Any cost sensitive solo developers or small teams that have kind of priced themselves out of, you know, Opus 4. 7 or, you know, GPT 5. 5 high, you know, or if you're just obviously a Cursor Pro subscriber. So pretty, pretty. Actually big news. Like I said, I know we don't cover Cursor a ton, but I think why it's a big deal is for anyone that's actually using anthropic opus4.7 and aren't liking the, the bugginess and the uptime right in their desktop app or their cli or if you're just seeing all these Codex features and you're like, wait, I want that. Or you're just pricing yourself out. It's actually a pretty, pretty big, pretty big update. All right, got to get a drink of coffee there. All right, our next one, Amazon Alexa. Oh my gosh, there's an update. Are we actually going to talk about Alexa here on the show? Yes, we are. So it's an interesting one. They're just kind of taking notebook LM's most popular original feature that made it go viral, the Deep Dive where it's two AI voices having a great podcast discussion and they're trying to incorporate that into Alexa. So this is a new Alexa plus feature that was just launched and it generates full audio podcast episodes from AI hosts on demand for any topic that the user requests. So here's specifically what it is. So it's two AI generated voices that co host in a conversational back and forth similar to the Notebook LM audio overviews, but they're not prompted from documents. So users can adjust the length, tone and focus. After Alexa proposes an outline, then you can finish the episode and it's delivered as a notification to your Echo show devices and then saved in your Alexa app. So here's the, you know, the downside is, well, you kind of, you got to have Alexa plus. So it is already starting to roll out in the US and it's included if you already have Alexa plus at no cost. So if you do have Alexa plus, FYI, if you are an Amazon prime member, if not, it is a monthly fee. So yeah, if you're already paying for, you know, Amazon prime, which I think a lot of people are like, I don't know how the world existed before you could, I don't know, get like Nespressos delivered to your door in like an hour. But a lot of people probably don't have Alexa plus or they didn't know that they could get it via Amazon Prime. So it might be worth checking out to see if you want this kind of Notebook LM flavor on the go. So here is what Amazon said about it. So they said creating custom audio content is now effortless. No documents to upload, no prep work needed. Just tell Alexa what you're curious about and it does the rest in minutes. Alexa will pull together the relevant information, give you an overview of what it plans to cover, and let you adjust the length and direction conversationally before generating anything. Once you're happy with the plan, Alexa creates a recording with AI generated host voices. When your episode is ready, you'll get a notification on your Echo show device and the Alexa app. Just tap to start listening. You can also find it in the Music and More section or tune in through the Alexa app whether you're on the go. All right, here's one thing that I want to call out. All right, it's not Tuesday, but I have a small little hot take. So clearly this is Amazon calling out Google in their Notebook lm. But the thing I don't think Amazon understands here, and we'll see, because personally, I don't see this working out very well. Number one, Alexa plus is, is hot garbage, right? Alexa plus still garbage. Siri still not usable. Maybe that changes, you know, in a couple of months when and if. Oh, this is funny. Sorry. I just had a Codex Automation pop up on my, on my screen here. I should probably turn that off when I'm doing my live stream, right? But you know, Amazon here really going after and calling out Google and Notebook LM by name by saying you don't need to upload any documents, right? Which, here, here's the thing. Alexa plus is not good, right? It hallucinates. It's terrible. It doesn't understand what you're talking about. So for me personally, at least for my use cases that I use audio overviews for, and I use them all the time, I want to provide those documents, right? I don't want Alexa plus to, you know, let's just say I want. Hey, give me the latest AI news updates this week, right? If you try to replace me out there, don't you do it at Least not with Alexa plus Podcast, because I'm guessing it's going to be abysmally bad, it's going to be laughable, it's going to pull things from months ago, right? So at least for me, Amazon here is trying to, you know, make a point here where, I don't know the point, I think, and why Notebook LM audio overviews are so freaking good is because they do work on your documents and they ground it only in your documents. So I think at least it's worth talking about here because it's interesting that Amazon is, you know, literally intentionally calling out, right? You don't have to upload documents. Everyone knows this is a straight up copy of notebook LM's audio overviews. Personally, I don't see this coming out well. Like, I don't know, some of these big companies, I think you should probably talk about this and say this out loud and I don't know, maybe ask someone outside of your, your own company, right, if this is a good idea or bad idea. Because I think they're trying to make a splash with this to get people to use Alexa plus they probably thought it would, you know, maybe go viral like the audio overviews did. I don't think it's going to turn out well. All right, speaking of that, since apparently I'm giving my opinion on all the AI feature updates, I don't think this one's going to turn out well, at least not in the short term for Google. So the next AI update is Google's AI Anti Gravity 2.0. All right, so a lot of, you know, IDE and CLI terminology, right? Someone told me on LinkedIn the other day, they're like, Jordan, you use too many buzzwords. Explain things. All right, so IDEs are essentially these, you know, agentic coding interfaces, right? And a CLI command line interfaces when you're coding or using something like Claude code or codecs via the terminal, right? So your, your clis, that's kind of how you're coding in the terminal, right? And then you have these desktop applications that allow you to code, but it's a little more user friendly. That's kind of your ides. So that's what Anti Gravity is. Think of it as well. It's Google's new version of Codex. Little confusing, but let's talk about what's new. And yeah, it wasn't all good. So this was also announced at Google IO and it's an expansion of the original Anti Gravity ide. And Google's kind of first foray I guess at least if they're putting a lot of momentum behind into a full agent first development platform. So yeah, here's where it starts to get confusing and why you need a map and a translator. And yeah, it's, it's confusing. So there's five services. So there's now a standalone desktop app which I tried. There's still the Anti Gravity cli which is codenamed agy, which replaced the Gemini cli. Right. There's also an Anti Gravity SDK. There's the managed agents in the Gemini API and also Google's AI studio can use the Anti Gravity harness. So it's also a harness now. And yeah, and there's still the Gemini desktop app. So it's different, so it's confusing. So let's at least talk about who has access. So the desktop app is now available on Mac os, Windows and Linux. There is also a new pricing tier and they even adjusted the, temporarily tripled the limits inside of Anti Gravity. I think that's just a short term thing though because a lot of people were not happy when it rolled out with the limits and just overall with the product. So the other thing that people are very upset about is the Gemini CLI is, is being deprecated for consumer users. So that's I think is the biggest thing that a lot of people were upset about because I believe the Gemini CLI was open source and a lot of people loved it. So essentially Google was allowing this great command line, you know, coding tool. And I do know originally when they announced it it was like free tokens, originally. All right, so now they're sunsetting it in June and this is part of the replacement. So let's talk about why it's useful. Well, if you are, I'm trying to be honest and also factual here because for me I don't, I, you know, I tried it, I don't find it useful. I don't think Anti Gravity is very good right now, which is interesting to me because Google seemed to push it way more than they should have. If I was Google I probably would have announced this on the download. I wouldn't have put a lot of, you know, given this, a lot of, you know, shine at the keynote. But they did and they're continuing to push it afterwards, which is interesting because it's not good. Right. A lot of what Google released is really good, right. Gemini 3, Gemini 3 flash, if you're using it inside the Gemini app, right. Not paying for the usage. Really good. Gemini Omni, really good, really exciting. A Lot of other things that they announced that maybe aren't available yet to anyone. Really good. Anti Gravity 2.0. Really not good. Right? Especially when you compare it to Codex. And why I do have to bring this up is because Google has gotten a lot of flack and probably rightfully deserved. Because in the actual announcement video, I kid you not, in the actual announcement video from Google, it was a 16 minute video. I watched it. About the two or three minute mark. I don't know if no one edited it or watched it, but literally, on the person that's demoing the new Anti Gravity, there is a desktop folder that says Codecs. All right, Or a folder on their computer that says Codex. And why does that matter? Well, visually, this looks like Codex. It looks the exact same, right? When I opened it, I obviously had codecs open on my computer, and I opened and installed Anti Gravity, you know, the hour after it came out. And I'm like, wait, why is this not launching? And where did all my Codex chats go? Like, I literally thought that. Because it looks identical to Codex. Right? And actually the Codex team called this out, you know, tongue in cheek. But yeah, again, I'm not sure I think Google could have gotten this right. Maybe they felt the need to get out an unpolished version, which I commend companies for putting something out that's maybe not quite perfect, but if you're gonna do that, don't highlight it as much as they did. You know, just say, hey, this is a beta product. You know, it's. It's working. You know, work in progress, but go, go use it. So, all right, let's wrap up here because our last one, I think, is the big one. All right. Oh, yeah, sorry. Joe said quiet. Don't say A L E X A's name. Yeah, sorry, I've already done it. So let's see. Alexa. All right, we'll see. I just screwed up, probably for a lot of people. Sorry. Yeah, I think we need a code name if I'm covering, you know, Alexa or Sira to. To not trigger that for you. All right. But yes, Google obviously won the week when it came to total releases. But like I said, you almost have to have a degree in daily AI like me to understand what Google released. Because a lot of the things I wanted to cover this week, but I couldn't because some things are only available to workspace members. Right, Some things. Or, sorry, to only available. Well, some things are only available to workspace members. Some things are only available if you're a free gmail user. Some things are a paid personal Gmail user. Right? You know, you have to be an early tester somewhere. Things are in beta. It was too hard to navigate what's actually available. And that's what this show is. Which I think if you want to talk about the most consequential single announcement this week, I think it's Codex. I know you're probably tired of me, guys like saying Codex, Codex, Codex. But let me repeat what the most important thing for you to hear right now is. Ready? Codex, Codex, Codex. The team at OpenAI continues to embarrass the entire AI industry with how good and useful Codex is and how far it is. Right. I said about two months ago that Anthropic was winning 2026, but I think that tide has shifted and I didn't think anyone would catch them. Maybe not until the end of the year, certainly not in the second quarter, but I think OpenAI has retaken the lead in 2026, especially after a somewhat lackluster, at least, response to Google I O I think a lot of people thought Google I O we were going to get the Gemini 2 point or 3.5 Pro. We were going to get a more cohesive and easier to understand way to use all of Gemini's products. But it was just fragmented, right? Some good products, some confusing, but all over the place. But Codex continues, and OpenAI continues to put more and more useful features that are easy to use inside of one single platform. So here's what's new. Yeah, because now OpenAI has moved to this new weekly Thursday drop. So a lot new. So one called App Shots. So this is essentially you can press both command keys when you have Codex open and it takes the foremost app window as a kind of like a screenshot and it brings it into Codex. But it's not just a screenshotting tool because you'd be like, okay, well I can screenshot anything and drop it into Codex. It also brings the text and the context of that window into Codex. So it just makes day to day working with other platforms so much faster. One of the things that I think slows me, quote unquote, slows me down from my Codex flow. Even though I have a good screenshotting program, right. I use Clean Shot, but it slows me down to, you know, do my little keyboard command, you know, shift command four, drag my mouse and then drag the screenshot over, even though it's a very efficient way to screenshot. Right. But then I still don't get the context and the text from that app. So that's a lot of. At least what I do. And I know I have to explain this because you might just be like, okay, what's the big deal? It's actually a big deal, right? This little thing called app shots. So that's new. The other thing, Goal mode is finally in the desktop app. So Goal mode is absolutely bonkers. So Codex was first, but they only brought this to the cli, to the command line interface version of Codex. Then Anthrapic followed suit in broad gold mode to their cli. So this is the first Goal mode that is in the user friendly desktop interface and it is in Codex now. Huge. So this essentially graduated out of the experimental phase and is now available in the Codex app. And it allows Codex to drive autonomously toward a specific objective for hours or even days. All right, so I will warn you, all right, Goal mode requires, well, you need a little bit of experience and you need a very specific prompt with use cases, examples, all of that. The last thing you want to do is go give Codex a very wide and general goal because it's going to work to that. And, well, even though OpenAI's rate limits are the best in the business. Yeah, Google even downgraded their paid rate limits after IO anthropics are laughable. You know, open AIs are the best in the business by far. But if you give Codex a goal that's way too broad, it's gonna work and work and work to get it done. So if you're on the $20 a month plan, it's not gonna last as long as you might want. All right, so the three big ones, app shots, number one, two, goal mode, number three. This one's actually big. You no longer have to carry your laptop halfway open. So they have a new remote computer use. Okay, so this is different than being able to control codecs via your phone inside of the Chat GPT app. This allows when your computer is locked. All right, so when the lock screen is up, I don't know how, how is the Codex team shipping these things so quickly? That just defy the laws of software, right? So yes, you can use computer use now while your computer, if you, you have a laptop, it's, it's closed, it's locked. Absolutely bonkers. And you can still control it, obviously via the Codex mobile section inside of the Chat GPT app. So who has access? Well, everyone. Yeah, that's the other thing. Everyone. So go use Codex, I already told you. And go, go make sure you repost Wednesday show if you haven't already gotten the cookbook. People are loving it. All right, we gave that away. If you go repost Wednesday's show show, which was part two of our codec series. All right, so the codecs, obviously a lot of these things are only available for Mac users. I know window users aren't going to like that. But that's why, you know, it's very hard to develop things for Windows. FYI, it's much easier to develop things for Mac, which is why the cutting edge of AI lives inside of Mac for now. All right, so here's why it's useful. So I mean, app shots, it collapses. One of the most annoying handoffs in the agentic workflow is just explaining or pasting what's in another app. I mean, literally the way they have it set up by default. This is how much I, I am loving what OpenAI is shipping. Just think how do your hands sit on the keyboard, right, for the most part your, your two thumbs sit near the space bar. So to launch this on a Mac, you hit the command key twice, which is the key just to the left and the right of the space bar. So talk about the small details that really matter. This stuff is huge. So goal mode, that's going to be extremely useful. This is your, you know, your RALPH loop, right? This is your agent that's going to run for, you know, hours or maybe even days. Right. That other vendors keep promising. It's here, right? And then remote computer use obviously just closes the loop for genuine background autonomy. So you don't have to like leave your Mac, you know, your MacBook open and plugged in and you know, turn off all the power settings overnight. My gosh, I forgot another one. This is how much open AI just shipped yesterday. And then they also shipped shareable plugins across teams, which is a huge pain point for enterprise teams. Right. I did a quad actually Claude, training for a huge enterprise client a couple of weeks ago and one of their biggest pain points company wide was how do we share skills within Claude? And it actually is a big problem that there's no built in way to do that. And OpenAI just fixed what I think is one of the biggest problems that no one really talks about in the enterprise, which is the plugins, right? So plugins are skills plus app use and now you can share those across teams if you have a team account inside of Codex. My gosh, do you guys see why now I'm just like, codex, Codex, Codex. Google shipped a lot of great stuff. It's hard and confusing to use none of it works together right. So a lot to cover this week. Sorry this one went a little long. There was a lot to cover, but I hope this one was helpful. If so, please, if you're not already subscribed to the podcast, please, please do that. That really helps both me keep the show going and it helps other people find this who don't just want, you know, a bunch of people reading a script and, you know, shilling for some product. I try to give it to you straight, you know, I always have and I continue to always will do that. So please subscribe to the podcast and then go to our website. Your everyday AI.com Sign up for the free daily newsletter. Thanks for tuning in. We'll see you back next week, actually Tuesday. All right, Monday Holiday. We'll see you Tuesday with the AI News and every day after that for more Everyday AI. Thanks, y'. All. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going for a little more AI magic. Visit your everydayai.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.