EP 423: AI News That Matters - December 16th, 2024 - Everyday AI Podcast – An AI and ChatGPT Podcast

Summary

Everyday AI Podcast – EP 423: AI News That Matters - December 16th, 2024

Main Theme & Purpose

In this energetic roundup, host Jordan Wilson unpacks the week’s most significant AI stories, focusing especially on the rapid-fire rivalry between OpenAI and Google, while also touching on key developments from Apple, Klarna, and the broader AI landscape. The episode is structured as a fast-paced briefing to keep listeners ahead of the ever-accelerating AI curve and make practical connections to business and career growth.

Episode Structure

OpenAI news avalanche (including new tools, features, and integrations)
Other notable AI stories: Apple, Klarna, Devin, Microsoft’s Phi-4, and more
Google’s massive announcement spree (Gemini 2.0, Project Astra, new agents, and more)

Key Discussion Points & Insights

OpenAI News Highlights (02:14 – 17:00)

OpenAI Sora: Text-to-Video Generator (03:09)
- Sora, OpenAI’s new tool, creates videos from text prompts.
- Available to ChatGPT Plus ($20/month) and Pro ($200/month) users; only Pro users can generate videos of humans.
- Notable: Signups have reopened after being paused when demand overwhelmed OpenAI's servers.
- “Sora is wiping the competition... by far the best ones are also going to be Sora. But this is the worst it will ever be.” (04:43, Jordan Wilson)
- Compared to established rivals (Runway, Kling, Luma Labs), Sora is described as having the highest ceiling: its best generations beat the competition, even if its worst ones do not.
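In the episode, Jordan likens video-model comparisons to the Elo scores used for chatbots: one prompt, two outputs, and a vote for the better one. As a rough sketch of how such a pairwise rating could be updated (this is the standard Elo formula; the starting ratings, the K-factor of 32, and the function names are illustrative assumptions, not figures from the show or from any real leaderboard):

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


def update_elo(rating_a: float, rating_b: float, a_won: bool, k: float = 32.0):
    """Return the new (rating_a, rating_b) after one head-to-head vote.

    k is a hypothetical K-factor; real leaderboards choose their own.
    """
    expected_a = expected_score(rating_a, rating_b)
    score_a = 1.0 if a_won else 0.0
    new_a = rating_a + k * (score_a - expected_a)
    # B's actual and expected scores are the complements of A's.
    new_b = rating_b + k * ((1.0 - score_a) - (1.0 - expected_a))
    return new_a, new_b


# Two hypothetical video models start level; A wins one comparison.
model_a, model_b = update_elo(1000.0, 1000.0, a_won=True)
print(model_a, model_b)
```

A single vote moves the ratings only a little; a model with a high ceiling but a low floor would win some comparisons and lose others, and its rating would reflect the average over many votes.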
ChatGPT Canvas Expanded (06:10)
- Canvas, a real-time collaborative workspace for content editing, is now available to all users, including free tier.
- Adds the ability to run Python directly in Canvas (previously, only Anthropic's Claude Artifacts could render and run code in the browser).
- “Go ahead and upload a spreadsheet in there... say like hey, create me a visual that could be helpful. Create a dashboard that could be helpful and it will do it.” (10:00)
Advanced Voice Mode Gains Video & Screen Share (11:20)
- Now includes vision: ChatGPT can “see” via phone camera—think: virtual whiteboards, math homework, live business brainstorming.
- User Remarks: “I love the video feature but haven’t discovered much usefulness to it yet… so far just used it for fun.” (13:05, Michael)
- “It remembered my dog’s names.” (13:15, Dr. Harvey Castro)
ChatGPT ‘Projects’ Feature—Folders & Metadata (14:00)
- Users can organize files and chats via folders, reminiscent of Anthropic Claude’s projects and Google’s Notebook LM.
- Early testing shows better document retrieval than custom GPTs for some tasks.
Apple Intelligence Officially Taps ChatGPT (17:00)
- Apple’s “significant update” integrates ChatGPT into Siri for more complex queries, now active on iOS 18.2 and macOS 15.2.
- Jordan: “It’s a nothing burger... all stuff that we had through other tools pre-ChatGPT…” (18:57)

AI News Beyond OpenAI & Google (19:50 – 32:55)

Cheaper iPhone for Apple Intelligence (20:40)
- Rumored iPhone SE4 to feature Apple’s A18 chip, enabling advanced AI features at a lower price (potentially $499–$599).
Eric Schmidt (Former Google CEO) Warns of Runaway AI (22:18)
- Cautions on self-improving AIs; proposes dual-system approach—one monitoring the other for safety.
- “AI systems capable of independent decision making could emerge within two to four years.” (23:50, paraphrased)
Klarna Embraces AI Over Human Rehiring (25:15)
- Klarna’s CEO Sebastian Siemiatkowski touts workforce reduction from 4,500 to 3,500—attrition is now mostly filled by AI, not new hires.
- “We’re just not hiring people anymore... as many human roles over to AI as possible.” (26:00)
Devin by Cognition Releases, $500/month (28:26)
- Positioned as a junior developer AI; handles ongoing engineering tasks without frequent human prompting.
- Integrates with Slack, IDEs, APIs; intended for code cleanup, bug fixing, drafting press releases, etc.
Microsoft Phi-4, Mini Model—Big Performance (30:49)
- New 14B parameter model reportedly outperforms much-larger rivals in math and reasoning.
- “Small models... the future is hundreds of specialized small models.” (31:30)
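For a sense of scale, the size gap Jordan describes can be checked with quick arithmetic. This is only a sketch: the 14 billion figure is Phi-4's stated size, while the 1.8 trillion figure for GPT-4o is the unconfirmed "reportedly" number he cites in the episode, and the variable names are ours.

```python
# Parameter counts as quoted in the episode.
phi4_params = 14e9                # Phi-4: 14 billion parameters
gpt4o_params_reported = 1.8e12    # GPT-4o: reported, not confirmed by OpenAI

ratio = phi4_params / gpt4o_params_reported
print(f"Phi-4 is about {ratio:.2%} of the reported GPT-4o size")
```

Under that assumption the ratio comes out a little under 1%, in line with the "like 1% of the size" remark in the transcript.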

Google's AI Avalanche (33:04 – 48:05)

Gemini 2.0 Launches; Agentic Era Beckons (33:30)
- Debuts with “Flash” model (cheap, fast), already besting old larger models in benchmarks.
- “Even though it is a flash model... this thing is a banger.” (34:50)
Google Agent Space for Enterprise AI Agents (36:12)
- New cloud platform for branded company AI agents, integrates with Google/Microsoft tools.
- “Google goes typical go-to-market... you can sign up for early access—but will you get it? Probably not.” (37:30)
Project Astra Progress—AI Glasses and Physical World Assistants (39:17)
- Demonstrated but still unavailable.
- Hint at future where persistent on-body agents (glasses, XR) assist contextually.
- User Opinions: Divided on return of AR glasses—“I don’t know how realistic [it is].” (41:17)
Android XR: Extended Reality Collaboration with Samsung, Qualcomm (43:07)
- Platform bringing Gemini to “glasses and headsets.”
- Early launch with Samsung's Project Moohan headset expected next year; aims to be a much cheaper, lighter alternative to Apple Vision Pro.
Google Notebook LM—Paid Plan & Audio ‘Call-In’ (44:32)
- “Call-In” feature lets users interrupt AI podcast-mode overviews for real-time Q&A.
- “This one little feature... I think is one of the most exciting small features that everyone is overlooking.” (46:09)
- Paid (Plus) tier enables team sharing, higher limits, and new content creation pane.
Project Mariner: Autonomous Chrome Agent (47:20)
- Chrome extension lets Gemini perform tasks for you online (e.g. research, shopping).
- Competes with Anthropic’s “computer use,” but more accessible.
Deep Research: Google’s Perplexity Rival (48:20)
- Tool within Gemini Advanced for generating in-depth, multi-source reports.
- Jordan: “It visited 169 websites... took about 2 or 3 minutes. Can you imagine that?” (49:01)
- Direct shot at Perplexity’s crown for AI research assistants.
Google AI Studio: Real-Time Multimodal Voice & Vision (49:44)
- Users can now interact with Gemini via live voice/vision for instant answers and analysis, both desktop and mobile.
- No waitlist; first fully-integrated, vision-enabled voice mode among major AI providers.

Notable Quotes & Memorable Moments

On Sora’s capabilities: “Sora is wiping the competition... Sora by far has the highest, highest ceiling... This is the worst it will ever be.” (04:43, Jordan Wilson)
On Apple’s ChatGPT/Siri integration: “Siri is the middleman or middlewoman now. Just, you know, passing off all our queries to... ChatGPT. That’s funny.” (19:45, Jordan referencing audience comment)
On AI workforce disruption: “Klarna was one of the big, you could say AI case studies of just essentially unabashedly handing over human roles, being like, nope, nope, we’re not hiring humans anymore.” (26:44)
On Google and AI agents: “Google did not really create any marketing, any messaging, seemingly any real strategy... and then the middle of last week, Google went bananas. B A N A N A S. Like they went wild.” (33:08)
On Google’s Notebook LM ‘call in’: “This one little feature... I think is one of the most exciting small features that everyone is overlooking. So don’t sleep on that.” (46:09)
On Gemini vs. Perplexity in research: “I use this to help me research one of my shows last week. A single prompt inside... it visited 169 websites... took about 2 or 3 minutes.” (49:01)

Timestamps for Key Segments

02:14 – OpenAI News: Sora, Canvas, Voice Mode with Video, Projects, Apple Intelligence/Siri
19:50 – AI News Beyond OpenAI/Google: iPhone SE4, Eric Schmidt, Klarna & Workforce, Cognition Devin, MSFT Phi-4
33:04 – Google Segment: Gemini 2.0, Agent Space, Project Astra, Android XR, Notebook LM, Project Mariner, Deep Research, AI Studio Multimodal
44:32 – Google Notebook LM paid tier, “Call-In” feature
47:20 – Project Mariner
48:20 – Deep Research
49:44 – Google AI Studio, real-time multimodal vision/voice

Listener Q&A & Tone

Jordan maintains an informal, quick-witted, and often humorous tone. He actively involves live stream comments and gives hot takes (“meh,” “nothing burger,” “bananas,” “banger”), while making technical concepts approachable for both techies and everyday professionals.

Summary Takeaway

December 16th’s episode embodies the whirlwind pace of the AI industry. OpenAI’s and Google’s race to outdo each other yields a wave of new capabilities—from Sora’s video generation and truly collaborative multimodal tools to Google’s research and agentic breakthroughs. Meanwhile, the workforce (see Klarna), user device requirements (Apple/SE4), and small model innovations (Phi-4) show AI’s tentacles reaching everywhere. Jordan’s rapid-fire insights make the news actionable—for those seeking to apply AI in business, career, or even everyday communication.

If you want to be the most AI-savvy person in the room, this week’s Everyday AI news recap is an essential listen—or, thanks to this summary, an essential read.

Transcript

[00:01]
A
This is the Everyday AI show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business and everyday life.
[00:17]
B
It was never actually called this, but everyone has been calling the last week in AI "AI week," mainly because OpenAI and Google have been in a straight-up slugfest, going punch for punch in releasing very impressive new updates to their respective ChatGPT and Gemini products. But there was a lot else going on this week in AI news, and we're going to be covering it all today. What's going on, y'all? My name is Jordan Wilson and this is Everyday AI. Welcome. This is your daily livestream podcast and free daily newsletter helping everyday people like you and me not just keep up with what's going on in the world of AI, but how we can all actually get ahead to grow our companies and to grow our careers. So if that sounds like you, you are in the right place. Thank you for tuning in. So everyone from Brian tuning in from LinkedIn, Michael, Philip joining on YouTube, thank you all. We do this almost every single Monday, going over the AI news that matters, because you can spend literally hours every single day trying to keep up with AI news, but you can't, right? That's what I do. So that's what you pay me for. Well, you don't pay me anything. You just tune in live and spend 40ish minutes on Mondays. And you become the smartest person in AI at your company because you know everything that's going on and why it actually matters. So if you haven't already, please go to youreverydayai.com. You know, you learn here, but you leverage it there with our free daily newsletter, where every single day we recap the show insights that you really need to know to grow. I didn't mean to rhyme. That was an accident. But you can do that there, as well as go listen to and watch and read more than 420 episodes that we've had of the Everyday AI show. So no matter what area you're in, marketing, sales, healthcare, doesn't matter. We've had expert guests from around the world there. So without further ado, let's get straight into the AI news that matters for the week of December 16th.
All right, let's get after it, y'all. And we're gonna do it a little different this week because there is literally so much OpenAI and so much Google news. We're going to sandwich it like this. We're going to start with everything OpenAI. We're going to get to everything else that's not Google, and then we're going to get to all the new Google AI news at the end. Yeah, it was one of those kind of weeks. I think we have five stories from OpenAI, like seven from Google, and a lot more. So it's going to be fast and furious, y'all. But to our live stream audience, thank you for tuning in. Let me know which one of these news pieces is going to impact you. I love going through after the show and reading comments from our live stream audience. Thank you so much. So let me know what you guys think of all of this that's going on. All right, first, and like I said, this is going to be a very, very fast roundup of each story because we got so many to get to. So OpenAI has released Sora. Yeah, this happened like three hours after our show last week, so we had to fit it in. But OpenAI has released Sora, their new AI tool for generating videos from text, accessible to premium ChatGPT users. So whether you are on the $20 ChatGPT Plus or the new $200 ChatGPT Pro, you will have access to Sora. And they did reopen signups, so they did have to shut signups down because their servers were legit melting. Signups are back up. So the new tool can create high-quality videos like, whatever, sumo wrestling bears, right? But you cannot create videos of humans if you are on the $20 plan. That's important to know. You do have to be on the $200 ChatGPT Pro plan in order to do that. So right now the company is prioritizing the prevention of harmful content and is working with artists and policymakers to address concerns. Because there's a lot of concerns when an AI video tool as powerful as Sora hits the market.
I'll say this, there's plenty of other AI video tools on the market that have had way more time out, way more training data than Sora, such as Runway, such as Kling, such as Luma Labs. OpenAI's, with Sora, their ceiling is higher. So for every single generation, right? You can run the same generation 10 times and get 10 different outputs. If you do that with every single video generator that's out there, Sora is going to be the best. Right? In the same way that there's Elo scores for chatbots, right? You know, you put in a prompt and you get two outputs. Same thing for video. And guess what? Sora is wiping the competition. So Sora by far has the highest, highest ceiling. It doesn't mean its floor is the highest. Right? If you do 10 of those, you know, some of the worst ones might be Sora, but by far the best ones are also going to be Sora. But this is the worst it will ever be. All right, we got to get to all the ChatGPT updates. They launched updates to their Canvas tool and they also made Canvas available to all users, including free users. So Canvas, if you haven't heard of it, it enhances the user experience by allowing real-time content editing while conversing with ChatGPT. And also the other big updates. So number one, it's now available to free subscribers. Number two, it can run Python directly within the tool. Pretty cool. So this new integration, well, this updated integration with GPT-4o enables automatic activation of Canvas because it is now a tool. So previously it was a mode that you had to select from a dropdown menu when starting a new chat. Now it is kind of near the chat bar as a tool. So pretty cool. But you know, the most powerful models, such as the o1 model, Canvas does not work there just yet. Also, the Python thing, that's a shot at Anthropic, y'all. You know, I'm going to try to give you my two cents on all these news stories.
Anthropic has had one huge advantage over every other large language model, I would say, by far, with its Artifacts feature, which can render all kinds of code right in the browser. So this is kind of a shot at Anthropic, and there's another one coming here soon from OpenAI. But right now, at least with ChatGPT's Canvas, you can only run Python, where in Claude Artifacts you can run all kinds of code. And even if you're not a technical person, that doesn't mean anything, y'all. Like, you can use Python, you might not even know. You can upload a huge spreadsheet and just be like, yo, create me some graphs, some visualization, right? That's Python. So, you know, if you don't think that you might use this new Python development and you're a ChatGPT subscriber, go ahead and upload a spreadsheet in there and toggle on the Canvas mode and say like, hey, create me a visual that could be helpful. Create a dashboard that could be helpful, and it will do it. All right. More OpenAI news: video has come to Advanced Voice Mode. So OpenAI has enhanced its Advanced Voice Mode with video and screen share capabilities, allowing it to see through a phone camera, and "see" is in quotes, as users perform tasks like, you know, whatever you may be doing and wanting to show Advanced Voice Mode. So this update comes just after Google announced its Gemini 2.0, which we're going to get to, which also had some of these live video features. So this new ChatGPT feature, essentially, Advanced Voice Mode has been out since September, and OpenAI demoed this Advanced Voice Mode with video back at its spring event. So, like, what, that's like six months ago. So what this means: you now have the Advanced Voice Mode with video inside of ChatGPT. We did a quick review on our YouTube channel that we also shared in our newsletter. I had some problems with it at first. I've tried it since. It's gotten a little better. You know, like I said, OpenAI's servers were getting crushed last week.
But essentially think of the Advanced Voice Mode, if you've used it before, but now with video, right? You could be working on a math problem on a sheet of paper. You could be, you know, drawing something out on a whiteboard, you know, strategizing something for your business, and you automatically have a smart AI assistant that can quote unquote see and collaborate with you in real time. Michael says, I love the video feature but haven't discovered much usefulness to it yet. Interesting. So far he's just used it for fun. Dr. Harvey Castro says, I love the screen share and video OpenAI features. It remembered my dog's names. Yeah. Hey, podcast audience too, hit me up. I always have, you know, in the show notes you can email us, you can text the show, although I can't text you back, FYI. Also I put my LinkedIn in there, so I would love to hear use cases from those of you that are using Advanced Voice Mode with video and have found great uses for it. I would love to hear. All right, more OpenAI. So another shot, or maybe taking a page out of Anthropic's book. OpenAI has introduced Projects in ChatGPT for, well, just better file organization and some new features. So OpenAI has launched Projects in ChatGPT, a feature that allows users to organize files and conversations, similar to, well, Anthropic Claude's Projects and also similar-ish to Google's Notebook LM. So right now this is initially available to ChatGPT Plus, Pro, and Teams subscribers. This feature enables users to create folders, customize project settings, and upload documents. This has been one of the lowest pieces of hanging fruit, I guess, right, since ChatGPT came out, right? So many people are like, it's hard to organize everything, right? Why can't we have folders? Well, you finally have folders with Projects, but it's a little bit more than that because you can also upload documents that are shared between all the different chats.
So again, if you've used Projects inside of Claude Anthropic, or Anthropic's Claude, it's the same thing, right? So think of it like this. You can have a project and you could, let's say, upload five documents inside of that project, and then you can create new chats, separate chats that all live within there, that share access to those documents as well. So I'm sure there's going to be updates to Projects in the future. And right now, unfortunately, at least as of like eight, nine hours ago, the new o1 model, you cannot put those chats or work with those chats inside Projects. Also good to know: you can move old chats into new projects. All right. So you can kind of right click or option click on your chats on the left-hand side and add them to a new project. So very useful. It's a feature that, honestly, I'm surprised it took this long, but I'm glad it's out. Yes, Miriam from LinkedIn says, thank goodness for Projects. We can tidy those chats up. Yes. But also you can upload. It's not, you know, kind of like what Dr. Harvey is saying here, you know, saying this is kind of like mini RAG. It kind of is, right? It is a way to bring your company's, or your first-party, your company data into ChatGPT and to work with that first. I have been doing some testing and so far, actually, it seems like Projects does this document retrieval process actually a little bit better than custom GPTs. So there's still huge benefits to custom GPTs. We'll save that for another day. All right, next piece of OpenAI news. Probably the smallest one of the week, but actually it should have been huge. But we've just heard about this for so long: Apple Intelligence. So yeah, OpenAI and Apple got super official with the release of Apple Intelligence powered by ChatGPT. So Apple has rolled out a significant update to its Apple Intelligence platform, focusing on AI-powered everything. So essentially now it has access to ChatGPT.
Siri is hopefully a little smarter. So this new ChatGPT integration works on the newest operating system. So that's iOS 18.2. Or if you are on a Mac, that's macOS 15.2. So the update introduces a ChatGPT integration with Siri, which is kind of funny, right? Siri has always been our AI assistant, but hasn't been very smart. Now Siri has an AI assistant, because essentially when you give a query to Siri and Siri doesn't know, Siri just like says, yo, ChatGPT, can you help me out with this? That's literally what happens for complex queries. Siri, slash Apple Intelligence, just essentially calls on ChatGPT, right? And there are settings within Apple's new updates where you can kind of have that process go automatically, otherwise it will prompt you. Yeah, funny, right? You're giving a query to what is supposed to be a smart Siri, and instead Siri prompts you to use another AI. So meta. And I'm not talking about the company. So like I said, users can now prompt Siri to use ChatGPT to answer complex queries and improve productivity across iPhone, iPad, and Mac devices. So, yeah, and then, you know, all the new writing tools are out, which, I don't know. All the Apple Intelligence stuff, to me, it's a big fat meh. You know, it's a nothing burger. This is all stuff that we had through other tools pre-ChatGPT. Right? We've had a lot of access to what we now have access to in Apple Intelligence, aside from some of the new stuff that I think is marketing at best, right? Like, oh, AI Genmoji. I don't care. I don't need to send custom emojis. I'm already bad enough at texting. So. Yeah, interesting. What do you all think? Yeah, love what Jackie here says. She says Siri is the middleman. Yeah, Siri's the middleman or middlewoman now. Just, you know, passing off all our queries to ChatGPT. That's funny. All right, now we're technically pivoting away; we got through the OpenAI news. Now let's get to non-OpenAI, non-Google news.
So there may be a cheaper iPhone that can handle Apple Intelligence. So according to reports, the upcoming iPhone SE 4 is set to include Apple Intelligence, leveraging Apple's A18 chip to bring advanced AI features to Apple's budget smartphone. So here's why this is pretty important. Well, right now, if you want a lot of these new Apple Intelligence features on your iPhone, you have to have one of the most powerful iPhones. You have to have an iPhone 15 Pro or higher. And in most cases that's more than $1,000. So Apple has for many years had a kind of budget version of its iPhone called the SE, which I believe is that standard edition. Does anyone out there know? I think it's standard edition. Anyways. Now this rumored new iPhone SE 4 may have essentially the chip required to run Apple Intelligence. So reports are saying that the phone could be between $499 and $599. So essentially if you do care about having Apple Intelligence, but you don't want to pay, you know, somewhere between $1,000 to, you know, $1,500, $1,800 if you want one with a lot of storage, there's a budget option. So pretty cool there. And I do think that that is actually going to move the needle. All right, moving on. Can we unplug AI? I don't know. Former Google CEO Eric Schmidt says it's not too late. So in a recent interview, the former CEO of Google, Eric Schmidt, has raised concerns about AI systems that can self-improve, suggesting it might be necessary to quote, unquote, unplug them once they reach this level, the level of being able to self-improve. So that's according to an interview that he was on, on ABC's This Week. So as the AI field rapidly evolves, Schmidt highlighted the unprecedented scale of innovation, warning of the possible unforeseen dangers and the need to carefully manage AI.
He emphasized the importance of America leading the global AI race, particularly against China, and suggests building a secondary AI system to monitor the first AI system for safety. Yeah, what happens when AI goes rogue? Well, you got to create another AI that is supposed to keep the AI from going rogue. Right. So he also predicted that AI systems capable of independent decision making could emerge within two to four years. Right. These are things that 10 years ago people said were 50 years away. The ability for AI to self-improve, self-heal, essentially when the next version of AI is built by AI. So someone that knows a thing or a million, the former Google CEO, said that that could be two to four years away. All right, well here's another thing that is really changing in the AI space. Klarna is making news again for not hiring humans and instead giving all of the human work to AI. So Klarna's CEO, whose name I'm definitely not going to get right, Sebastian Simiatowski. I didn't get that right. I'm sorry, Sebastian. So we'll just say Sebastian. So Sebastian said in an interview that the company has reduced its workforce from 4,500 to 3,500 over the past year. So essentially what's happening is, Sebastian said, well, there's natural attrition at any company, including Klarna, and instead of hiring 20% of its workforce, so he said in general, maybe about 20%, you know, you might have to rehire year over year. And instead, Sebastian says, ah, we're just not hiring people anymore. When people leave or when they retire or when we maybe terminate their contract or their position, we just don't rehire anymore. So they're down, and y'all, going from 4,500 to 3,500, that was a big drop-off. So despite the CEO's assertion that AI can replace human jobs, Klarna is still at least posting for over 50 roles right now. So indicating that there is still some reliance. Whether they're actually hiring for those or not, we're not sure.
So Klarna's hiring activity is mostly focused on backfilling essential positions in engineering rather than expanding its workforce. So as Klarna prepares for a potential IPO, the company is showcasing its AI integration to appeal to investors, though broader AI adoption remains gradual across industries. So, yeah, Klarna was one of the bigger companies earlier this year that essentially said, nah, we're just giving all or as many human roles over to AI as possible. So Klarna was one of the big, you could say, AI case studies of just essentially unabashedly handing over human roles, being like, nope, nope, we're not hiring humans anymore. We're just giving all these human roles over to AI. So it should be interesting to see, number one, if Klarna does IPO, are they going to hire more people? Right? Are they going to continue to reduce their workforce and rely more on AI? Well, time will tell. Are you still running in circles trying to figure out how to actually grow your business with AI? Maybe your company has been tinkering with large language models for a year or more, but can't really get traction to find ROI on Gen AI. Hey, this is Jordan Wilson, host of this very podcast. Companies like Adobe, Microsoft, and Nvidia have partnered with us because they trust our expertise in educating the masses around generative AI to get ahead. And some of the most innovative companies in the country hire us to help with their AI strategy and to train hundreds of their employees on how to use Gen AI. So whether you're looking for ChatGPT training for thousands or just need help building your front-end AI strategy, you can partner with us too. Just like some of the biggest companies in the world do. Go to youreverydayai.com/partner to get in contact with our team, or you can just click on the partner section of our website. We'll help you stop running in those AI circles and help get your team ahead and build a straight path to ROI on Gen AI. More big price tags, right?
If you thought that $200 ChatGPT Pro price tag was a lot, well, how about Devin from Cognition, which is now available for $500 a month. So Devin is an AI tool for engineering teams. It is now generally available at $500 a month, offering no seat limits and integrations with Slack, different IDEs, and APIs. So essentially, if you haven't heard of Cognition's Devin, what they're really kind of positioning it as, well, it's a junior developer. All right. So I think there's so many great tools out there, right? You got to tip your hat. Claude is great at coding. You have GitHub Copilot. Now you have o1, right, from OpenAI. You have all these large language models that are great at individual coding tasks. So Devin aims to be a little different. It's more of like, hey, you don't have to keep prompting, just give, you know, give your files, give your commands, come back later, it'll be done kind of thing. So it kind of grabbed a lot of headlines when it was first teased many months ago. So now it is generally available. So a lot of companies have been wanting to get their hands on Devin from Cognition, but they haven't. Now you can. So recommended uses include handling small front-end bugs, creating first draft press releases, and making targeted code refactors, enhancing workflow efficiency. So Devin has successfully assisted in real-world scenarios like resolving issues in open source projects, adding features to libraries, and fixing bugs in various repositories. I don't know, any people in software development out there? I would love to hear directly from you all. Is Devin, is it shaking up your industry? Right? If you are in software development, if you're a software engineer, if you're a coder, web developer, etc. I don't know, is Devin something you're excited to. Is it kind of like Klarna in the dark, right? Or you're like, this thing's going to maybe take my job, or I'm only going to be using Devin? Yeah, let me know. Super curious. Yeah, Dr.
Harvey was saying, my business partner was telling me about Devin AI. Tara's saying, I wonder how it compares to Replit. Yeah, Replit's got a great AI agent. I mean, there's so many now in this space. You have Windsurf, which is another newer kind of tool. You have Cursor. I mean, there's so many great kind of AI coding tools now that essentially connect right to your database. So it's no longer that you have to copy and paste two ways, right? Copy the code into a large language model, work with it, go back and forth, copy that code out, put it back into your repository, into your software development stack. No, it just connects directly now. So that should be one worth keeping an eye on there. All right, Microsoft very quietly unveiled Phi-4. So that's Phi-4, in case you're looking it up, a new language model, a small one. So Microsoft Research introduced Phi-4, a 14 billion parameter language model designed for efficient reasoning tasks, offering a competitive edge over even larger models on certain benchmarks. So yes, this mini, mini model, right, a 14 billion parameter model, is outpunching GPT-4o and Llama 3 in certain benchmarks like math. All right, so just for reference, GPT-4o is reportedly 1.8 trillion parameters. So this 14 billion parameter model, that is a fraction, that is like 1% of the size, and it's already outpunching. This is where AI is going. I've been saying this for a long time. Small models. I do think in the future we are going to be working with hundreds of specialized small models. We're not going to be working with a jumbo model, or I think all that jumbo model is really going to do is it's going to handle some general tasks, but eventually it's just going to pass your query on. I think these jumbo models are going to have hundreds or thousands of smaller models housed within them that are built for specialized tasks. So Phi-4 utilizes high-quality synthetic data.
So going along with the trend, like Meta, using synthetic data or AI-generated data to help create it. So the model's post-training refinement includes direct preference optimization, which enhances output accuracy and usability, making it practical for real-world applications. So like I said, Phi-4 excels in benchmarks like GPQA, MATH, and HumanEval, showing its advanced problem solving capabilities and validating its utility in real-world math competitions. All right, I gotta take a sip of the coffee because here we go. We are done with the OpenAI news. We are done with the biggest news that is not OpenAI and not Google. And now we are officially on to the Google portion, y'all. So OpenAI, I think, did a great job, right? They had this, you know, 12 days of OpenAI. So we are seven days in, we have five days left to go. Google didn't really create any marketing, any messaging, seemingly any real strategy around what was dropping. So their head of development, Logan Kilpatrick, I tweeted at him like two weeks ago and he essentially said, yo, we're going to be releasing so many new updates in the coming weeks. I didn't really think of it. I, like, I didn't think of anything, right, because the first kind of week of OpenAI's, or the first couple of days, right, we didn't hear anything from Google. And then the middle of last week, Google went bananas. B-A-N-A-N-A-S. Like, they went wild. And I will say this, a lot of the stuff that Google quote unquote released, it's not released. So in typical Google fashion, we got some great stuff and then we got some, you know, some teases, some updates. So I think OpenAI brought us more things that we can use today. Google, though, had, I think, their best three days in the last three years. All right, let's go over it very quickly because we got a lot. All right, so first, Gemini 2.0, big jump. And Google is entering what it calls the agentic era. 
So Google DeepMind has launched Gemini 2.0, a highly advanced AI model designed for the evolving, quote unquote, agentic era, offering significant enhancements in multimodal capabilities, including native image and audio output. So the new model that's being released initially is the Gemini 2.0 Flash. So we don't have the big boy, we don't have Gemini 2.0 Pro, we don't have Gemini 2.0 Ultra. We have Flash, which is supposed to be the cheap, fast model. And guess what, it is already outbenching the big boy 1.5. So even though it is a Flash model, which is supposed to be similar to OpenAI's mini models, right, you think of it as, ah, not very powerful. It's just the fast, the cheap version that's great for API use. No, this thing is a banger. All right, so the new model Gemini 2.0 is also being used across all of its new products. So we're going to talk about a couple of these or most of these, but like Project Astra, Project Mariner and Jules. So Jules is essentially a new AI coding tool. We're not going to get too much into that today, but big news there, we do have Gemini 2.0, at least the Flash version. So what I would assume in the coming months is that we're going to see the Gemini 2.0 across the Pro or the Ultra. As do all companies, Google does have some unnamed new models that are being tested out in the wild on the LMSYS Chatbot Arena leaderboard. So we're gonna have more than just Flash pretty soon. Yeah. Thank you, Tara. Got my Gwen Stefani reference. I don't know if I age myself. I don't know if anyone else got that. It was accidental. I promise. When I'm making these random cheesy quips, they're not planned. I'm just a dork. All right, next. Well, if you are a dork, you're gonna like this. From Google. They launched Agentspace for enterprise AI solutions. 
So Google Cloud has introduced Google Agentspace, a multimodal search agent aimed at enhancing enterprise operations by integrating advanced AI reasoning and search capabilities. That was a mouthful. So this platform allows businesses to create a company-branded search agent. Right. This is wild. All right. Providing conversational assistance and proactive support through integration with tools like Google Drive, Confluence, and Microsoft SharePoint. Yeah, you can even work with your Microsoft tools over there in Agentspace. So employees can access AI agents and use low-code tools to build custom expert agents with embedded features such as Gemini for advanced reasoning and integration with image and video generation tools. So not everyone can get it. Right. Yeah, Google, here we go. Typical Google go-to-market. I'm not a fan. I think OpenAI is crushing it. And go-to-market aside, you know, I think the Sora left a little bit to be desired. The advanced voice mode with video, right, having to wait multiple months. But everything else, you know, OpenAI go-to-market, great. Claude just ships. They don't even really bring much marketing around it. Right. But if you want this Agentspace from Google, sorry, you can sign up right now for early access. Are you going to get it? Probably not. But you can go at least sign up. Who knows, maybe you're already in Google's good graces. I know Google does have their kind of trusted tester program for individuals and I think they have something similar for enterprise organizations. So who knows, you may be able to go get it right now, and we'll see over time if this is kind of a one-to-one competitor with Microsoft, Microsoft 365 Copilot's Copilot Studio. It does look like that's what it is, but we don't really have any great information on this right now, because so few people have this. Right. I'm trying to see a bunch of reviews about Agentspace and there's nothing out there. 
So I don't know if, you know, two companies have access to this, if 200,000 companies have access. But you can go at least, in Google fashion, you can go put your name on a wait list. All right, speaking of wait lists, there's updates to Project Astra, but it's still not available. All right, so Google did showcase some new updates to Project Astra, its advanced AI assistant, that offer a glimpse into how AI can assist in navigating the physical world. So Astra, well, let me just say it in basic terms. You're gonna probably, in the end, you will be wearing glasses. Are glasses coming back again? I don't know. Well, Google, Meta, everyone else, they're really going all in on these glasses. Even though the Google glasses of, I don't know what that was, 10 years ago didn't really work. Right. But essentially, Google showcased some updates to Project Astra, which, think of it as a live Gemini, right? But it can see what you see. So at its I/O conference, Google first demoed it mainly with the app, with the Google Gemini app. So if you are one of Google's few trusted testers, you'll have access to that, which not many people are. And I do believe it does require a new Google smartphone from Samsung as well. But for everyone else, you can still go get on a wait list. But think of it like this. It looks like what Google's trying to do here is bring the glasses back. I don't know if glasses are the form factor of AI. I think for limited use cases, great. My wife actually just got me the Meta glasses. I've been so busy, I haven't even been able to try them out yet. I think for certain spurts, they're great. But I don't know, I don't know what these Google ones, right. The way that they're kind of marketing them is, oh, you should be wearing them all day, right. Because they can navigate you. You can have like, you know, essentially, you know, projecting things onto the screen of these glasses. So I don't know how realistic that is. Right. 
And these are a little heavier. Right. The thing I like about the Meta Ray-Bans, they look just kind of like Ray-Bans, right. They don't look like these big fat thick things. Right. A lot of these newer quote unquote smart glasses, the ones from Meta as well, the other ones, not the Ray-Ban collaborations, I think that one's the Orion, they're big, fat, thick things. So I don't know, I don't know if people are going to want to wear around these, you know, super thick, heavy glasses. I don't know. Tara says she wants them. Dr. Harvey says, I think Meta AR VR glasses is more of the future. We shall see. Speaking of glasses, well, these two, Project Astra and Android XR, kind of go hand in hand, because when I'm talking about smart glasses, that's kind of where we're headed here. So Google, in collaboration with Samsung and Qualcomm, announced Android XR, a new platform for extended reality devices, including both headsets and glasses. So this brings the AI-driven Gemini assistant right straight to your eyes, straight to your ears. And it is central to the Android XR platform, being able to understand user intent and assist with tasks such as planning and research through conversational interaction. So Android XR will debut first on headsets, with Samsung's Project Moohan expected next year, offering immersive experiences like virtual big screens for various apps. The platform invites developers to utilize familiar tools for creating diverse XR experiences, aiming to build a robust ecosystem for new devices. So, yeah, obviously Apple's Vision Pro, that thing flopped before it came out. I said, this thing's gonna flop. I said, this is gonna be the least successful Apple device ever. And literally no one, no one bought it, right. 
There's reports that they're, you know, now really, you know, slowing down production, and they might not be updating it as much as they originally thought, because, who would have thought, there's not a bunch of people with an extra four grand in their pocket that want to wear a 20 pound headset on their head. So, I mean, we'll see what the Android XR, it's obviously way cheaper, it looks way lighter, but again, I think maybe for certain people, right, if you're working at home, maybe something like that could be great. But for out in the real world, I don't know, I'm not sold yet on wearing around, you know, an AR, XR, mixed reality headset. Right? So what that is is, you know, it's a headset. You wear it and it projects, you know, you can see both what is happening in the real world, but then it has this mixed reality, this XR element to it, and then it brings in this conversational agent with Gemini. Couple more stories, and I'm saving the three biggest ones for last. Yeah, we're still on the Google segment. My gosh, I said they went all Gwen Stefani on us. So NotebookLM, yes, one of my favorite tools, what I would say has to be in the running for AI tool of 2024. NotebookLM from Google has some big updates that are rolling out. So the biggest one is a new paid tier, right? So it's been free. Now there is a paid tier. So Google has launched NotebookLM Plus, enhancing its popular app with features aimed at enterprises, teams and individuals who use the app's research tools extensively. So I believe, I've been chatting a little bit on Twitter with some of the team there from Google's NotebookLM, and it seems like if you are already on the paid version of Google Gemini that you will have access to this. So it does seem like it is both a personal premium plan, but also a team premium plan as well. So being able to share this with businesses, with, or, sorry, within your team members. 
So I have obviously multiple paid Gemini accounts, both on my personal Gmail and on my Google Workspace plan. I haven't seen this roll out yet. So, you know, who knows, it may be rolling out soon, but it does look like it's already being gradually released. The big feature that I'm really looking forward to is the updated audio feature. So essentially now you can, quote unquote, call in or buzz in, right? So if you haven't used NotebookLM, it is a state-of-the-art RAG model, right? If you don't upload your data, you literally can't use it, right? Which is really cool. I like that. I think more AI models should at least have an option to operate like that. But there's always been this kind of deep dive podcast, right? So you can put in millions of words, literally, you can put in millions of words of something you're trying to learn about, your company's data, whatever, click a one-click audio overview, and it creates a cool personalized podcast, right? With two hosts that seem human-esque, right? So now, the last update about two months ago, you could customize or give instructions to the AI hosts. So now you can, quote unquote, call in, which looks like a groundbreaking feature for learning. All right, so essentially as you're listening to the Deep Dive, you can essentially interrupt them and ask a question. You can say, hey, what does this mean? Or hey, could you explain that a little more and maybe use a basketball reference, on the fly. So again, my brain hurts, and I thought about this a lot since it was announced. I don't think people are talking about this enough. I think this one little feature, this wasn't even the big update, right? The big update here is now there's a pro plan and you get five times the limits. That's great, right? And you can share all this with your team. That's great. There's a new, you know, writing pane to create content, you know, so now it's going to this three-tier pane, which I think is really cool. 
So now it's also turning into a content creation tool and not just a learning tool that you can ask questions on. But I think this kind of call-in feature is one of the most exciting small features that everyone is overlooking. So don't sleep on that. Yes, Jackie, now. Hey, now the live stream audience is playing along with me. Jackie says, call in? That's bananas. Yeah, old school radio style. All right, two more, and I think again I saved the best three for last. Next. Don't worry about the headline here, live stream audience. This is Google's Project Mariner, a new AI agent navigating the web. So this was previously codenamed Jarvis. We talked about it on the show a couple of times. But now this is, again, it's released to, quote unquote, trusted users, which I think is like hardly no one, but it is starting to roll out. So Google's new Project Mariner is powered by Gemini 2.0 and it is an AI agent that performs Internet tasks through the Chrome browser. So it is a Chrome extension that just does things for you on the web. So very similar to Anthropic's computer use, which we demoed here on the show a couple of months ago. However, computer use, super buggy, it's very technical. You actually have to download and install multiple programs, right? You have to download or, you know, you have to grab a bunch of information off GitHub. So it's not for non-technical people. You have to be pretty technical to use Claude Anthropic's, or sorry, Anthropic Claude's computer use. So Project Mariner, it's a Chrome extension, right? And then you essentially say, yo Mariner, go do a bunch of this stuff, and then it goes and does it. The big caveat here though, or the big downside, is it only works in your active Chrome tab. Okay. 
So I mean, what this means is I'm not sure if it's going to be able to work with, like as an example, split screen monitors, but once this comes out, I'm going to be using it all the time because I have a bunch of extra computers sitting around, lying, collecting dust, right? So one of them, if you can only work with the active tab, so you can't really necessarily do a lot of other work, at least inside Google Chrome, right? You can always open up Edge, which I like. Microsoft Edge browser, based on Chromium. But an AI assistant coming soon, powered by Gemini 2.0, that can essentially just do whatever you tell it, right? Go do this research, you know, go to this website, find the price on this, right. Make sure these criteria are met, and it just does all that for you. All right, let's see. Do we just have one more? Oh, no, we have multiple. All right, Deep Research. I didn't do a good job at updating my headlines today for our live stream graphics, but Google also announced Deep Research. And I'm telling you, y'all, this thing, Perplexity, gosh, Perplexity is on notice. All right, so Deep Research is a new AI tool available to Gemini Advanced subscribers. So the paid plan. It is designed to generate detailed reports by scouring the web for relevant information. So the tool uses Google's Gemini bot to create a multi-step research plan, allowing users to edit or approve the process as it finds and compiles key information from various sources. Then once the research is complete, users receive a report with key findings and links to original sources, with the option to expand on specific areas or export the report to Google Docs. I used this the minute it came out. Not the minute. Well, the minute I saw, I'm like, wait, this seems very much like Perplexity. I went and used it. It is so much better than Perplexity. Perplexity, I've talked about it very recently, it's gone downhill recently. 
I think a lot more hallucinations, the quality I think has gone downhill, especially since they now introduced the new shopping feature. Right. So much of what I use Perplexity for, it's comparing different products and services, right? That's something I would generally go to a lot of different websites for. And now, instead of doing that work, Perplexity, with its new shopping feature, just shoves products down your throat and it doesn't actually always adhere to the prompt that you give it, right? A lot of times I'm trying to use it to research five different products. You know, make me a chart, you know, show me the pros and the cons. Show me who it's for, who it's not for. Instead of doing that, Perplexity now, with this new shopping mode, it just, it needs guardrails, right? But it essentially, instead of answering questions consistently, it instead just shoves products down your face. Right? But this new Deep Research, y'all, I'm not kidding. I used this to help me research one of my shows last week. A single prompt inside. Again, you have to have the paid plan. It visited, all right, and I'm not exaggerating here, 169 websites. All right, I gave it one prompt to do a bunch of research for me. It visited 169 websites. Took about two or three minutes. Can you, can you guys imagine that? Right? We've been blown away, right, rightfully so. Perplexity, great. You know, Perplexity might go anywhere from, you know, six to 20 websites. ChatGPT with a new GPT search, pretty good, you know, can handle five to 10 websites. This did 169. All right. Saving what I think might be the best for last, Google AI Studio has released, yes, released, no wait list, released real-time vision and voice with its new multi, excuse me, Multimodal Live options. So Google AI Studio has introduced Stream Realtime, allowing users to interact with Gemini via voice and vision, providing spoken responses and visual analysis of your screen or camera feeds. 
So this new feature positions Google ahead of competitors like OpenAI by offering a fully integrated, vision-enabled voice mode, enhancing user experience on both desktop and mobile platforms. So alongside this, AI Studio also launched some starter apps, including Map Explorer and Video Analyzer, showcasing the capabilities of the Gemini API and also available for exploration on GitHub. So I played around with this a little bit over the weekend. Let me know if we should cover this a little bit more. Without getting into a rant, this isn't available on Gemini. Right. I still don't understand. Luckily, Google Gemini on the front end finally got updated. Right. So you do have this Deep Research that I just talked about. Thank you for bringing that to Gemini. You have Gemini 2.0 Flash now. But previously, Gemini, really all it had was old models. Right. Google itself said that usually those models are between three to nine months old, which in AI years is like so far behind. So most of Google's new, you know, AI features and innovation, you have to go into Google AI Studio, not its front-end Gemini chatbot. But I would encourage you to do this, because all these things that we've been waiting for from OpenAI with advanced voice mode, right? The ability to share your screen, the ability for it to interact with video. Well, now Google AI Studio does this already, right? So we don't even have it yet on desktop. I do think that that may be rolling out this week or next for OpenAI's ChatGPT advanced voice mode, but we have it now for Google. That's it, y'all. I can't even recap these because we had like a hundred stories, but I hope this was helpful. If so, please click that repost button. Share this with your friends. We put in so much work making sure you are the smartest person in your company at AI, making sure you can outsmart the future with us. So please, if this is helpful, share this with your friends. 
If you're listening on the podcast, please, you know, there's all the nice little share buttons, but first, you know, make sure you subscribe and follow the show on Spotify or Apple Podcasts. Leave us a rating if you can, but share this, share this episode with your friends, family, co-workers, your neighbors, babysitters, boyfriends, dog walker, whoever it is. Because we all need to understand AI and that's what we do at Everyday AI. You don't have to have a PhD in machine learning to stay ahead. You just have to tune in with us every day. Thank you for tuning in. Make sure to go to youreverydayai.com. Sign up for the free daily newsletter. We'll see you back tomorrow and every day for more Everyday AI. Thanks, y'all.
[50:40]
A
And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going for a little more AI magic. Visit youreverydayai.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.