
Loading summary
A
Ever feel like just keeping up with AI and tech news is like a full time job? I mean, it's, it seems like new models are constantly dropping, pricing structures are shifting, and then you've got studies that make you question like everything.
B
Yeah, it can be a real whirlwind trying to stay on top of it all.
A
Absolutely. So that's what this deep dive is all about, helping you cut through all that noise and figure out, like, what really matters.
B
Right. We're going to be your guides today, helping you sort through some of the latest and greatest in AI and digital platforms.
A
Yeah. So basically we've been digging into some recent news and we found some really fascinating stuff covering, like, new AI models, changes in how these AI coding assistants are priced, even a study that looks into the data that's actually used to train these AIs, and then some big updates to how advertising works on Spotify.
B
Lots to cover.
A
Okay, so to kick things off, let's talk about Meta, because they just dropped a whole new lineup of AI models called Llama 4. And they've got some interesting names like Scout, Maverick and Behemoth, which is still in the works, but they didn't even wait for like a normal workday. They released all this on a Saturday.
B
That's right. Straight into the weekend.
A
Yeah. And what really caught my eye was that they're saying these models have like this broad visual understanding because they were trained on so much data, like text, images, video, all unlabeled.
B
Yeah. And you know, that unlabeled part is really interesting because usually AI models, they learn from this really carefully organized and labeled data. Right, but training on unlabeled data, that's a whole other approach.
A
Right, so it's more like letting the model just figure things out on its own.
B
Exactly. It's about finding patterns and connections without being told exactly what to look for. And then saying it has broad visual understanding that suggests it's getting much better at understanding images alongside text and video.
A
You know, and the word on the street is that Meta was in a real rush to get these models out there.
B
Yeah.
A
Apparently those open source models coming out of China, especially from a company called Deepseek, they kind of lit a fire under Meta.
B
That makes sense. The competition is really heating up.
A
Yeah, and I heard they even had teams like, dedicated to figuring out how Deep Seek managed to make their operations so cost effective.
B
That's the key. Right now it's not just about making a better model, but making one that's actually practical to use.
A
Right. So if you're a developer or just, you know, curious to tinker around. Scout and Maverick. Those are already available on llama.com and through platforms like Hugging Face.
B
Yep. You can get your hands on them right now.
A
And then they've also got Meta AI, which is their assistant that's built into, like, WhatsApp, Messenger, Instagram, and it's now powered by llama4 in a lot of countries.
B
That's a pretty big move right there.
A
Yeah, but it looks like those really fancy features that work with, like, all different types of data, you know, that multimodal stuff that's only in the US and only in English for now.
B
Yeah. They're probably still working out the kinks, you know, making sure it works smoothly in different languages and all that makes sense.
A
But here's where it gets really interesting. The llama 4 license.
B
Okay, what about it?
A
Well, it looks like they've put some restrictions in place. If you or your company is based in the eu, you can't actually use or distribute these models.
B
Wow, that's a big deal.
A
Yeah. And just like with the previous Llama models, if your company has over 700 million monthly active users, you need a special license directly from Meta.
B
So they're really keeping a tight rein on who can use these models, especially the really powerful ones.
A
Exactly. And on a more technical note, this is actually Meta's first time using what's called a mixture of experts, or moe, architecture for their models.
B
And moe, that's a really interesting approach. It's all about efficiency.
A
Yeah. So instead of having one massive model that does everything, they break it down into these smaller expert models.
B
Right. It's like a team of specialists, each one really good at a specific thing.
A
So when the model gets a task, it figures out which expert is best suited and sends that part of the problem to them.
B
And that way they can have this huge overall model with tons of knowledge, but they don't have to use all of it for every single task.
A
Right. So it's like having a brain surgeon, a rocket scientist and a chef all on call, but you only have to pay for the one you need at that moment.
B
Exactly. It's much faster and uses way less computing power.
A
Okay, so let's talk specifics. Maverick. This one has, like, a whopping 400 billion parameters, but only about 17 billion are active at any one time. And those are spread across 128 different experts.
B
Yeah. And Meta saying, this one is great for general tasks like chatting and creative writing, you know, your everyday AI stuff.
A
Right. And they're even claiming it beats out some pretty well known models like GPT4.O and Gemini 2.0 on certain tests.
B
Those benchmarks are always interesting. Right. It gives you an idea of how they stack up, but it doesn't tell the whole story of how a model performs in the real world.
A
Right, then we've got Scout. This one's a bit smaller. 109 billion total parameters, 17 billion active, cross 16 experts. And this one's strengths are in summarizing documents and really getting deep into code.
B
That's a critical area these days.
A
And get this, it has a 10 million token context window.
B
Wow. Okay, so for those who aren't familiar with tokens, that's like being able to read and understand several entire books at once.
A
Yeah, and apparently it's efficient enough to run on a single high end Nvidia.
B
Graphics card, so that makes it much more accessible for researchers and developers who don't have access to these massive data centers.
A
Now, Behemoth, this one's still in training and the numbers are just mind blowing. Nearly 2 trillion parameters. 288 billion active across 16 experts.
B
That's getting into some serious, serious territory.
A
And Meta's internal tests show that it's already beating out some of the top models like GPT 4.5 and Claude 3.7 Sonnet, especially when it comes to, like, STEM stuff.
B
That's pretty impressive. But you know, with that many parameters, it's probably going to need some serious hardware to run.
A
Yeah, definitely not something you're going to be running on your laptop. But here's an interesting thing. Meta specifically says that none of these Llama 4 models are quite like some of the other reasoning models out there.
B
Right. Those reasoning models are designed to be more like fact checkers, giving really reliable answers, even if it takes a little longer.
A
And then one last thing about Llama 4, Meta is claiming they've tweaked these models so they're less likely to refuse to answer certain types of questions.
B
Hmm.
A
And they're supposed to be better at giving more balanced perspectives on controversial topics.
B
Well, you know, it's a tricky thing trying to make sure these AI models are both helpful and unbiased.
A
For sure. Okay, so let's shift gears now and talk about GitHub Copilot, which is that AI coding assistant that a lot of developers love.
B
Yep, I've heard good things about it.
A
So they're introducing this new pricing structure with something called premium requests. And it's basically like this. If you want to use those more advanced AI models for certain tasks, you Know, like doing really complex coding or editing multiple files at once. You have a limited number of these premium requests each month.
B
Okay, so they're basically creating tiers of access based on which AI model you're using.
A
Exactly. So if you're using the standard model, which is OpenAI's GPT4.0, you've got unlimited requests. But if you want something fancier, like Anthropic's 3.7 sonnet, then you've got to watch those premium requests.
B
And those limits depend on your subscription level.
A
Right, so Copilot Pro subscribers get 300 premium requests a month, business users get 300 as well, and enterprise subscribers get 1,000.
B
Okay, makes sense. And if you run out, you can always buy more. Yep.
A
Or you can upgrade to their new CoPilot Pro plus plan, which costs $39 a month, but gives you 1500 premium requests and access to what they're calling the best models, including GPT 4.5.
B
So they're definitely betting that there are enough people willing to pay for the premium experience.
A
And it seems like Copilot is a pretty big deal for GitHub's bottom line. I read that Microsoft CEO Satya Nadella said that Copilot was responsible for over 40% of GitHub's revenue growth this year and that it's already bigger than GitHub was when Microsoft acquired it.
B
Wow, that's a huge impact. It really shows how these AI tools are transforming the developer world.
A
Okay, so now for something completely different. There's a new study out that's adding fuel to the whole debate about whether AI models are being trained on copyrighted material.
B
Yeah, this is a big issue with all the lawsuits from authors and programmers.
A
Right. So these researchers, they came up with a clever way to investigate this and see if AI models are actually memorizing copyrighted content.
B
It's like trying to peek inside the black box of AI.
A
Yeah. So they focused on what they call high surprisal words, which are basically words that are unexpected in a given context.
B
So like a word that you wouldn't normally see in that particular sentence or phrase?
A
Exactly. They took snippets of text from fiction books and New York Times articles, and they hid these high surprisal words, and then they asked the AI models to guess the missing word.
B
Interesting. So if the model could consistently guess those unusual words, it would suggest that it had actually seen that specific passage before during its training.
A
Yep. And they tested this on models like OpenAI's GPT4 and GPT 3.5. And what they found was that GPT4 did show signs of memorizing parts of popular fiction books, especially those from a data set called Book mia, which has a bunch of copyrighted e books.
B
Okay, so there's some evidence there.
A
And they also found evidence of memorization from New York Times articles, although not.
B
As much that makes sense since the New York Times has a massive amount of content.
A
One of the study's authors, Balasha Ravachandra, she said that we need to be able to really examine these AI models and audit them scientifically. And she called for more transparency from companies about what data they're using to train their models.
B
Yeah, it's a valid point. If we're going to trust these AIs, we need to understand where their knowledge is coming from.
A
Okay, last but not least, let's talk about Spotify. They've just announced some major updates to their advertising platform and it seems like they're trying to make things easier for advertisers to buy, create and track their campaigns.
B
So they're really investing in their ad business.
A
Yeah. The biggest news is the launch of something called the Spotify Ad Exchange or ssx. And this allows advertisers to access Spotify's users through real time auctions. And they're already partnering with big names like the Trade Desk and Google display and video 360.
B
So they're moving towards that programmatic advertising model where advertisers can bid on ad space in real time.
A
Exactly. And are also making improvements to their self service Spotify Ads Manager, giving advertisers more control over targeting and measurements. And then there's something called Spotify Gen AI Ads which lets advertisers in the US and Canada use AI to generate ad scripts and even voiceovers.
B
Wow. So you can just tell the AI what you want to advertise and it'll create the whole ad for you?
A
Pretty much. They're also extending their measurement tools and partnering with third party measurement providers.
B
So they're really trying to give advertisers that confidence that their Spotify ads are actually working.
A
Exactly. And Spotify's global head of advertising, Lee Brown, he talked about how Spotify's free users are engaged for like two hours a day on average. And that Gen Z specifically sees Spotify as a positive escape from all the negativity online.
B
Yeah, I've heard that too. It's their little sanctuary from the doom scrolling.
A
Yeah, in their 2024 Culture Next report, they found that 71% of Gen Z said Spotify is like an antidote to doom scrolling.
B
So it's more than just a music platform for them. It's like a positive space.
A
And then quickly they've got these in house agencies, Creative Lab and aux. Creative Lab works directly with brands on campaigns and AUX is their music consulting agency.
B
So they're offering a full suite of services to advertisers.
A
Right. They're not just selling ad space, they're trying to be partners in creating those connections with listeners.
B
And that Gen Z audience is a prime target for a lot of brands.
A
Okay, so wrapping up this deep dive, we've covered a lot. Meta's new Llama 4 models with all those interesting licensing issues, GitHub, Copilot's new pricing structure, the study about AI training data and copyright, and Spotify's big push to attract advertisers.
B
It's a lot to take in, but it really shows how dynamic this whole AI and tech landscape is right now.
A
Absolutely. So for all you listeners out there, what part of all this do you think will have the biggest impact in the near future? Or maybe what are you most curious about as this all continues to unfold?
B
Yeah, it's a question worth pondering. And we encourage you to keep exploring these topics. Maybe go back and read those original articles and just keep that curiosity going.
A
Understanding all this. It's an ongoing process, but hopefully this deep dive gave you some food for thought.
AI Deep Dive – Episode: Meta’s Llama 4, GitHub Copilot’s New Pricing, and Spotify’s AI Ads
Release Date: April 6, 2025 | Host: Daily Deep Dives
In this episode of AI Deep Dive, the hosts from Daily Deep Dives navigate through a whirlwind of the latest advancements and shifts in the artificial intelligence landscape. From Meta's launch of the Llama 4 AI models to GitHub Copilot's revamped pricing structure, a groundbreaking study on AI training data, and Spotify's innovative AI-driven advertising platform, this episode is packed with insightful discussions, expert opinions, and notable industry updates. Whether you're a technophile, a developer, or simply curious about AI's evolving role, this summary will keep you informed and ahead of the curve.
Release and Features
Meta has unveiled its latest suite of AI models under the banner Llama 4, introducing models with distinctive names such as Scout, Maverick, and the upcoming Behemoth. The sudden release on a Saturday underscores Meta's aggressive strategy to maintain its competitive edge, especially in light of emerging open-source models from Chinese companies like Deepseek.
A [00:54]: "Meta just dropped a whole new lineup of AI models called Llama 4. And they've got some interesting names like Scout, Maverick and Behemoth...released all this on a Saturday."
Training and Architecture
Unlike traditional models that rely on meticulously labeled data, Llama 4 leverages unlabeled data encompassing text, images, and videos, fostering a broader visual understanding. This approach allows the models to identify patterns and connections autonomously.
B [01:10]: "It's about finding patterns and connections without being told exactly what to look for. And then saying it has broad visual understanding that suggests it's getting much better at understanding images alongside text and video."
A pivotal innovation in Llama 4 is the adoption of a mixture of experts (MoE) architecture. Instead of a single monolithic model, Llama 4 comprises specialized sub-models or "experts" that handle specific tasks, enhancing efficiency and reducing computational overhead.
B [03:37]: "And moe, that's a really interesting approach. It's all about efficiency."
Licensing and Accessibility
Meta has instituted strict licensing restrictions for Llama 4. Companies based in the EU are barred from using or distributing these models, and organizations with over 700 million monthly active users must obtain a special license directly from Meta. This tight control underscores Meta's intent to manage the distribution and utilization of its powerful AI models carefully.
A [03:03]: "Meta is claiming they've tweaked these models so they're less likely to refuse to answer certain types of questions."
B [03:12]: "That's a big deal."
Model Specifications
Maverick: Boasts 400 billion parameters with 17 billion active at any time across 128 experts, suited for general tasks like chatting and creative writing, outperforming models like GPT-4.0 and Gemini 2.0 in specific benchmarks.
B [04:29]: "Yeah. And Meta saying, this one is great for general tasks like chatting and creative writing, you know, your everyday AI stuff."
Scout: Features 109 billion total parameters, 17 billion active, across 16 experts. Excels in document summarization and deep code analysis with a 10 million token context window, making it highly efficient and accessible for developers.
A [05:02]: "That's a critical area these days."
Behemoth (In Development): Nearing 2 trillion parameters, with 288 billion active across 16 experts, targeting high-performance tasks in STEM fields, though requiring substantial hardware resources.
B [05:34]: "That's getting into some serious, serious territory."
Bias and Ethical Considerations
Meta has also focused on reducing the instances where Llama 4 models refuse to answer certain questions, aiming for more balanced perspectives, especially on controversial topics. This tweak is part of a broader effort to enhance the model's reliability and fairness.
A [06:00]: "Meta is claiming they've tweaked these models so they're less likely to refuse to answer certain types of questions."
Introduction of Premium Requests
GitHub Copilot, the AI-powered coding assistant beloved by developers, has introduced a tiered pricing structure incorporating premium requests. This move differentiates access based on the complexity of tasks and the sophistication of the underlying AI models.
A [06:32]: "So they're introducing this new pricing structure with something called premium requests...if you're using the standard model, which is OpenAI's GPT4.0, you've got unlimited requests."
Subscription Tiers and Limits
A [06:52]: "They have a limited number of these premium requests each month."
B [07:36]: "So they're definitely betting that there are enough people willing to pay for the premium experience."
Impact on GitHub’s Revenue
GitHub Copilot has become a significant revenue driver for GitHub, with Microsoft CEO Satya Nadella highlighting its contribution to over 40% of GitHub's revenue growth this year, surpassing even the platform's pre-Microsoft acquisition performance.
B [07:11]: " Copilot is a pretty big deal for GitHub's bottom line. I read that Microsoft CEO Satya Nadella said that Copilot was responsible for over 40% of GitHub's revenue growth this year and that it's already bigger than GitHub was when Microsoft acquired it."
Strategic Implications
This updated pricing strategy underscores the value of premium AI tools in the developer ecosystem and GitHub’s pivotal role in facilitating efficient coding practices through advanced AI assistance.
Investigative Approach
A recent study has reignited discussions on whether AI models are memorizing copyrighted content during training. Researchers employed a method involving high surprisal words—unexpected or rare words within a given context—to determine if models like GPT-4 and GPT-3.5 could accurately predict these missing terms, indicative of memorization.
A [08:20]: "So they focused on what they call high surprisal words...asked the AI models to guess the missing word."
Findings
A [09:09]: "GPT4 did show signs of memorizing parts of popular fiction books, especially those from a data set called Book mia, which has a bunch of copyrighted e books."
Expert Commentary
Balasha Ravachandra, one of the study's authors, advocates for increased transparency and scientific auditing of AI models to ensure ethical training practices and to build trust in AI applications.
A [09:17]: "One of the study's authors, Balasha Ravachandra, she said that we need to be able to really examine these AI models and audit them scientifically."
B [09:29]: "Yeah, it's a valid point. If we're going to trust these AIs, we need to understand where their knowledge is coming from."
Ethical Implications
The study emphasizes the necessity for clearer guidelines and transparency from AI developers regarding training data sources to address ongoing legal and ethical disputes related to copyright infringement.
Spotify Ad Exchange (SSX)
Spotify has launched the Spotify Ad Exchange (SSX), enabling advertisers to access Spotify’s user base through real-time auctions. Collaborations with industry leaders like The Trade Desk and Google Display and Video 360 are pivotal in advancing Spotify’s move towards a programmatic advertising model.
A [09:48]: "The biggest news is the launch of something called the Spotify Ad Exchange or ssx..."
Spotify Ads Manager Enhancements
Improvements to the Spotify Ads Manager provide advertisers with enhanced control over targeting and measurement, facilitating more effective and accountable advertising campaigns.
Spotify Gen AI Ads
Spotify has introduced Gen AI Ads, allowing advertisers in the US and Canada to leverage AI for generating ad scripts and voiceovers. This innovation simplifies the ad creation process, making it more accessible and efficient.
B [10:30]: "Wow. So you can just tell the AI what you want to advertise and it'll create the whole ad for you?"
Engagement Metrics and Target Audience
Spotify reports that free users engage with the platform for an average of two hours daily, with Gen Z particularly valuing Spotify as a haven from online negativity.
A [10:45]: "And Spotify's global head of advertising, Lee Brown, he talked about how Spotify's free users are engaged for like two hours a day on average. And that Gen Z specifically sees Spotify as a positive escape from all the negativity online."
B [11:11]: "So it's more than just a music platform for them. It's like a positive space."
In-House Agencies: Creative Lab and aux
Spotify's launch of Creative Lab and aux provides comprehensive services to advertisers, from campaign creation to music consulting, ensuring effective audience engagement and robust ad performance.
A [11:24]: "And then quickly they've got these in house agencies, Creative Lab and aux. Creative Lab works directly with brands on campaigns and AUX is their music consulting agency."
This episode of AI Deep Dive encapsulates the dynamic and rapidly evolving nature of the AI and tech industries. From Meta’s ambitious Llama 4 models and GitHub Copilot's strategic pricing changes to the ethical quandaries surrounding AI training data and Spotify’s innovative advertising solutions, the landscape is both challenging and opportunistic.
B [11:53]: "It's a lot to take in, but it really shows how dynamic this whole AI and tech landscape is right now."
Listeners are encouraged to reflect on these developments and consider their implications for the future of AI and technology. As AI continues to shape industries and everyday life, staying informed and critically engaged remains paramount.
A [12:18]: "Understanding all this. It's an ongoing process, but hopefully this deep dive gave you some food for thought."
AI Deep Dive continues to serve as an essential resource for anyone looking to stay abreast of the latest in artificial intelligence, providing clarity and depth on complex topics shaping our technological future.