Summary5 min read

Podcast Summary: This Week in AI

Episode: Everything You Need to Know About Gemini 3.0!
Date: December 1, 2025
Host: Jason Calacanis (+ CEO-level expert guests)

Overview

In this episode, Jason Calacanis and a roundtable of CEO-level AI experts break down the launch of Gemini 3.0, Google’s flagship multimodal AI model. The discussion covers Gemini’s new features, real-world demos, benchmark results, and hands-on experiments for both end-users and developers. The panel compares Gemini 3.0 to other major LLMs, explores its agentic and creative capacities, tests its multimodal understanding, and discusses impacts on workflow, development, and learning.

Key Discussion Points & Insights

1. Launch and Availability of Gemini 3.0

New Models: Gemini 3 Pro (consumer) and Gemini 3 Deepthink (enhanced capabilities).
Accessibility: Available for everyone via the Gemini app, developers through AI Studio and "Anti Gravity" developer platform, plus enterprise solutions.
Unified Subscription: One subscription spans all platforms and use cases.

2. Performance and Benchmarks

Superior Reasoning and Multimodal Understanding:
- “Gemini 3 performed well over almost all of the different benchmarks, including Humanity's last exam and Vending Benchmark.” (A, 00:35)
- Notable improvements in Vibe coding and agentic (autonomous, multi-step) task-solving.
Outperforming Competitors:
- On Vending Bench 2, Gemini 3 surpasses Claude Sonnet, Grok 4, and ChatGPT 5.1 in coherence and long-horizon goal management.
- “Vending Bench 2 tests models’ ability to stay coherent and successfully manage a simulated business over the course of a year…” (A, 03:10)

3. Real-World Applications and Demos

Coding and Project Management

Helpful, Contextual Explanations:
- “Gemini’s response was much better… walked me through a problem… gave me a project progression to avoid what it calls ‘tutorial hell.’” (A, 01:33)
Compared favorably to ChatGPT for beginner coding advice and project planning.

Complex Planning Tasks

Road Trip Planning:
- Gemini can organize multi-step, multi-day travel (San Diego to Canada), factoring in real-time info like Pacific Coast Highway closures.
- “It rerouted the trip to accommodate for this… It was to the point and understood the regions I’m going to and didn’t overload me with information.” (A, 02:50)
- Highlights practical, up-to-date reasoning and restraint in information delivery.

Multimodal Understanding

Video Analysis for Thumbnails:
- Gemini can select optimal scenes for thumbnails by understanding video context, e.g., “anime race car driver at 5 minutes and 7 seconds, or the guitarist on stage at 8 minutes and 30 seconds.” (A, 04:35)
Sports Coaching from Video:
- Analyze sports videos, identify errors, and provide personalized improvement suggestions.
Music Learning Tools:
- Gemini creates interactive piano chord labs, with visual, auditory, and theory explanations.
- “It created a chord laboratory… interactive piano, an audio engine, a chord selector, and visual theory as well…” (A, 05:42)

Interactive Visuals & Learning

PDF & Code Integration:
- Upload complex documents and receive explorable, interactive visuals.
- Combines chat, coding, and visualization in a single workspace, making concept learning dynamic.

4. Developer Tools & AI Studio

Building Websites and Apps:
- Gemini 3 can create design-forward websites and apps with just a simple prompt.
- “I asked Gemini to create a website for my Vibe coding business…and I’m still really impressed just by giving it that simple prompt.” (A, 06:12)
Game Creation with Multimodal Controls:
- Simple prompts can generate games with interactive features and multimodal input (camera, sound).
- Example: “Tempo Strike” tracks hands using the camera, “Shader Pilot” responds to sound and clicks.
Design Inspiration from Screenshots:
- Upload website screenshots as inspiration, Gemini recreates the UI style, color palette, and layout.
- “It basically took the typography and the general design from WhisperFlow and just made it its own.” (A, 07:41)

5. Agentic Features and Future Directions

AI Voice Coach Example:
- Automatically adds features like a voice coach to generated apps, enabling interactive, domain-specific help.
- Notable exchange:
  - A: “How can I get better using AI?” (A, 08:24)
  - B (Gemini AI): “That’s a fascinating topic. What particular applications of AI are you most interested in exploring?” (B, 08:26)
  - A: “I’d love to learn a little bit more about prompt engineering.” (A, 08:32)
  - B: “Prompt engineering is a really important skill… Are you working with text based models or something else?” (B, 08:36)
Integration and Extensibility:
- Code snippets and features can be copied into developers’ own projects.
- Upcoming: Deeper dive into Google’s agentic developer platform, Anti Gravity.

Notable Quotes & Memorable Moments

On Gemini’s real-world helpfulness:
“It kind of gave me a project progression to avoid what it calls tutorial hell.” (A, 01:34)
On long-horizon tasks:
“It doesn’t just start right away, it outlines the structure, defines the itinerary, and breaks the problem down in multiple steps.” (A, 02:31)
On video understanding:
“Gemini was previously not able to understand video content like this.” (A, 05:11)
On possibilities for developers:
“Being able to create a design-forward website with one simple prompt is crazy.” (A, 06:27)
On agentic, surprising features:
“It added an AI voice coach directly in the app without me even asking.” (A, 08:07)
On the power of interaction:
“It even seems like it trained the model that I’m talking to, to understand that it’s an AI tutorial.” (A, 08:46)

Timestamps for Key Segments

00:00 — Gemini 3.0 launch and model overview
01:00 — Coding advice comparison: Gemini vs. ChatGPT
02:10 — Benchmarks & long-horizon performance (Vending Bench 2)
02:55 — Road trip planning demo: reasoning, context, up-to-date information
04:10 — Multimodal understanding: video analysis, thumbnails, and sports coaching
05:35 — Interactive learning tools: chord laboratory, PDFs & visuals
06:10 — Developer tools: building websites, games, and using AI Studio
07:35 — Design inspiration from screenshots; embedding AI voice coach
08:24 — Live exchange with Gemini AI voice coach
08:46 — Agentic capabilities, developer extensibility

Conclusion

This episode offers a deep, hands-on look at Gemini 3.0, showcasing its benchmarks, real-world intelligence, and creative potential. From advanced reasoning and context-aware planning to multimodal interactivity and developer-friendly tools, Gemini 3.0 stands out as both a consumer and developer platform. The panel’s demos and candid commentary highlight a near-future where AI is not just responsive but agentic and contextually engaging—poised to reshape how we build, learn, and collaborate with machines.

Loading summary

Transcript5 lines

[00:00]
A
Gemini 3 is here and you can now use one subscription for every task. They launch with two models, Gemini 3 Pro and Gemini 3 Deepthink. Of course, it's available in the Gemini app for everyday use and then it's also available for developers in the AI studio as well as their new developer platform called Anti Gravity, and it's also become available for enterprises. Gemini 3 performed well over almost all of the different benchmarks, including Humanity's last exam and vending benchmark. Some of the improvements I've seen with Gemini three are its ability to do Vibe coding and even agentic coding. It's increased multimodal understanding between images, videos, audios, text and even code. And overall I've seen the improved reasoning where it'll think deeper and understand the nuance in my questions. I've been trying to learn a little bit more about the basics of coding as I start to dive into Vibe coding And I asked ChatGPT and Gemini the same question create a five step plan to learn how to code and I thought Gemini's response was much better Overall, while ChatGPT's response was still helpful, step three kind of tells you to build tiny projects as you learn and gives you some small tips. I thought Gemini's response was much more helpful as it walked me through a problem that I may face which is kind of following tutorials but not actually starting to write. So it kind of gave me a project progression to avoid what it calls tutorial hell. And it gave me an easy three step project progression which I will definitely follow. Gemini also really excels over long horizon tasks. For example, on a benchmark that tests this fending bench 2 Gemini performs much better than any of the other models including Cloudsonnet, Grok 4 and ChatGPT 5.1. Vending Bench 2 tests models ability to stay coherent and successfully manage a simulated business over the course of a year using a vending machine as an example, it gives LLM the primary goal of maximizing profits while giving it control over things like pricing and the different items that are in the vending machine. I've been planning on going on a road trip along the west coast from San Diego all the way to Canada. This is a complex trip, involves multiple stops and I want to do this trip over the course of a week. I thought that having Gemini 3 plan this trip would be a great way to test its abilities in reasoning and also multi step tasks, so I gave it the prompt. Plan a one week road trip along the US west coast from Mexico to Canada, planning all stops including scenic or landmark stops, gas stations, food and hotels. Let's see how it does. You can see here using the thinking model, it doesn't just start right away. It'll start by outlining the structure and defining the itinerary, and then break down the problem in multiple steps. Something I love about the response is it doesn't go overboard and give me too much information. It really lays out the information that I asked for in a neat way. And also during its research it found that the Pacific coast highway is closed in the Big sur region until 2026, so it rerouted the trip to accommodate for this. Overall, I love that it was to the point and understood the regions that I'm going to and didn't overload me with information. It even tells me some pretty specific details about how you must get the clam chowder at Splash Cafe in Pismo Beach A few days ago I made a video and I wanted to make a thumbnail using Gemini's help. With its increased multimodal understanding, it's able to take in a video and understand its context like never before. So I'm going to bring in the video and ask it to help me find scenes of the video that would make for a good thumbnail, whether it's the reaction that I'm making or the context that is on the page. And just by giving it a quick prompt, it gave me three great options for a thumbnail. It was able to understand what's going on in the video at different timestamps, mentioning the anime race car driver at 5 minutes and 7 seconds, or the guitarist on stage at 8 minutes and 30 seconds. Gemini was previously not able to understand video content like this. You're also able to just upload videos of you playing sports, and Gemini will be able to output detailed instructions on what you're doing wrong and what you can work on, giving you a detailed plan to get better. Gemini is now able to create interactive visuals directly in the chat. As a guitarist, I've been wanting to learn a little bit more about different chord structures and how they sound different when compared to each other. So I told Gemini I want to learn about different chords in music from major to minor to major, seventh to diminished, and I wanted to use a piano as the main interface while me through understanding different chords and use sound to help me understand this as well. After selecting Canvas in the toolbar, I sent in this prompt and it created a chord laboratory that had an interactive piano, an audio engine, a chord selector, and visual theory as well, which can help me learn more about chord structure. Let's see what I came up with. This is really wild. It also gives you a theory analyst at the bottom and a listening guide to help you better understand the chord structure. You can even ask Gemini to help you learn about complex topics by just typing into the chat interface or uploading complex PDFs and it'll generate code for an interactive visual which then you can play with right inside of Gemini to further understand what you're trying to learn. For developers, you can now use Gemini 3 and third party tools like Cursor, GitHub or Replit, and in Google tools like AI Studio and their new agentic development platform called Antigravity. A great place to start exploring new Gemini models is Google's AI Studio. AI Studio is a browser based development environment where you can build, test and even deploy different AI powered applications. I actually used AI Studio in a previous demo where I made an app that used nanobanana and I've noticed some crazy improvements since then. We can see some of the differences here. One of the main differences is Gemini 3's ability to create beautiful UI in Google's AI studio. You want to make sure you head to build in here you can browse some apps that were made using Gemini 3. Compared to the app that I made using Gemini 2.5, these apps have insane design. Just looking at this page, you would think it would take thousands of dollars to get your website to look this good, but you're now able to make websites at this level in just one prompt. I asked Gemini to create a website for my Vibe coding business where I wanted a white ui, soft shadows, minimalist typography, kind of matching that Swedish design aesthetic. And while what it created here may not be as insane as some of the examples, I'm still really impressed just by giving it that simple prompt. It created this full website filled with customer reviews, a curriculum with great animations, a daily Vibe section where you can generate new tips, and overall I just feel like it is a pretty great ui. I definitely think I can prove it over a little bit more prompting, but being able to create a design forward website with one simple prompt is crazy. Another way you can use Gemini is to create games like this one, Tempo Strike, even uses your camera to track your hands. This one Shader Pilot actually uses sounds while you control some sort of plane. And the sounds are even interactive with whatever button you're clicking. So if I click the up button it seems like the tone kind of rises and if I click the down button the tone kind of gets a Little lower. Pretty insane. I asked Gemini to create a game where you can control a plane in a 3D environment. Let's check it out. So it gave me quick directions where W is throttle up, S is air brake, and then you can kind of control the pitch as well. It even gave me a mission briefing where I'm supposed to clear for takeoff and then navigate through the obstacle course. This game is really hard, almost impossible to play, but it is really cool that Gemini was able to create a game like this. You can see that this plane gets up to 2,000 km per hour, which makes it almost impossible to hit any of these achievements, but still pretty fun. While that game wasn't amazing, you can see the possibilities that are available with Gemini 3. And a trick that I can do using AI Studio with Gemini 3 is actually bring in a screenshot of a site that I really like and have it use that website as a template for my site. I use Whisper Flow's website design, which is really cool, to help me create a website of my own. And while I would really never just steal someone else's website, using it as inspiration for your own is a great way to use this tool. In my prompt, I asked Gemini to use the photo I uploaded for inspiration and create a website for my AI tutoring company. You can see that it basically took the typography and the general design from Whisperflow and just made it its own. While it doesn't look as good as the Whisper Flow website, it really did a good job copying all the different typographies, colors, and just general design. One thing that's insane though is that it added an AI voice coach directly in the app without me even asking. Let's check it out. So I click the button and just ask, how can I get better using AI?
[08:26]
B
That's a fascinating topic. What particular applications of AI are you most interested in exploring?
[08:33]
A
I'd love to learn a little bit more about prompt engineering.
[08:36]
B
Great. Prompt engineering is a really important skill with AI. Are you working with text based models or something else?
[08:46]
A
That's so insane it even seems like it trained the model that I'm talking to to understand that it's an AI tutorial. And as a developer you can even take some of this code and implement it into your own projects. And Google also just released its agentic developer platform, which I'll dive into deeper in a different video. Thanks for tuning in to this week in AI and I'll see you next time.