Transcript
A (0:00)
Gemini 3 is here, and you can now use one subscription for every task. They launched with two models, Gemini 3 Pro and Gemini 3 Deep Think. Of course, it's available in the Gemini app for everyday use, it's available for developers in AI Studio as well as their new developer platform called Antigravity, and it's also become available for enterprises. Gemini 3 performed well across almost all of the different benchmarks, including Humanity's Last Exam and Vending-Bench. Some of the improvements I've seen with Gemini 3 are its ability to do vibe coding and even agentic coding. It has increased multimodal understanding across images, video, audio, text and even code. And overall I've seen improved reasoning, where it'll think deeper and understand the nuance in my questions.
I've been trying to learn a little bit more about the basics of coding as I start to dive into vibe coding, and I asked ChatGPT and Gemini the same question: create a five-step plan to learn how to code. I thought Gemini's response was much better overall. While ChatGPT's response was still helpful, step three just tells you to build tiny projects as you learn and gives you some small tips. I thought Gemini's response was much more helpful, as it walked me through a problem that I may face, which is following tutorials but not actually starting to write code. So it gave me an easy three-step project progression to avoid what it calls tutorial hell, which I will definitely follow.
Gemini also really excels at long-horizon tasks. For example, on Vending-Bench 2, a benchmark that tests this, Gemini performs much better than any of the other models, including Claude Sonnet, Grok 4 and ChatGPT 5.1.
Vending-Bench 2 tests a model's ability to stay coherent and successfully manage a simulated business over the course of a year. Using a vending machine as the example, it gives the LLM the primary goal of maximizing profits while giving it control over things like pricing and the different items that are in the vending machine.
I've been planning on going on a road trip along the West Coast from San Diego all the way to Canada. This is a complex trip that involves multiple stops, and I want to do it over the course of a week. I thought that having Gemini 3 plan this trip would be a great way to test its abilities in reasoning and multi-step tasks, so I gave it the prompt: "Plan a one-week road trip along the US West Coast from Mexico to Canada, planning all stops including scenic or landmark stops, gas stations, food and hotels." Let's see how it does. You can see here that, using the thinking model, it doesn't just start right away. It'll start by outlining the structure and defining the itinerary, and then break the problem down into multiple steps. Something I love about the response is that it doesn't go overboard and give me too much information. It really lays out the information that I asked for in a neat way. Also, during its research it found that the Pacific Coast Highway is closed in the Big Sur region until 2026, so it rerouted the trip to account for this. Overall, I love that it was to the point, understood the regions that I'm going to, and didn't overload me with information. It even gives me some pretty specific details, like how you must get the clam chowder at Splash Cafe in Pismo Beach.
A few days ago I made a video, and I wanted to make a thumbnail with Gemini's help. With its increased multimodal understanding, it's able to take in a video and understand its context like never before.
So I'm going to bring in the video and ask it to help me find scenes that would make for a good thumbnail, whether it's the reaction that I'm making or the content that is on the page. And just by giving it a quick prompt, it gave me three great options for a thumbnail. It was able to understand what's going on in the video at different timestamps, mentioning the anime race car driver at 5 minutes and 7 seconds, or the guitarist on stage at 8 minutes and 30 seconds. Gemini was previously not able to understand video content like this. You're also able to upload videos of yourself playing sports, and Gemini will output detailed feedback on what you're doing wrong and what you can work on, giving you a detailed plan to get better.
Gemini is now able to create interactive visuals directly in the chat. As a guitarist, I've been wanting to learn a little bit more about different chord structures and how they sound compared to each other. So I told Gemini I want to learn about different chords in music, from major to minor to major seventh to diminished, and that I wanted it to use a piano as the main interface while walking me through the different chords, and to use sound to help me understand this as well. After selecting Canvas in the toolbar, I sent in this prompt, and it created a chord laboratory that had an interactive piano, an audio engine, a chord selector, and visual theory as well, which can help me learn more about chord structure. Let's see what it came up with. This is really wild. It also gives you a theory analysis at the bottom and a listening guide to help you better understand the chord structure. You can even ask Gemini to help you learn about complex topics by just typing into the chat interface or uploading complex PDFs, and it'll generate code for an interactive visual, which you can then play with right inside of Gemini to further understand what you're trying to learn.
For developers, you can now use Gemini 3 in third-party tools like Cursor, GitHub or Replit, and in Google tools like AI Studio and their new agentic development platform called Antigravity. A great place to start exploring new Gemini models is Google's AI Studio. AI Studio is a browser-based development environment where you can build, test and even deploy different AI-powered applications. I actually used AI Studio in a previous demo where I made an app that used Nano Banana, and I've noticed some crazy improvements since then. We can see some of the differences here. One of the main differences is Gemini 3's ability to create beautiful UI in Google's AI Studio. You want to make sure you head to Build. In here you can browse some apps that were made using Gemini 3. Compared to the app that I made using Gemini 2.5, these apps have insane design. Just looking at this page, you would think it would take thousands of dollars to get your website to look this good, but you're now able to make websites at this level in just one prompt.
I asked Gemini to create a website for my vibe coding business, where I wanted a white UI, soft shadows, and minimalist typography, kind of matching that Swedish design aesthetic. And while what it created here may not be as insane as some of the examples, I'm still really impressed, given that I just gave it that simple prompt. It created this full website filled with customer reviews, a curriculum with great animations, and a daily vibe section where you can generate new tips, and overall I just feel like it is a pretty great UI. I definitely think I can improve it with a little bit more prompting, but being able to create a design-forward website with one simple prompt is crazy.
Another way you can use Gemini is to create games. This one, Tempo Strike, even uses your camera to track your hands. This one, Shader Pilot, actually uses sounds while you control some sort of plane. And the sounds are even interactive with whatever button you're clicking.
So if I click the up button, it seems like the tone kind of rises, and if I click the down button, the tone kind of gets a little lower. Pretty insane.
I asked Gemini to create a game where you can control a plane in a 3D environment. Let's check it out. So it gave me quick directions, where W is throttle up, S is air brake, and then you can kind of control the pitch as well. It even gave me a mission briefing where I'm cleared for takeoff and then have to navigate through the obstacle course. This game is really hard, almost impossible to play, but it is really cool that Gemini was able to create a game like this. You can see that this plane gets up to 2,000 km per hour, which makes it almost impossible to hit any of these achievements, but it's still pretty fun. While that game wasn't amazing, you can see the possibilities that are available with Gemini 3.
A trick that I can do using AI Studio with Gemini 3 is to bring in a screenshot of a site that I really like and have it use that website as a template for my site. I used Wispr Flow's website design, which is really cool, to help me create a website of my own. And while I would never just steal someone else's website, using it as inspiration for your own is a great way to use this tool. In my prompt, I asked Gemini to use the photo I uploaded for inspiration and create a website for my AI tutoring company. You can see that it basically took the typography and the general design from Wispr Flow and made it its own. While it doesn't look as good as the Wispr Flow website, it really did a good job copying the typography, colors, and general design. One thing that's insane, though, is that it added an AI voice coach directly in the app without me even asking. Let's check it out. So I click the button and just ask, "How can I get better using AI?"
