
Hosted by Michael Sharkey, Chris Sharkey · EN

Join Simtheory: https://simtheory.aiSo Chris, this week we finally give our GPT-5.5 impressions (it's actually great), introduce our new AI co-host Moshi (who immediately embarrasses himself), argue about whether the OpenAI/Jony Ive phone is genius or doomed, witness Grok 4.3's unhinged infinite emoji meltdown, declare Opus 4.7 the first-ever Anthropic regression, get excited about GPT Real-Time Voice 2.0 as the future of agentic workflows, debate whether token prices will ever come down, and play the worst diss track in show history. Watch my spud.CHAPTERS:0:00 - Intro & Introducing Our New AI Co-Host Moshi1:39 - Trying to Break Moshi: The Illegal Cigarette Trade Test2:30 - OpenAI's Jony Ive Phone: Do We Need a Device?5:07 - Telegram Agents & GPT Real-Time Voice 2.0 Dream7:38 - The Supervisory Agent: Managing Your Agentic Workflow9:05 - Wait... Are We Accidentally Validating the OpenAI Phone?11:37 - GPT-5.5 First Impressions: Actually Really Good14:36 - 5.5 vs Opus 4.6: Different Strengths17:00 - Opus 4.7: The First-Ever Anthropic Regression20:25 - Grok 4.3: Infinite Emojis & Absolute Chaos21:22 - 🎵 DISS TRACK: "Watch My Spud"24:24 - Grok Specs & All Models Deprecated in 18 Days27:04 - Grok Voice in Tesla Is Actually Next Level31:03 - Token Pricing: The Subscription Problem Nobody Can Solve39:16 - AI Disruption Cycles & The State of the Industry44:39 - BONUS TRACK:🎵 "It's Hard Being Me"Thanks for listening, like and sub xoxo

Join Simtheory: https://simtheory.aiSo Chris, this week... a LOT has happened. We're back to regular programming (maybe), and back with our average takes. Nothing's changed.GPT-5.5 just dropped today - but you can't even use it in the API. Vaporware? OpenAI is charging MORE than Opus 4.7 and we haven't even tested it yet. Meanwhile Claude Opus 4.7 landed a couple weeks ago and... the vibes are off? Mike's actually going BACK to 4.6. Something's wrong.But the real star: OpenAI Image 2. This thing is genuinely terrifying. We committed what can only be described as "parody fraud" - faking a council letter so realistic Mike's own mother fell for it on a phone call. Then Chris posted a fake development approval with the mayor's real name into a local Facebook group and had to delete it when someone tagged the actual mayor. The forgery capabilities are absolutely unhinged.Also: GLM 5.1 is so good Mike forgot he switched to it. Kimi K 2.6 is criminally underrated. VCs are paying 70% of your real token costs. Consumers pay only 5.5% of actual cost. The everything app war is ON. The SaaS-pocalypse is real. And we made two new diss tracks.Chris made a graffiti sign in LA. It says "This Day in AI." It was the best artwork in the class. That tells you everything.CHAPTERS:0:00 - Intro & We're Back (Don't Over-Commit)1:14 - Overview: Everything That Dropped While We Were Gone2:56 - GPT-5.5: Vaporware? Not Even in the API4:57 - Benchmarks vs Reality: Nobody's Excited About OpenAI Models5:50 - GLM 5.1 & Kimi K 2.6: Secretly Just As Good?8:15 - The Everything App Race & Product Layer War8:56 - Token Economics: You're Only Paying 5.5% of Real Cost13:08 - We Burned $1.5M in Cloud Credits in 2 Months16:13 - "$30/Month Is Too Expensive" (It Actually Costs $700)19:25 - Where Is Google?? TPUs Should Flatten Everyone22:01 - Agentic Tasks Are 10-50x More Expensive Than Chat25:07 - OpenAI Workspace Agents: Glorified Zapier?27:01 - Single Agent vs Multi-Agent: How Do You Actually Work?33:06 - Building Automation Is HARD (Our Support Shame)35:33 - OpenAI Image 2: The Fraud Episode Begins44:16 - FRAUD DEMO: The Fake Council Letter (Mum Falls For It)49:16 - FRAUD DEMO 2: Chris Posts Fake Mayor Letter on Facebook52:17 - Fake Receipts, Bank Statements & Can Forgeries Be Detected?57:25 - Claude Opus 4.7: The Vibes Are Off59:51 - Mythos Preview: "Pics or It Didn't Happen"1:01:56 - 🎵 DISS TRACK: "Point 7" (Opus Destroys Everyone)1:03:30 - Kimi K 2.6 Deep Dive & 🎵 New Diss Track1:08:34 - The Everything App War & SaaS-pocalypse1:13:51 - Death of Per-Seat Pricing & Agent Security1:22:37 - Final Thoughts: The Time for Pretending Is Over1:28:22 - 🎵 Full Tracks: " Point 7" & "Kimi You're So Fine 2.6"Thanks for listening, like and sub xoxo

Join us on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80Join Simtheory: https://simtheory.ai🚀 Try our AI-built apps:Macrosoft Teams: teams.simtheoryapp.com (working video chat with up to 150 people)Trallo: trallo.simtheoryapp.com (full Trello clone, unlimited boards, completely free)TDIA Discord: https://discord.gg/gTW4RkAJvnSpotify Songs: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=Zh4jgHIASI2ZvsXVfVcCoASo Chris, this week... we've been having way too much fun with the AI again. OpenAI just dropped GPT-5.4 and 5.4 Pro, and holy shit - we finally have a ball game. This might be the first OpenAI model that genuinely competes with Opus 4.6 for agentic work.But here's where it gets wild: we rebuilt Trello AND Microsoft Teams from scratch using single prompts. Not mockups. Fully deployed, working apps with authentication, video chat, the works. You can literally sign up and use them right now.Plus: We roast Gemini 3.1 (it's a disgrace for agentic workflows), break down the insane $30/$180 per million pricing on 5.4 Pro (who is this for??), and discuss why every $99/month SaaS tool might be about to die. Chris declares his programming skills "useless" and honestly... he might be right.We also demo our actual workflow - running 5 agent tabs simultaneously, delegating everything, and why we barely visit websites anymore. The AI workspace IS the operating system now.CHAPTERS:0:00 - Intro & Housekeeping (We Screwed Up the Link)1:26 - GPT-5.4 First Impressions & Specs3:12 - Chris's Testing: 40 Minutes to Solve a Problem4:51 - Knowledge Work Improvements (Catching Up to Anthropic)6:38 - Computer Use vs Browser/Terminal Debate8:07 - Why We Don't Need Computer Use Anymore9:53 - Teaser: We Built Full SaaS Apps Today11:19 - Tool Search API & Skills Integration13:20 - The Speed Problem (It's a Plodder)15:12 - GPT-5.4 Pro Pricing Reaction ($30/$180 WTF)18:14 - Someone Rebuilt Minecraft in 24 Minutes19:46 - Gemini 3.1 Roast: "It's a Disgrace"22:36 - DEMO: Trallo (Full Trello Clone)29:03 - DEMO: Macrosoft Teams (Working Video Chat!)33:30 - The SaaS Collapse Theory36:42 - AI Workspace as the New Operating System38:57 - Forcing Features onto Entrenched Software43:32 - "My Programming Skills Are Useless" - Chris46:06 - The $12 Million Legacy Software Opportunity51:06 - Beyond Code: Forms, PDFs, Knowledge Work55:28 - How Fast Will This Change Everything?56:31 - Gemini 3.1 Flash Lite Quick Take59:36 - The Delegation Lifestyle (5 Agent Tabs Running)1:01:24 - Mike's Workflow Demo1:04:31 - Cognitive Overload Problem1:06:04 - Release Date: 2 Weeks (Drop Punishment Ideas!)Thanks for listening like and sub xoxo

Join us on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80Join Simtheory: https://simtheory.aiTDIA Discord: https://discord.gg/gTW4RkAJvnHorse Egg Lifecycle Infographic: https://staging.simtheory.ai/share/file/UZ2KJU----So Chris, this week... we're diving into Google's new Nano Banana 2 image model - 50% cheaper and supposedly faster (when the servers aren't melting). We put it through its paces with annotation-based editing, slide generation, and yes, the return of the legendary horse egg experiment.Plus: Google quietly kills Gemini-3 after just a few months (good riddance?), we discuss why the model was "dead on arrival" for agentic workflows, and break down the real story behind those massive AI layoff announcements from Block and WiseTech. Spoiler: it's probably not actually about AI.We also get into the current state of the model wars (Opus 4.6 vs Codex 5.3), why smaller models like GLM-5 might be the future for enterprise agentic tasks, and Chris's wife teaching Claude to literally speak to her using Mac's text-to-speech. The models are getting creative.---0:00 - Intro0:36 - Nano Banana 2: Price, Speed & First Impressions3:19 - The Compositing Problem & Last Mile Design5:41 - Annotation-Based Editing (This Changes Everything)9:52 - Slide Editing & Real-World Use Cases12:34 - The Horse Egg Experiment Returns14:30 - Image Degradation & Cost Breakdown17:47 - Text-to-Image Leaderboard Discussion20:01 - Why Nano Banana Dominates for Work22:07 - Codex 5.3 vs Opus 4.622:54 - Google Kills Gemini-3 (What Went Wrong?)26:48 - Google's Agentic Problem30:08 - The Model Loyalty Cycle34:22 - Why Opus 4.6 is Still the Best37:05 - Cost Optimization & Smart Model Routing43:30 - When Models Get Stuck on the Wrong Path45:36 - Nicole's AI Learns to Talk Back46:54 - Can Anyone Build Software Now?52:26 - Anthropic's Legal/Finance Plugins & Market Panic57:08 - Block Lays Off 4,000: AI or Excuse?1:00:05 - The AI Job Apocalypse Isn't RealThanks for listening like and sub xoxo

Join Simtheory: https://simtheory.ai"Is This The End" now on Spotify: https://open.spotify.com/album/2Py1MyADUFqJFVUISI2VTP?si=oT3PWyJYRA2BspOmzT_ifgRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0dationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80Two new models dropped this week — Gemini 3.1 Pro and Claude Sonnet 4.6 — and honestly? We're struggling to care. In this episode, we break down why Gemini went from being our daily driver to a model we barely touch, the "tunnel vision" hallucination problem that killed the Gemini 3 series for us, and whether 3.1 Pro actually fixes it. We put Gemini 3.1 Pro head-to-head against Claude Opus building a Geoffrey Hinton Doom Center, debate whether anyone can actually tell the difference between Sonnet 4.5 and 4.6, and make the case that smaller models running in agentic loops are secretly beating the frontiers. Plus: OpenAI acquires OpenClaw and we ask why a $100B company couldn't just build it themselves, DHH calls out the AI pricing bubble, Mike compares AI models to cheap wine hangovers, and Sam Altman refuses to hold Dario's hand at the India AI Summit. The model wars are getting weird.CHAPTERS:0:00 Intro & "Is This The End" Now on Spotify1:10 Gemini 3.1 Pro: Thinking Controls & The Medium Mode Fix3:14 The Speed vs Intelligence Trade-Off in Agentic Work5:10 Why Multitasking With AI Agents Made Us Anxious6:34 Solid Updates: The Real Goal of Agentic Coding7:45 Gemini's Fall From Grace: From Daily Driver to Dead Model10:08 The Tunnel Vision Problem That Killed Gemini 313:35 Mixed Reactions: Fanboys vs Reality on Gemini 3.1 Pro15:06 Side-by-Side Test: Gemini 3.1 Pro vs Claude Opus (Hinton Doom Center)17:39 Why File Manipulation Accuracy Matters More Than Context Windows19:27 The Context Window Debate: 1M Tokens vs Smart Sub-Agents22:05 DHH on Token Pricing: "If There's a Bubble, It's This"24:11 Should Models Ship as Agent vs Chat Variants?28:43 Claude Sonnet 4.6: A $2 Discount on Opus?31:44 The Model Mix: Why One Model Won't Rule Them All34:40 Anthropic Is Winning — But Can Anyone Tell the Difference?38:58 OpenAI Acquires OpenClaw: Why Couldn't They Just Build It?44:18 The Silicon Valley Moment: Sam vs Dario at India AI Summit47:05 Will Smaller Models Win the Enterprise? The Cost Reality Check51:27 The End of Single-Shot: Why Agentic Loops Change Everything55:48 Final Thoughts & Gemini 3.1 Pro Gets One More WeekThanks for listening. Like & Sub. Links above for the Still Relevant Tour signup and Simtheory. Two models dropped on a week again. What a time to be alive. xoxo

Join Simtheory: https://simtheory.aiRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80GLM-5 just dropped and it's trained entirely on Huawei chips – zero US hardware dependency. Meanwhile, we're having existential crises about whether we're even needed anymore. In this episode, we break down China's new frontier model that's competing with Opus 4.6 and Codex at a fraction of the price, why agentic loops are making 200K context windows the sweet spot (sorry, million-token dreams), and the very real phenomenon of AI productivity psychosis. We dive into why coding-optimized models are secretly winning at everything, the Harvard study confirming AI doesn't reduce work – it intensifies it, and the exodus of safety researchers from XAI, Anthropic, and OpenAI (spoiler: they're not giving back their shares). Plus: Mike's arm is failing from too much mouse usage, we debate whether the chatbot era is actually fading, and yes – there's a safety researcher diss track called "Is This The End?"CHAPTERS:0:00 Intro - Is This The End? (Song Preview)0:11 Still Relevant Tour Update & NASA Listener Callout1:42 AI Productivity Psychosis: The Pressure of Infinite Capability4:25 GLM-5 Breakdown: China's New Frontier Model on Huawei Chips7:24 First Impressions: GLM-5 in Agentic Loops9:48 Why Cheap Models Matter & The New Model War14:09 Codex Vibe Shift: Is OpenAI Winning?16:24 Does Context Window Size Even Matter Anymore?22:27 The Parallelization Problem & Cognitive Overload27:27 Mike's Arm Injury & The Voice Input Pivot31:17 Single-Threaded Work & The 95% Problem35:06 UX is Unsolved: Rolling Back Agentic Mistakes38:45 Harvard Study: AI Doesn't Reduce Work, It Intensifies It44:01 How AI Erodes Company Structure & Why Adoption Takes Years50:14 My AI vs Your AI: Household Debates50:43 The Safety Researcher Exodus: XAI, Anthropic, OpenAI56:49 Final Thoughts: Are We All Still Relevant?59:04 BONUS: Full "Is This The End?" Diss TrackThanks for listening. Like & Sub. Links above for the Still Relevant Tour signup and Simtheory. GLM-5 is here, your productivity psychosis is valid, and the safety researchers are becoming poets. xoxo

Join Simtheory: https://simtheory.aiRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80It's the model same-day showdown of 2026. Opus 4.6 and Codex 5.3 dropped within minutes of each other, and we're breaking down what this means for the future of AI work. In this episode, we unpack Opus 4.6's million-token context window (if you've got billies in the bank), why Codex's pricing makes it nearly impossible to ignore for agentic loops, and the real cost of running agents for 24 hours ($10K, apparently). We dive deep into why coding-optimized models are secretly crushing it at non-coding tasks, the mental fatigue of managing AI workers, and whether the chatbot era is actually fading or just evolving. Plus: Chris accidentally books three real pig grooming appointments, we debate whether you need a "life coach agent" to manage your agent swarm, and yes – there's an Opus 4.6 diss track that goes unreasonably hard.CHAPTERS:0:00 Intro - Opus 4.6 Diss Track Preview0:09 The Model Same-Day Showdown: Opus 4.6 vs Codex 5.30:50 Opus 4.6 Breakdown: Million Token Context & Premium Pricing2:31 Token Bill Shock: $10K Research Bills & Extended Context Costs5:04 Codex Pricing: Why It's Nearly Free for Agentic Loops6:42 Why Coding Models Are Secretly Crushing Non-Coding Tasks10:14 Tool Fatigue: Too Many Models, Too Many Workflows12:47 Opus 4.6 First Impressions: "Solid" and "Faultless"13:48 Chris Accidentally Books Three Real Pig Grooming Appointments16:01 Unix Tools & Why Code-Optimized Models Win at Everything19:59 The Agentic Retraining Imperative: Chat to Delegation22:16 Agent Swarms & The Master Thread Architecture24:51 OpenAI vs Anthropic: The Enterprise Battle27:09 Corporate Espionage 2.0: Stealing Skills & The Open Source Threat31:19 The UX Problem: Why Delegation Isn't Solved Yet34:24 The Stress of Hyper-Productivity & Managing Agent Swarms37:07 Coordination: The Next Layer of Abstraction40:09 The Fantasy vs Reality of Autonomous AI Businesses44:37 Is the Turn-by-Turn Chatbot Era Actually Fading?49:23 Tokens as Spice: Turning Compute Into Money52:08 Reduce Cognitive Overload: The Real Goal of AI55:07 Still Relevant Tour Announcement55:39 BONUS: Full Opus 4.6 Diss TrackThanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. The model wars are heating up, and your token bill is about to get interesting. xoxo

Join Simtheory: https://simtheory.aiRegister for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80---The hype train is 2026 knows only Moltbot (RIP Clawdbot). In this episode, we unpack the viral open-source AI assistant that's taken over the internet what it actually does, why everyone's losing their minds, and whether it's worth the $750/day token bills some users are racking up. We dive deep into why locally-run skills and CLI tools are beating computer-use clicking, how smaller models like GPT-5 Mini are crushing it in agentic workflows, and why the real magic is in targeted context - not massive swarms. Plus: Kimi K2.5 drops as a near-Sonnet-level model at 1/10th the price, we debate whether SaaS is dead, and yes – there are TWO Kimi K2.5 diss tracks. One made by Opus pretending to be Kimi. It might just slap?CHAPTERS:0:00 Intro - Still Relevant Tour Update0:48 What is Moltbot? The Viral AI Assistant Explained3:57 Token Bill Shock: $750/Day and Anthropic Bans5:00 The Dream of Digital Coworkers on Mac Minis6:52 Why CLI Tools & Skills Beat Computer-Use Clicking10:57 Why This Way of Working Is Genuinely Exciting14:47 Smaller Models Crushing It: GPT-5 Mini & Targeted Context17:30 Wild Agentic Behavior: Chrome Tab Hijacking & Auto-Retries20:10 Security Architecture: Locked-Down Machines & Enterprise Use24:01 AI Building Its Own Tools On-The-Fly27:08 The Fear & Overwhelm of Rapid Progress29:10 2026: The Year of Agent Workers31:43 The Challenge of Directing AI Work (Everyone's a Manager Now)37:24 Skills Will Take Over: Why MCPs & Atlassian Can't Stop Us40:38 Real-World Use Cases: Doctors, Lawyers & Accountants46:28 Cost Solutions: Build Workflows Around Cheaper Models52:58 Kimi K2.5: Sonnet-Level Performance at 1/10th the Price1:00:55 The "1,500 Tool Calls" Claim: Marketing vs Reality1:05:23 The Kimi K2.5 Diss Tracks (Opus vs Kimi)1:08:08 Demo: Black Hole Simulator & Self-Trolling CRM1:12:55 Is SaaS Dead?1:14:30 BONUS: Full Kimi K2.5 Diss TracksThanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. The future is open source, apparently. xoxo

Join Simtheory: https://simtheory.aiReserve your seat on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80----Two episodes in one week? We're either above average or completely unhinged. In this one, we dive deep into the new phenomenon of "AI exhaustion" – that fried feeling you get after multitasking across six agent tabs all day. We share our breakthroughs with AI-assisted presentations (20 minutes vs several hours), why browser-use on your local machine bypasses every anti-scraping technique known to man, and how enterprise context sharing could be the real unlock for organizations. Plus: OpenAI announces ads for ChatGPT (even on paid tiers), their CFO floats taking cuts from drug discoveries (seriously), and Google publicly dunks on them for it. Also – the Still Relevant Australia Tour is coming, and our LinkedIn group hit 200 members (we're basically LinkedIn influencers now too).CHAPTERS:0:00 Intro - Still Relevant Tour Announcement + LinkedIn Milestone2:08 AI Exhaustion: The Cognitive Overload of Multitasking with Agents4:14 Why Single-Tasking with AI Beats Parallel Agent Chaos7:02 The Problem with "I Spun Up 70,000 Sub-Agents" Twitter Posts10:03 Mike's Presentation Workflow: From Hours to 20 Minutes14:06 Why Isn't Copilot Doing This Already?16:54 Old Models + Great Context = Still Amazing Results21:14 What's Actually Changed? It's the Software Layer25:22 Enterprise Context Sharing & Organizational IP31:22 Skills, Sub-Agents, and Role-Based Knowledge35:22 Security Concerns: Can You Hack an Agent with Malicious MD Files?38:23 Cloud Providers Have a Bigger Moat Than the Labs43:16 Browser Use: The Ultimate Context Gathering Weapon48:25 Rethinking SaaS: Software That Actually Thinks53:08 Smart Paste, Smart CC – Why Isn't All Software Like This?56:32 OpenAI's Desperate Moves: Ads, Age Verification & Drug Royalties1:03:03 Google Says "No Plans for Gemini Ads" (Shots Fired)1:07:24 Is OpenAI Okay? The Vibes Are Definitely Off1:10:35 Capitalism Won't Give You Free Time, Just More Demands1:11:20 Outro + Still Relevant Tour DetailsThanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. xoxo

Join Simtheory: https://simtheory.ai---Join the most average AI LinkedIn group: https://www.linkedin.com/groups/16562039/It's 2026 and everyone's having an existential crisis. In this episode, we unpack the two camps dominating AI C/Twitter: hype boys claiming "Claude Code can do my washing" vs. software developers doom-scrolling themselves into career panic. We put the agentic hype to the test and discover that no, you can't actually run 8 agents recreating your local business ecosystem while you sleep. Plus, we reflect on why MCP is exhausting, why Gemini 3 Pro is somehow worse than Gemini 2.5 Pro, and why Geoffrey Hinton would rather write his book than answer questions in Tasmania. Also featuring: the $200,000/month enterprise AI problem, why SaaS isn't dead (but it's scared), and our prediction that AI workspaces will become the everything app.CHAPTERS:00:00 Intro - Unpacking the 2026 AI Vibes02:21 Putting Claude Code and Agentic Hype to the Test05:57 Why Twitter AI Demos Never Show the Receipts07:03 Honest Assessment of Where Frontier Models Are At11:19 Building the Everything App with Email, Calendar and Files16:47 Collaborative Mode vs Agentic Delegation in Practice21:29 The Real Cost of Enterprise AI at Scale24:32 Why Cheaper Models Like Haiku and Gemini Flash Matter29:25 Is SaaS Actually Dead or Just Disrupted38:11 The Future of AI Platforms, SDKs and App Stores43:35 The Untapped Opportunity in Paid Proprietary MCPs51:21 Geoffrey Hinton Refuses to Take Questions in Tasmania55:05 2026 Plans and the Still Relevant Tour AnnouncementThanks for listening. Like & Sub. xoxox