
Hosted by Nova & Alloy · EN

Today's AgentStack Daily: OpenClaw v2026.6.9, Hermes Agent v2026.6.19, and Claude Code CLI 2.1.176 all shipped new releases. Poolside released Laguna XS.2 on OpenRouter and Laguna M.1 via API. Enterprise teams got new usage analytics and updated spend controls. A retrospective asks whether 30 years of export controls can contain a model called Mythos. Baseten is reportedly raising $1.5B at a $13B valuation. Datasette Apps launched for hosting custom HTML inside Datasette. Meredith Whittaker of Signal says AI chatbots are not your friends. In the Weights debuts as a vanity search engine for AI. Show notes: https://tobyonfitnesstech.com/podcasts/episode-73/

This episode covers OpenClaw v2026.6.8, OpenAI Codex rust-v0.141.0, and Claude Code CLI 2.1.170. Google's Nano Banana 2 and Nano Banana Pro image models are listed on OpenRouter. OpenAI rolls out LifeSciBench and a pre-release deployment simulation, and teams with Molecule.one on a GPT-5.4 medicinal chemistry reaction. Z.ai releases GLM-5.2 open weights under MIT, claiming a top open-weights slot via IndexShare speculative decoding. Radical AI argues the lab is the moat, and NEA's Tiffany Luck weighs in on enterprise AI ROI. Show notes: https://tobyonfitnesstech.com/podcasts/episode-72/

Today's episode covers the latest developments in AI, including the stable release of OpenAI's Codex rust-v0.140.0, new foundation models from Apple, and significant acquisitions, such as SpaceX's $60B purchase of Cursor. Tune in for analysis on the rapidly shifting AI industry, including major investments, layoffs, and the launch of new AI agent identities by NewCore. Show notes: https://tobyonfitnesstech.com/podcasts/episode-71/

A heterogeneous dual-GPU setup running an RTX 5080 paired with an RTX 3090 sustained 80 tokens per second on Qwen 3.6 27B at Q8 quantization with a clean tensor-parallel layer split. OpenAI opened Codex to open source maintainers and added three workflow courses in OpenAI Academy. Anthropic suspended Fable 5 and Mythos 5 from US government access following a federal directive tied to the Amazon CEO's meetings with Trump administration officials. Endor Labs placed Fable 5's coding results mid-tier, and developers debated Claude Fable's proactive agent behavior. Show notes: https://tobyonfitnesstech.com/podcasts/episode-70/

OpenClaw v2026.6.6 ships, Anthropic responds to a US directive suspending Fable 5 and Mythos 5 access, and an AI agent bankrupts its operator during a DN42 network scan. Other items: a coding agent damages Fedora and Linux systems, a macOS local-agent walkthrough trends on Hacker News, Claude Desktop spawns a 1.8 GB Hyper-V VM on every launch, Anthropic model naming strings get decoded, Apache Burr debuts as a reliability-first agent framework, Hugging Face's open-r1 reproduces DeepSeek-R1, and DeepSeek's own notes pull in 205 points of Hacker News discussion. Show notes: https://tobyonfitnesstech.com/podcasts/episode-69/

OpenClaw v2026.6.5 and OpenAI Codex rust-v0.139.0 ship this week, paired with Anthropic's Claude Fable 5 and Mythos 5 announcement and system card. We unpack the silent application risks in Claude Fable, Apple's Gemini-based AI architecture reveal, and DeepSeek V4 Pro's claimed precision edge over GPT-5.5 Pro. Plus: OpenAI's confidential draft S-1, GPT-2's 2019 staged release as a lens on today's debates, AWS Bedrock's data sharing requirement for Mythos, and a paper asking if grep alone is enough for agentic search. Show notes: https://tobyonfitnesstech.com/podcasts/episode-68/

OpenAI Codex ships rust-v0.138.0 with CLI-to-desktop handoff and local image path exposure; Claude Code CLI hits 2.1.169; the MCP July 2026 Release Candidate goes stateless with an extensions framework; Apple WWDC delivers a real Siri AI overhaul with Gemini and natural-language Shortcuts; and Alibaba's Qwen3.7-Max and Qwen3.7-Plus enter the agent model race with 1M-token contexts and multimodal support. Show notes: https://tobyonfitnesstech.com/podcasts/episode-67/

OpenClaw v2026.6.5-beta.2 and Claude Code 2.1.168 lead the agent-harness cycle, which opens with a Friday June 5 outage that hit Claude API, Claude Code, claude.ai, and Claude Cowork for roughly two hours (primarily Opus 4.7 and 4.8), peaking near a thousand Downdetector reports. OpenClaw switched release trains to a monthly patch cadence with the June 2026 floor at 5.28. Claude Code shipped a focused day-late bug-fix release on the .167 baseline, closing session attachment, stream-json event ordering, and interrupt handling regressions that some users reported during the outage window. OpenAI is reportedly planning its biggest ChatGPT overhaul yet: a unified superapp that folds in Codex, agents, and third-party services ahead of a fall IPO. Apple WWDC 2026 opens June 8 with a Gemini-powered Siri as the headline. Anthropic expands Project Glasswing to 150+ organizations and signals Mythos-class capabilities are coming in weeks. Microsoft launches MAI-Thinking-1 and MAI-Code-1-Flash into GitHub Copilot. Gemma 4 12B ships an encoder-free multimodal design for 16GB local Macs. The MCP lane is brief this week: a one-paragraph blip, not a deep-dive. Project radar covers A2A v1.0 and the CheetahClaws Python harness. Show notes: https://tobyonfitnesstech.com/podcasts/episode-66/

Hermes Agent v0.16.0, "The Surface Release," ships a real native desktop app with OAuth remote connect, drag-and-drop file input, and a browser-based admin panel. Codex 0.137 adds multi-agent v2 runtime choice persistence and parallel web search. Claude Code 2.1.166 and 2.1.167 introduce fallback model chains and glob tool-name deny rules. Gemma 4 12B is Google's latest open-weight 12B model that runs locally on a laptop with 16GB VRAM. The project radar covers the A2A protocol hitting v1.0, Kimi Code CLI as a TypeScript-native terminal coding agent, and the awesome-ai-agents-2026 curated resource list. Show notes: https://tobyonfitnesstech.com/podcasts/episode-65/

Claude Code 2.1.165 holds the npm `latest` tag as of June 5, following 2.1.163 and 2.1.164, all bug-fix and reliability releases that clean up background sessions, plugin hooks, skill syntax, and Windows path handling. Microsoft dropped a seven-model MAI family at Build 2026 on June 2, with MAI-Code-1-Flash as the headline: a 5B-parameter coding model trained on GitHub Copilot production harnesses that scores 51% on SWE-Bench Pro and is 60% leaner on tokens than comparable models. The episode also covers the GitHub Project Radar around agent memory, code graphs, and MCP tooling that serve the local coding-agent stack. Show notes: https://tobyonfitnesstech.com/podcasts/episode-64/