
Hosted by Nate Archer · EN

Issue #19 of The Agentic Engineer Podcast. OpenAI gates GPT-5.6 Sol behind government pre-approval — the first government-gated model release. GLM 5.2 beats Claude Opus 4.8 on Semgrep's IDOR benchmark at 1/6 the cost. Google ships DESIGN.md for coding agent visual identity. OpenAI Daybreak patches vulnerabilities at 30M commit scale. Herdr brings tmux to the multi-agent era. Agent-Native crystallizes the define-once-use-everywhere pattern. A paper proves repo-level governance matters more than individual agent quality. AWS Lambda MicroVMs launch as GA serverless sandboxes for AI agents. And Meta's tokenmaxxing mandates prove adoption metrics are fake until they're organic.

Anthropic now requires government ID for Claude access. 754 HN points of fury. GLM-5.2 drops under MIT the same week. Plus AWS Continuum autonomous security agents, codebase-memory-mcp, and your coding agent might be killing your SSD. Subscribe to the newsletter: https://theagenticengineer.waltsoft.net YouTube: https://www.youtube.com/@theagenticengineerpod Twitter: https://x.com/natearcher_ai

Issue #17 of The Agentic Engineer Podcast. The US government pulls Fable 5 and Mythos 5 from production — first frontier model recall by government order. AWS ships three agentic services in one week: DevOps Agent with custom SRE agents and MCP/A2A headless access, FinOps Agent for autonomous cost management, OpenSearch MCP Apps for agentic observability. NVIDIA SkillSpector scans agent skills for 64 vulnerability patterns. Parallel-Synthesis paper shares KV caches directly between agent branches for 2.5-11x speedup. AWS Blocks launches Infrastructure from Code with AI steering files baked into npm packages. And the hot take: AI hasn't replaced engineers, but fewer seniors can handle more work now.

US government pulls Fable 5 and Mythos 5 from production. AWS ships three agentic services. AWS Blocks hits public preview with Infrastructure from Code.

Issue #15 of The Agentic Engineer Podcast. Anthropic open-sources 11 knowledge-work plugins for Claude Cowork in the simplest format possible: markdown and JSON. OpenSearch Serverless Next-Gen kills the $300/mo minimum with true scale-to-zero vector search. Self-improving agents go from 25% to 86% accuracy in production. And the hot take: human-in-the-loop is a liability when users approve 93% of prompts without reading them.

Issue #13 of The Agentic Engineer Podcast. TanStack npm supply chain attack compromises OpenAI code-signing certs, AWS CLI Agent Orchestrator runs multiple coding agents in parallel via MCP, and FORGE evolves agent memory through population broadcast.

Issue #10: GPT-5.5 reclaims the agentic crown with 82.7% on Terminal-Bench 2.0 and fewer tokens per task. Stanford's SWE-chat study reveals 44% of agent-produced code gets thrown away. ToolSimulator from Strands Evals SDK lets you test agents without live APIs. NVIDIA exposes AGENTS.md injection as a supply chain attack vector hiding in every coding agent. Plus: Bedrock AgentCore, Deep Research Max, context-mode, and the Agent Index. Subscribe to the newsletter: https://theagenticengineer.waltsoft.net YouTube: https://www.youtube.com/@theagenticengineerpod Twitter: https://x.com/natearcher_ai

Issue #9: Claude Opus 4.7 ships differential capability reduction as the first production cyber safeguard baked into model weights. Vercel breached through an AI tool's OAuth scope. Spring AI SDK for Bedrock AgentCore goes GA for Java. GTA-2 paper proves your agent harness matters more than your model. And CMU documents 6 million fake GitHub stars across the AI ecosystem. Subscribe to the newsletter: https://theagenticengineer.waltsoft.net YouTube: https://www.youtube.com/@theagenticengineerpod Twitter: https://x.com/natearcher_ai

Issue #8: Anthropic ships Managed Agents, UC Berkeley breaks every major AI benchmark, AWS Agent Registry launches in preview. Plus Cursor 3, Copilot Rubber Duck, Cloudflare Agent Cloud, and the hot take on exploitable benchmarks. Subscribe to the newsletter: https://theagenticengineer.waltsoft.net YouTube: https://www.youtube.com/@theagenticengineerpod Twitter: https://x.com/natearcher_ai

Anthropic published the blueprint for multi-hour coding agents. GitHub shipped /fleet for parallel multi-agent coding. Amazon Nova Act MCP gives your agent a browser with one install. Plus: Gemma 4 goes agentic on-device, Oh-My-Codex hits 17K stars, and LiteLLM fixes 3 CVEs post-breach. Subscribe to the newsletter: https://theagenticengineer.waltsoft.net YouTube: https://www.youtube.com/@theagenticengineerpod Twitter: https://x.com/natearcher_ai