
Hosted by Kris Moore · EN

Kimchi Stew, Part Three closes the trilogy by going one financial-engineering layer deeper than Parts One and Two walked. Parts One and Two named individual financial perversions across the AI capex cycle — the Anthropic accounting smell on the run-rate figures, the back-fill question on the Anthropic-to-xAI compute commit, the IPO concentration as financing signal, the substrate-tiering monetizing the Opus 4.8 regression, the vendor-deflection counter to the procurement-model-is-broken framing. Part Three threads all of those into a single mechanism — the stake-and-multiple loop, where hyperscaler equity stakes in AI labs are marked to fair value through profit and loss each quarter, the paper appreciation gets price-to-earnings multiple expansion as if it were operating earnings, and the entire financial-engineering layer becomes a constituency for the AI capex flywheel continuing regardless of operational reality. Three external voices anchor the editorial work: Upton Sinclair (1934) on why the participants in the loop cannot understand the mechanism; John Kenneth Galbraith (1955) on the bezzle dynamics of booms and crashes; Mike Green of Simplify Asset Management on the contemporary circular-financing structure being identical to dot-com vendor financing. The trilogy close extends the canonical "Do Your Own Work" doctrine from the model layer down to the financial-engineering layer.

Part Two of the Kimchi Stew cycle. Part One covered the capability tilt and its counterweights. Part Two covers the structural layer underneath. Four themes. The substrate mechanism — multi-token prediction crossing from research to production, the Trainium-versus-NVIDIA substrate split, non-determinism amplified through agentic loops, and the unified theory of the Opus 4.8 regression. The six-point-three percent reality — twelve gigawatts operational against one hundred ninety gigawatts announced, the structural bottlenecks behind every story walked in Part One, the IPO concentration as financing signal, the per-token transition mathematically forced by constrained supply. The vendor-deflection counter to Sam Altman — the procurement model works fine when the inputs are honest; vendor narrative-shaping plus weak transparency produced the variance the diligence would have predicted. And the Connective Tissue layer where adoption actually breaks. Then the whistle past. Then the kimchi-stew close — the hard data will come out, and your own arena is where you find out which dish you are actually eating.

Kimchi stew. The same pot of food. Delicious to many, rotten cabbage to others. Same data, two reads, different palates. That is AI in 2026, and the editorial framing across this two-part cycle. Part One covers the capability tilt and its counterweights — three leaderboards changed hands on May 28, and there are at least three different reads of that headline depending on where you are sitting. The Anthropic financial counter-weights (run-rate climbing forty-seven percent in seven weeks against vendor-reported single-source numbers, with named IPO advisors). The Anthropic-to-xAI compute deal accounting smell (a billion dollars a month to a closed-model competitor). The substrate-as-hidden-variable mechanism explaining the Opus 4.8 production-quality regression. The substrate-tiering brutal implication — Fable 5 and Mythos 5 served on the premium substrate while Opus 4.8 on Max gets demoted to spot capacity, monetizing the regression. The host's own Max-cancellation as practitioner attestation. Then the four-layer stack — Model, Harness, Control Plane, Connective Tissue — walked as editorial discipline. Then the harness-to-control-plane crossing with Dynamic Workflows shipping in the SDK, Microsoft platform-neutralizing, and the per-token-pricing transition becoming inevitable. Part Two carries the structural reality underneath.

Four theses are fighting for supremacy in artificial intelligence right now — scaling still works, the paradigm is peaking, the buildout is on track, the public is coming around. Each has primary-source support; none can be honestly crowned or dismissed. This episode holds them at educated-observer altitude across capacity, forecasting, the Anthropic-OpenAI shift, Musk as compute landlord, the three-horse race, post-transformer and quantum threads, the public-perception trend, and the agent-versus-chatbot return-on-investment split.AI Disclosure: This episode was produced with AI assistance. Research synthesis and script writing used Claude (Anthropic) under human editorial direction. Audio narration by Microsoft Edge TTS (en-US-AndrewNeural voice).

Three procurement-relevant events landed inside eight days. The Wall Street Journal scoops the OpenAI revenue and weekly-active-user miss with same-day market reaction across Oracle, AMD, Broadcom, NVIDIA, and SoftBank. The UK AI Safety Institute publishes the first independent third-party measurement that puts a generally-available model — GPT-5.5 — in the same cyber-capabilities band as Claude Mythos Preview, with overlapping confidence intervals. And the harness layer underneath both — five Claude Code releases in five days, a new persistent-goal primitive in Codex, and the open-research harness class crossing into real procurement viability for the first time. The W18 frame: revenue stress at the top of the proprietary stack, harness consolidation while open-research catches up, and a third-party evaluation that challenges the access-control argument the leading lab has been using to justify its restricted release tier. Plus a one-act handoff on the Musk-Altman trial (full treatment on The Guardrail this week), the AI Feature Tracker, and five Monday-morning principles for CTOs.AI Disclosure: This episode was produced with AI assistance. Research synthesis and script writing used Claude (Anthropic) under human editorial direction. Audio narration by Microsoft Edge TTS (en-US-AndrewNeural voice).

Four major labs moved list prices up in April. Two open-weight shops moved prices down in the same eight days. Capability commoditized at the top of the leaderboard while unit economics diverged in three directions underneath. DeepSeek V4 shipped as the first serious frontier-class open-weight model trained without CUDA as a required dependency. SpaceX and Cursor announced a compute partnership with a 60-billion-dollar acquisition option attached. This episode walks where margin is actually being defended (subscription and scope, not per-token rate), why monthly release cadence is now possible (sparse-RL consensus across four independent research groups), and what the week changes for Monday-morning procurement — plus the debut of the AI Feature Tracker recurring segment.AI Disclosure: This episode was produced with AI assistance. Research synthesis and script writing used Claude (Anthropic) under human editorial direction. Audio narration by Microsoft Edge TTS (en-US-AndrewNeural voice).

Three arcs: (1) Opus 4.7 + the nerfing narrative + Mythos/Glasswing consortium capability decoupling — led by the AMD Senior Director telemetry case (GitHub #42796, 6,852 sessions, Pearson 0.971 correlation to redaction rollout, 125x cost spike); (2) Antigravity vs Codex 2026 vs Cursor vs Windsurf — marketshare/mindshare divergence (Cursor $2B ARR) and fit-for-task patterns; (3) Models past code — FrontierScience Olympiad 77% vs Research 25% gap, benchmark saturation, custom silicon inflection (Maia 200, Trainium 3, TSMC 3nm bottleneck). Thesis: "Trust Crisis" = capability-vs-served-behavior decoupling. Cross-show pair with Guardrail Ep 8.AI Disclosure: This episode was produced with AI assistance. Research synthesis and script writing used Claude (Anthropic) under human editorial direction. Audio narration by Microsoft Edge TTS (en-US-AndrewNeural voice).

A wide-aperture survey of the most concentrated AI news cycle of Q1 2026. In fourteen days: Meta launched Muse Spark under Alexandr Wang and walked away from the open-weight default that defined Llama. Zhipu shipped GLM-5.1, a frontier-class open-weight coding model trained end-to-end on Huawei Ascend silicon with zero NVIDIA in the stack. Anthropic unveiled Claude Mythos Preview via Project Glasswing — seeded to eleven named enterprise defensive partners (AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, the Linux Foundation, Microsoft, NVIDIA, Palo Alto Networks) and explicitly declined GA release over cybersecurity dual-use risk. NVIDIA put Vera Rubin into production at 2,300 watts per GPU with mandatory liquid cooling. OpenAI killed Sora because the unit economics didn't work and redirected the compute to Codex and enterprise agents. Google went GA with Ironwood, the seventh-generation TPU. MemPalace v3.0 hit 21,700 GitHub stars in four days claiming the top of the LongMemEval benchmark (amid significant community skepticism about the benchmark methodology and one of the two named creators' actual technical involvement). Kimi K2.5 cut its input price again.This episode walks the field lab by lab and chip by chip — US frontier, Chinese open-weights wave, silicon, coding agents, the memory layer — and closes with what got heavier and what got lighter for a CTO making vendor decisions right now. Three Forward Look predictions are logged for accountability.Honest about which claims are vendor self-reports and which are independently verified. Two single-source claims (GLM-5.1 SWE-Bench Pro 58.4, MemPalace LongMemEval 96.6%) are flagged in-episode as pending independent reproduction.Runtime: 58 minutes. Coverage window: 2026-03-26 to 2026-04-08.---AI Disclosure: This episode was produced with AI assistance. Research synthesis and script writing used Claude (Anthropic) under human editorial direction. Audio narration by Microsoft Edge TTS (en-US-AndrewNeural voice).edited to fix TTS defect.

KV cache compression on your own hardware: what works, what doesn't, and when to care.Google's TurboQuant paper compresses KV cache to 3 bits per coordinate — 6x memory reduction, 8x faster inference, zero accuracy loss, no retraining required. This deep-dive walks through what it actually is, the three-layer compression stack, real benchmark results on a consumer RTX 4090, community implementations available today, the Hugging Face ecosystem integration, and a CTO decision framework for when this matters to your org. Companion to the LinkedIn article of the same name.25 sources cited. Full source list in show notes.AI Disclosure: This episode was produced with AI assistance. Research synthesis and script writing used Claude (Anthropic) under human editorial direction. Audio narration by Microsoft Edge TTS (en-US-AndrewNeural voice).

Mythos changes the threat model, three agent runtimes compete, OpenAI kills Sora, and four compliance deadlines land in four months.A leaked frontier model codenamed Mythos revealed AI-driven cyberattack capabilities that compress vulnerability exploitation from days to hours. Three agent runtimes are now competing for the enterprise stack. OpenAI shut down Sora. Private credit markets are reshaping AI infrastructure financing. And four compliance deadlines — Colorado AI Act, EU AI Act transparency, NIST agent standards, and the Pentagon's 30-day deployment directive — all land within four months.123 sources cited. Full source list in show notes.AI Disclosure: This episode was produced with AI assistance. Research synthesis and script writing used Claude (Anthropic) under human editorial direction. Audio narration by Microsoft Edge TTS (en-US-AndrewNeural voice).