Pop Goes the Stack | All Episodes

Local-first AI: Keep context out of the cloud

“Just throw it in the cloud” gets complicated when the data is your meetings, your IP, and your operating context. In this episode of Pop Goes the Stack, Lori MacVitti...

May 26, 2026 / 21:57/E42

DevOps meets AI agents: Risk, audit, and the Deming playbook

AI is no longer a lab tool; it’s showing up in pipelines, production systems, and the places where “seemed like a good idea” becomes a 2 a.m. incident. In this episode...

May 19, 2026 / 23:29/E41

Model routing isn’t load balancing (And that’s why you’re not ready)

Multi-model AI isn’t a buzzword anymore, it’s how organizations are actually operating. In this episode of Pop Goes the Stack, Lori MacVittie and Joel Moses dig into f...

May 12, 2026 / 20:06/E40

KV cache is the real inference bottleneck (Not GPUs)

GPUs get all the attention, but in inference, the real bottleneck is often memory, specifically the KV cache. In this episode of Pop Goes the Stack, Lori MacVittie sit...

May 5, 2026 / 21:09/E39

Measuring what matters: Observability for agents

Agents break the old rules of observability. Latency, throughput, and error rates still matter, but once software starts making decisions and taking actions on someone...

April 28, 2026 / 20:24/E38

Alien autopsy of LLMs: Constitutions, deception, guardrails

Why do researchers keep describing large language models like aliens? Because in enterprise environments, they often behave like something we didn’t build and can’t fu...

April 21, 2026 / 20:54/E37

Why Prompt Filters Fail Against LLM Attacks

Prompt injection has been the headline security problem for the last year, but have we been guarding the wrong layer? Lori MacVittie is joined by cohost Joel Moses and...

April 14, 2026 / 22:05/E36

OpenClaw: Multi-agent autonomy, secrets, and blast radius

OpenClaw is what happens when the industry looks at autonomous agents and decides they should have more autonomy, more persistence, and more chances to surprise you. I...

April 7, 2026 / 26:34/E35

CISO Hot Takes on MCP, PQC, and Data Center Attacks

Recorded live at F5 AppWorld 2026 in Las Vegas, this episode of Pop Goes the Stack puts Field CISO Chuck Herrin in the hot seat for a fast-moving conversation on what ...

March 31, 2026 / 17:00/E34

AI Red Teaming in Practice: Scores, guardrails, auto-remediation

AI in production isn’t just another feature to ship. It’s a non-deterministic system that can be socially engineered, fuzzed, and pushed into failure states you won’t ...

March 24, 2026 / 26:48/E33

Agent Identity Crisis: Access, audit, and “soul.md”

Coming to you from the AppWorld show floor, Joel Moses and guest co-pilot Oscar Spencer cut through the conference polish to tackle a problem that’s quickly becoming u...

March 17, 2026 / 20:33/E32

VibeOps: Guardrailed agents for deterministic production

Ops used to be a world of YAML, caffeine, and careful deploy rituals. Now it’s probabilistic models, token-based cost surprises, and reliability questions that sound m...

March 10, 2026 / 25:16/E31

WebAssembly: A programmability paradigm shift

Programmability is experiencing a paradigm shift, and this episode explains why WebAssembly is at the center of it. F5's Lori MacVittie and Joel Moses are joined by We...

March 3, 2026 / 21:36/E30

Unstructured Integration: The hidden surface area putting AI privacy & compliance at risk

"It's just a chat" is the most dangerous sentence in AI. In this episode of Pop Goes the Stack, F5's Lori MacVittie and Joel Moses are joined by data science expert Sc...

February 24, 2026 / 24:17/E29

Logging for Giants: High-Speed Telemetry in an AI World

When OpenAI discovered they could reclaim 30,000 CPU cores simply by tuning the log-forwarding agent Fluent Bit—disabling a single function that ate ~35 % of one serve...

February 17, 2026 / 21:57/E28

Low-Code Automation Tools with Teeth: FlowFuse & N8N

Low-code automation has grown up, and the competition is getting spicy. In this episode of Pop Goes the Stack, F5's Lori MacVittie and Joel Moses are joined by Aubrey ...

February 10, 2026 / 21:35/E27

The New New User Interface: AI in your brain

The capability to map brain activity to language isn’t just another UI shift—it’s a paradigm shift in how humans and machines might communicate. If you’re building sys...

February 3, 2026 / 18:08/E26

The Impact of Inference: Reliability

Traditional reliability meant consistency. Given identical inputs, systems produced identical outputs. Costs were stable and behavior predictable. Inference reliabilit...

January 27, 2026 / 22:49/E25

The Impact of Inference: Performance

Traditional performance meant deterministic response times. Identical inputs produced near-identical execution times. Optimizations reduced latency, but variance was m...

January 20, 2026 / 20:34/E24

The Impact of Inference: Availability

What does "availability" mean in a world of AI inferencing and ever-shifting workloads? It’s no longer just about servers responding or apps being online—availability ...

January 13, 2026 / 22:13/E23