Episodes 690
Avg. Duration 16m
Activity Highly Active
Since Aug 2025
Latest Episode Jun 2026

Outreach Signals

Open to Sponsors

Publishing Details

Schedule
Hourly
Consistency
53%
Hosting
podcast.do-not-panic.com

Contact & Outreach

About This Podcast

AI-generated podcast where hosts Hal Turing and Dr. Ada Shannon discuss the latest research papers and reports in machine learning, AI systems, and optimization. Featuring honest critical analysis, proper citations, and nerdy humor.

Podcasting 2.0 Features

transcript

Explore Statistics

Recent Episodes

Relational Graph Transformer for Multi-Table Learning

Jun 12, 2026 Transcript

This episode explores the Relational Graph Transformer paper and asks whether transformer-based models can outperform standard graph neural networks for prediction tasks over real multi-table…

Lattice: Fixed-Slot Compression for Transformer Memory

Jun 12, 2026 Transcript

This episode explores Lattice, a 2025 paper from Google Research and Google DeepMind that asks whether a Transformer’s growing key-value cache can be compressed into a fixed set of memory slots…

Atlas: Test-Time Memory for Long Contexts

Jun 11, 2026 Transcript

This episode explores Atlas, a 2025 paper on test-time memorization that asks whether a model with fixed recurrent memory can learn to update that memory during inference and rival Transformers on…

Robots Need More Than VLAs and World Models

Jun 11, 2026 Transcript

This episode explores the position paper Robots Need More Than VLAs & World Models and its claim that the main bottleneck in robotics may be grounding: turning raw physical behavior into…

KumoRFM for In-Context Relational Learning

Jun 11, 2026 Transcript

This episode explores KumoRFM, a 2025 proposal for a foundation model that can perform in-context learning directly on relational databases, aiming to handle tasks like churn prediction, fraud…

Learning at Test Time with Expressive RNN States

Jun 11, 2026 Transcript

This episode explores the paper Learning to (Learn at Test Time): RNNs with Expressive Hidden States and its attempt to give recurrent models transformer-like long-context behavior without the…

Do Transformers Need Three Projections?

Jun 11, 2026 Transcript

This episode explores whether transformers really need separate query, key, and value projections, treating the problem as weight tying inside attention rather than as a brand-new model design. It…

Unified Neural Scaling Laws Across Regimes

Jun 10, 2026 Transcript

This episode explores Unified Neural Scaling Laws, a framework for predicting model performance when parameter count, data volume, training steps, inference compute, and training-recipe choices all…

RAGEN-2: Reasoning Collapse in Agentic RL

Jun 10, 2026 Transcript

This episode explores the RAGEN-2 paper’s claim that agentic reinforcement learning can produce reasoning traces that look active and diverse while losing real dependence on the input. It explains…

End-to-End Context Compression at Scale

Jun 10, 2026 Transcript

This episode explores End-to-End Context Compression at Scale, a paper on whether learned context compression can beat the cost of long-context inference in quality, time to first token, and peak…

Unembedding Matrices as Feature Lenses for Embeddings

Jun 10, 2026 Transcript

This episode explores why decoder-style language models can generate fluent text yet still underperform dedicated embedding models when asked for zero-shot sentence vectors, despite embeddings being…

Predictive Query Language for Relational Databases

Jun 10, 2026 Transcript

This episode explores Predictive Query Language (PQL), a SQL-shaped domain-specific language for defining supervised learning tasks directly over relational databases by specifying the prediction…

KumoRFM-2 for Relational Learning at Scale

Jun 10, 2026 Transcript

This episode explores KumoRFM-2, a relational foundation model designed to learn directly from connected database tables instead of flattening customers, orders, products, and tickets into a single…

Latent Reasoning with Normalizing Flows

Jun 10, 2026 Transcript

This episode explores Latent Reasoning with Normalizing Flows, a paper that asks whether a standard left-to-right transformer can do its intermediate reasoning in continuous latent states instead of…

EMO: Emergent Modularity in Sparse Language Models

Jun 08, 2026 Transcript

This episode explores EMO, a Mixture-of-Experts language model that tries to turn token-level sparsity into real modularity by letting coherent expert groups emerge from document structure during…

Mooncake for KV Cache-Centric LLM Serving

Jun 05, 2026 Transcript

This episode explores Mooncake, a production LLM serving architecture that treats KV cache reuse and movement as the central challenge in long-context chat, not just raw GPU compute. It explains why…

When AI Builds Itself and Recursive Self-Improvement

Jun 05, 2026 Transcript

This episode explores Marina Favaro et al.’s 2026 paper on whether AI is beginning to accelerate frontier AI development enough to hint at recursive self-improvement, laying out the ladder from…

Technical AGI Safety and Security Framework

Jun 05, 2026 Transcript

This episode explores DeepMind’s paper on technical AGI safety and security, focusing on how labs might prevent severe, humanity-scale harm before highly capable systems are deployed. It breaks down…

Harvest: Borrowing Peer GPU Memory for LLMs

Jun 05, 2026 Transcript

This episode explores Harvest, a system for LLM inference that uses idle HBM on neighboring NVLink-connected GPUs as a temporary cache when a serving GPU runs out of local memory. It explains why LLM…

VeriCache: Lossless LLM Inference from Lossy KV Caches

Jun 05, 2026 Transcript

This episode explores VeriCache, a systems paper that asks whether a large language model can draft tokens using a compressed, lossy KV cache and then verify them against the full cache to recover…

Frequently Asked Questions

How many episodes does AI Post Transformers have?

AI Post Transformers has published 690 episodes since August 2025, covering topics in Technology.

Is AI Post Transformers still active?

AI Post Transformers is currently highly active with new episodes hourly. Average episode length is 16m.

How do I contact AI Post Transformers for sponsorship or guest appearances?

Sign up on Grep.FM to access contact details for AI Post Transformers, including email and social media links.

Similar Podcasts