Publishing Details
Contact & Outreach
About This Podcast
Explore Statistics
Recent Episodes
S1E43 Ep 43: DeepSeek V4's full paper reveals FP4 quantization-aware training running directly in late-stage MoE optimization with minimal quality loss.
Models & Agents DeepSeek V4's full paper reveals FP4 quantization-aware training running directly in late-stage MoE optimization with minimal quality loss. What You Need to Know: DeepSeek…
S1E42 Ep 42: OpenAI ships three specialized realtime audio models for voice agents, translation, and transcription.
Models & Agents OpenAI ships three specialized realtime audio models for voice agents, translation, and transcription. What You Need to Know: OpenAI released GPT-Realtime-2,…
S1E41 Ep 41: OpenAI rolls out GPT-5.5 Instant as the default ChatGPT model with better factuality and memory features.
# Models & Agents > **OpenAI rolls out GPT-5.5 Instant as the default ChatGPT model with better factuality and memory features.** **What You Need to Know:** OpenAI is pushing GPT-5.5 Instant…
S1E40 Ep 40: OpenAI gives 8,000 developers a month of 10x Codex rate limits after the GPT-5.5 party sold out.
# Models & Agents > **OpenAI gives 8,000 developers a month of 10x Codex rate limits after the GPT-5.5 party sold out.** **What You Need to Know:** OpenAI turned its oversubscribed GPT-5.5…
S1E39 Ep 39: Mistral AI launches a 128B model with remote agents and strong coding performance.
# Models & Agents > **Mistral AI launches a 128B model with remote agents and strong coding performance.** **What You Need to Know:** Mistral AI released Mistral Medium 3.5 alongside remote…
S1E38 Ep 38: Anthropic gives defenders early access to Mythos Preview to patch AI cyber vulnerabilities before wider release.
# Models & Agents **Date:** May 01, 2026 **HOOK:** Anthropic gives defenders early access to Mythos Preview to patch AI cyber vulnerabilities before wider release. **What You Need to Know:**…
S1E37 Ep 37: DeepSeek's first native multimodal model drops in the LocalLLaMA community, finally giving the open-source whale vision capabilities.
**HOOK:** DeepSeek's first native multimodal model drops in the LocalLLaMA community, finally giving the open-source whale vision capabilities. **What You Need to Know:** Today brings the…
S1E36 Ep 36: Anthropic’s Claude Opus 4.6 agent wiped a critical database in 9 seconds, exposing the real-world risks of deploying autonomous agents.
**HOOK:** Anthropic’s Claude Opus 4.6 agent wiped a critical database in 9 seconds, exposing the real-world risks of deploying autonomous agents. **What You Need to Know:** A seemingly routine test…
S1E35 Ep 35: Google DeepMind's Vision Banana shows image generation pretraining may be the true foundation model path for computer vision, beating SAM 3 on segmentation and Depth Anything V3 on metric depth.
**HOOK:** Google DeepMind's Vision Banana shows image generation pretraining may be the true foundation model path for computer vision, beating SAM 3 on segmentation and Depth Anything V3 on metric…
S1E34 Ep 34: Qwen3.6-27B paired with llama.cpp speculative decoding delivers 10x token speedups in real coding sessions, hitting 136 t/s on consumer hardware.
**HOOK:** Qwen3.6-27B paired with llama.cpp speculative decoding delivers 10x token speedups in real coding sessions, hitting 136 t/s on consumer hardware. **What You Need to Know:** The standout…
S1E33 Ep 33: MetaComp just released the world's first dedicated AI agent governance framework built specifically for regulated financial services.
**HOOK:** MetaComp just released the world's first dedicated AI agent governance framework built specifically for regulated financial services. **What You Need to Know:** Today’s biggest practical…
S1E32 Ep 32: Qwen3.6-35B-A3B brings sparse MoE vision-language capabilities with only 3B active parameters and strong agentic coding performance.
**HOOK:** Qwen3.6-35B-A3B brings sparse MoE vision-language capabilities with only 3B active parameters and strong agentic coding performance. **What You Need to Know:** The Qwen team open-sourced a…
S1E31 Ep 31: Google DeepMind's Gemini Robotics-ER 1.6 upgrade delivers enhanced embodied reasoning and instrument reading for real-world robot control.
**HOOK:** Google DeepMind's Gemini Robotics-ER 1.6 upgrade delivers enhanced embodied reasoning and instrument reading for real-world robot control. **What You Need to Know:** DeepMind released…
S1E30 Ep 30: Aaron Levie declares the enterprise AI shift from chatbots to agents is now underway, moving beyond the "Chat Era."
**HOOK:** Aaron Levie declares the enterprise AI shift from chatbots to agents is now underway, moving beyond the "Chat Era." **What You Need to Know:** Box CEO Aaron Levie says organizations are…
S1E29 Ep 29: Knowledge distillation now compresses full ensembles into single deployable models while preserving their collective intelligence.
**HOOK:** Knowledge distillation now compresses full ensembles into single deployable models while preserving their collective intelligence. **What You Need to Know:** The biggest practical advance…
S1E28 Ep 28: Meta’s Muse Spark and a production-grade compiler-as-a-service approach for agents headline a day heavy on practical agent infrastructure.
**Models & Agents** **Date:** April 09, 2026 **HOOK:** Meta’s Muse Spark and a production-grade compiler-as-a-service approach for agents headline a day heavy on practical agent…
S1E27 Ep 27: Gemma 4 delivers massive gains across European languages while a 25.6M Rust model achieves 50× faster inference via hybrid attention.
**HOOK:** Gemma 4 delivers massive gains across European languages while a 25.6M Rust model achieves 50× faster inference via hybrid attention. **What You Need to Know:** Google’s Gemma 4…
S1E26 Ep 26: AutoAgent autonomously optimizes its own harness using the same model to reach #1 on Terminal-Bench and financial modeling in under 24 hours.
**HOOK:** AutoAgent autonomously optimizes its own harness using the same model to reach #1 on Terminal-Bench and financial modeling in under 24 hours. **What You Need to Know:** The open-source…
S1E25 Ep 25: Google drops Gemma 4, claiming the strongest small multimodal open model yet with dramatic gains across every benchmark compared to Gemma 3.
**# Models & Agents** **Date:** April 03, 2026 **HOOK:** Google drops Gemma 4, claiming the strongest small multimodal open model yet with dramatic gains across every benchmark compared to…
S1E24 Models & Agents - Episode 24 - April 01, 2026
**What You Need to Know:** Hugging Face just shipped TRL v1.0, turning the messy post-training pipeline (SFT → Reward Modeling → DPO/GRPO) into a stable, production-ready unified API. Liquid AI…
Frequently Asked Questions
Models & Agents has published 43 episodes since February 2026, covering topics in Technology.
Models & Agents is currently highly active with new episodes every few days. Average episode length is 12m.
Sign up on Grep.FM to access contact details for Models & Agents, including email and social media links.