Large Language Model (LLM) Talk

AI-Talk

English Technology

Episodes 68

Avg. Duration 15m

Activity Dormant

Since Jan 2025

Latest Episode Jan 2026

Publishing Details

Schedule Daily

Format Episodic

Hosting anchor.fm

About This Podcast

AI Explained breaks down the world of AI in just 10 minutes. Get quick, clear insights into AI concepts and innovations, without any complicated math or jargon. Perfect for your commute or spare time, this podcast makes understanding AI easy, engaging, and fun—whether you're a beginner or tech enthusiast.

Recent Episodes

Context Engineering

Jan 21, 2026 13m

Context engineering is the system-level discipline of architecting the dynamic information environment for AI models. Unlike prompt engineering, which focuses on phrasing specific instructions,…

Manus AI

Jan 19, 2026 17m

Manus AI is a general-purpose autonomous agent designed to function as a digital worker rather than a passive chatbot. Developed by Monica and acquired by Meta, it utilizes a Planner-Executor…

Kimi K2

Jul 22, 2025 15m

Kimi K2, developed by Moonshot AI, is an open agentic intelligence model built on a Mixture-of-Experts (MoE) architecture. It features 1 trillion total parameters, with 32 billion active during…

Mixture-of-Recursions (MoR)

Jul 18, 2025 16m

Mixture-of-Recursions (MoR) is a unified framework built on a Recursive Transformer architecture, designed to enhance the efficiency of large language models. It achieves this by combining three core…

MeanFlow

Jul 10, 2025 6m

MeanFlow models introduce the concept of average velocity to fundamentally reformulate one-step generative modeling. Unlike Flow Matching, which focuses on instantaneous velocity, MeanFlow directly…

Mamba

Jul 10, 2025 8m

Mamba is a novel deep learning architecture that achieves linear scaling in computation and memory with sequence length, addressing Transformers' quadratic limitations. Its selective State Space…

LLM Alignment

Jun 14, 2025 20m

LLM alignment is the process of steering Large Language Models to operate in a manner consistent with intended human goals, preferences, and ethical principles. Its primary objective is to make LLMs…

Why We Think

May 20, 2025 14m

"Why We Think," by Lilian Weng, examines improving language models by allocating more computation at test time, drawing an analogy to human "slow thinking" or System 2. By treating computation…

Deep Research

May 12, 2025 11m

Deep Research is an autonomous research agent built into ChatGPT. It performs multi-step online research over several minutes, behaving like a human researcher by searching, reading, analyzing, and…

vLLM

May 04, 2025 13m

vLLM is a high-throughput serving system for large language models. It addresses inefficient KV cache memory management in existing systems caused by fragmentation and lack of sharing, which limits…

Qwen3: Thinking Deeper, Acting Faster

May 04, 2025 13m

Qwen3 models introduce both Mixture-of-Experts (MoE) and dense architectures. They utilize hybrid thinking modes, allowing users to balance response speed and reasoning depth for tasks, controllable…

RAGEN: train and evaluate LLM agents using multi-turn RL

May 03, 2025 11m

RAGEN is a modular system for training and evaluating LLM agents using multi-turn reinforcement learning. Built on the StarPO framework, it implements the full training loop including rollout…

DeepSeek-Prover-V2

May 01, 2025 11m

DeepSeek-Prover-V2 is an open-source large language model designed for formal theorem proving in Lean 4. Its training relies heavily on synthetic data, generated by using DeepSeek-V3 to decompose…

DeepSeek-Prover

May 01, 2025 8m

The DeepSeek-Prover project aims to advance large language model capabilities in formal theorem proving by addressing the scarcity of training data. It uses autoformalization to convert informal high…

Model Context Protocol (MCP)

Apr 09, 2025 13m

The Model Context Protocol (MCP), introduced by Anthropic in November 2024, is an open protocol standardizing how applications provide context to LLMs. Acting like a "USB-C port for AI applications,"…

LLM Post-Training: Reasoning

Mar 17, 2025 22m

LLM post-training is crucial for refining the reasoning abilities developed during pretraining. It employs fine-tuning on specific reasoning tasks, reinforcement learning to reward logical steps and…

Agent AI Overview

Mar 17, 2025 21m

Agent AI refers to interactive systems that perceive visual, language, and environmental data to produce meaningful embodied actions in physical and virtual worlds. It aims to create sophisticated…

FlashAttention-3

Mar 07, 2025 13m

FlashAttention-3 accelerates attention on NVIDIA Hopper GPUs through three key innovations. It achieves producer-consumer asynchrony by dividing warps into producer (data loading with TMA) and…

FlashAttention-2

Mar 05, 2025 10m

FlashAttention-2 builds upon FlashAttention to achieve faster attention computation with better GPU resource utilization. It enhances parallelism by also parallelizing along the sequence length…

FlashAttention

Mar 05, 2025 10m

FlashAttention is an IO-aware attention mechanism designed to be fast and memory-efficient, especially for long sequences. Its core innovation is tiling, where input sequences are divided into blocks…

Frequently Asked Questions

How many episodes does Large Language Model (LLM) Talk have?

Large Language Model (LLM) Talk has published 68 episodes since January 2025, covering topics in Technology.

Is Large Language Model (LLM) Talk still active?

Large Language Model (LLM) Talk is currently dormant; its latest episode was published in January 2026. Its listed schedule is daily, and the average episode length is 15 minutes.