AI Safety - Paper Digest

AI Safety - Paper Digest

Arian Abbasi, Alan Aqrawi

Episodes 13
Avg. Duration 10m
Activity Declining
Since Oct 2024
Latest Episode Dec 2025

Publishing Details

Schedule
Monthly
Format
Episodic
Hosting
anchor.fm

Contact & Outreach

About This Podcast

The podcast where we break down the latest research and developments in AI Safety - so you don’t have to. Each episode, we take a deep dive into new cutting-edge papers. Whether you’re an expert or just AI-curious, we make complex ideas accessible, engaging, and relevant. Stay ahead of the curve with AI Security Papers. Disclaimer: This podcast and its content are generated by AI. While every effort is made to ensure accuracy, please verify all information independently.

Explore Statistics

Recent Episodes

AI Agents: Adoption and Usage | Perplexity Comet

Dec 10, 2025 5m

Dive into the key findings of the first large-scale field study on the adoption, usage intensity, and use cases of general-purpose AI agents, drawing on hundreds of millions of anonymized user…

S2 WEF & Accenture | Advancing Responsible AI Innovation: A Playbook

Sep 25, 2025 2m Bonus

This episode of the AI Safety Paper Digest is about the World Economic Forum's new playbook on advancing responsible AI innovation. In cooperation with Accenture, the report provides a practical…

S2E2 Okay Waymo, Crash My Car! 🗣️ Testing Autonomous Vehicle Safety with Adversarial Driving Scenarios | LD-Scene

Aug 20, 2025 18m

How can we make autonomous driving systems safer through generative AI? In this episode, we explore LD-Scene, a novel framework that combines Large Language Models (LLMs) with Latent Diffusion Models…

The Full LLM Glossary and Foundations

Aug 04, 2025 1h 28m

Ever wanted a clear, comprehensive explanation of all the key terms related to Large Language Models (LLMs)? This episode has you covered.In this >1-hour deep-dive, we'll guide you through the…

S1E9 Anthropic's Best-of-N: Cracking Frontier AI Across Modalities

Dec 25, 2024 12m

In this special christmas episode, we delve into "Best-of-N Jailbreaking," a powerful new black-box algorithm that demonstrates the vulnerabilities of cutting-edge AI systems. This approach works by…

S1E8 Auto-Rewards & Multi-Step RL for Diverse AI Attacks by OpenAI

Nov 30, 2024 11m

In this episode, we explore the latest advancements in automated red teaming from OpenAI, presented in the paper "Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step…

S1E7 Battle of the Scanners: Top Red Teaming Frameworks for LLMs

Nov 04, 2024 14m

In this episode, we explore the findings from "Insights and Current Gaps in Open-Source LLM Vulnerability Scanners: A Comparative Analysis." As large language models (LLMs) are integrated into more…

S1E6 Watermarking LLM Output: SynthID by DeepMind

Oct 24, 2024 12m

In this episode, we delve into the groundbreaking watermarking technology presented in the paper "Scalable Watermarking for Identifying Large Language Model Outputs," published in Nature.…

S1E5 Open Source Red Teaming: PyRIT by Microsoft

Oct 08, 2024 10m

In this episode, we dive into PyRIT, the open-source toolkit developed by Microsoft for red teaming and security risk identification in generative AI systems. PyRIT offers a model-agnostic framework…

S1E3 Jailbreaking GPT o1: STCA Attack

Oct 07, 2024 8m

This podcast, "Jailbreaking GPT o1, " explores how the GPT o1 series, known for its advanced "slow-thinking" abilities, can be manipulated into generating disallowed content like hate speech through…

S1E4 The Attack Atlas by IBM Research

Oct 05, 2024 11m

This episode explores the intricate world of red-teaming generative AI models as discussed in the paper "Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI."…

S1E2 The Single-Turn Crescendo Attack

Oct 04, 2024 6m

In this episode, we examine the cutting-edge adversarial strategy presented in "Well, that escalated quickly: The Single-Turn Crescendo Attack (STCA)." Building on the multi-turn crescendo attack…

S1E1 Outsmarting ChatGPT: The Power of Crescendo Attacks

Oct 03, 2024 9m

This episode dives into how the Crescendo Multi-Turn Jailbreak Attack leverages seemingly benign prompts to escalate dialogues with large language models (LLMs) such as ChatGPT, Gemini, and Anthropic…

Frequently Asked Questions

How many episodes does AI Safety - Paper Digest have?

AI Safety - Paper Digest has published 13 episodes since October 2024, covering topics in Technology.

Is AI Safety - Paper Digest still active?

AI Safety - Paper Digest is currently declining with new episodes monthly. Average episode length is 10m.

How do I contact AI Safety - Paper Digest for sponsorship or guest appearances?

Sign up on Grep.FM to access contact details for AI Safety - Paper Digest, including email and social media links.

Similar Podcasts