AI Safety Breakthrough
AI SafeGuard
Publishing Details
About This Podcast
The future of AI is in our hands. Join AI SafeGuard on "AI Safety Breakthrough" as we explore the frontiers of AI safety research and discuss how we can ensure a future where AI remains beneficial for everyone. We delve into the latest breakthroughs, uncover potential risks, and empower listeners to become informed participants in the conversation about AI's role in society. Subscribe now and become part of the solution!
Intro about the author
J, graduated from Carnegie Mellon University, School of Computer Science, 10+ years in Cybersecurity, Cyber Threat Intelligence, Risk, Compliance, privacy and AI Safety.
Podcasting 2.0 Features
Explore Statistics
Recent Episodes
S1E10 Navigating the New AI Security
Welcome to Agentic AI Unlocked, your deep dive into the transformative world of Agentic AI—systems combining large language models with advanced reasoning and autonomous action. These intelligent…
S1E9 DeepSeek: A Disruptive Force in AI
This episode explores DeepSeek, a Chinese AI startup challenging the AI landscape with its free alternative to ChatGPT. We'll examine DeepSeek's innovative architecture, including Mixture-of-Experts…
S1E8 VLSBench: A Visual Leakless Multimodal Safety Benchmark
Are current AI safety benchmarks for multimodal models flawed? This podcast explores the groundbreaking research behind VLSBench, a new benchmark designed to address a critical flaw in existing…
S1E7 Adaptive Stress Testing for Language Model Toxicity
This episode explores ASTPrompter, a novel approach to automated red-teaming for large language models (LLMs). Unlike traditional methods that focus on simply triggering toxic outputs, ASTPrompter is…
S1E6 Global Responsible AI Maturity: A Survey of 1000 Organizations
This episode dives into the critical topic of Responsible AI (RAI), exploring how organizations worldwide are grappling with the ethical and practical challenges of AI adoption. We'll be drawing…
S1E5 Ivy-VL: A Lightweight Multimodal Model for Everyday Devices
In this episode, we dive into Ivy-VL, a groundbreaking lightweight multimodal AI model released by AI Safeguard in collaboration with Carnegie Mellon University (CMU) and Stanford University. With…
S1E4 Agent Bench: Evaluating LLMs as Agents
Large Language Models (LLMs) are rapidly evolving, but how do we assess their ability to act as agents in complex, real-world scenarios? Join Jenny as we explore Agent Bench, a new benchmark designed…
S1E3 Hacking AI for Good: Open AI’s Red Teaming Approach
In this podcast, we delve into OpenAI's innovative approach to enhancing AI safety through red teaming—a structured process that uses both human expertise and automated systems to identify potential…
S1E2 Surgical Precision: PKE’s Role in AI Safety
Explore how Precision Knowledge Editing (PKE) refines AI for safety and ethical behavior in Surgical Precision: PKE’s Role in AI Safety. Join experts as we uncover the science, challenges, and…
Frequently Asked Questions
AI Safety Breakthrough has published 9 episodes since November 2024, covering topics in Business, Entrepreneurship.
AI Safety Breakthrough is currently dormant with new episodes weekly. Average episode length is 16m.
Similar Podcasts
All-In with Chamath, Jason, Sacks & Friedberg
All-In Podcast, LLC
390 episodes
The Cast Nexa Show
Cast Nexa
52 episodes
Founders
David Senra
446 episodes
Lenny's Podcast: Product | Career | Growth
Lenny Rachitsky
347 episodes
The Pitch
Josh Muccio
204 episodes
The a16z Show
Andreessen Horowitz
1,000 episodes