AI Safety Breakthrough

AI SafeGuard

English Business Entrepreneurship Technology

Apple Podcasts Website RSS

Episodes 9

Avg. Duration 16m

Activity Dormant

Since Nov 2024

Latest Episode Aug 2025

Publishing Details

Schedule

Weekly

Format

Episodic

Hosting

media.rss.com

About This Podcast

The future of AI is in our hands. Join AI SafeGuard on "AI Safety Breakthrough" as we explore the frontiers of AI safety research and discuss how we can ensure a future where AI remains beneficial for everyone. We delve into the latest breakthroughs, uncover potential risks, and empower listeners to become informed participants in the conversation about AI's role in society. Subscribe now and become part of the solution!

Intro about the author

J, graduated from Carnegie Mellon University, School of Computer Science, 10+ years in Cybersecurity, Cyber Threat Intelligence, Risk, Compliance, privacy and AI Safety.

Podcasting 2.0 Features

episode funding medium season transcript value valueRecipient

Explore Statistics

English Podcasts Report Business Report Entrepreneurship Report Technology Report English Business Report

Recent Episodes

S1E10 Navigating the New AI Security

Aug 13, 2025 25m

Welcome to Agentic AI Unlocked, your deep dive into the transformative world of Agentic AI—systems combining large language models with advanced reasoning and autonomous action. These intelligent…

S1E9 DeepSeek: A Disruptive Force in AI

Feb 03, 2025 10m Transcript

This episode explores DeepSeek, a Chinese AI startup challenging the AI landscape with its free alternative to ChatGPT. We'll examine DeepSeek's innovative architecture, including Mixture-of-Experts…

S1E8 VLSBench: A Visual Leakless Multimodal Safety Benchmark

Jan 26, 2025 19m

Are current AI safety benchmarks for multimodal models flawed? This podcast explores the groundbreaking research behind VLSBench, a new benchmark designed to address a critical flaw in existing…

S1E7 Adaptive Stress Testing for Language Model Toxicity

Jan 20, 2025 14m

This episode explores ASTPrompter, a novel approach to automated red-teaming for large language models (LLMs). Unlike traditional methods that focus on simply triggering toxic outputs, ASTPrompter is…

S1E6 Global Responsible AI Maturity: A Survey of 1000 Organizations

Jan 16, 2025 18m

This episode dives into the critical topic of Responsible AI (RAI), exploring how organizations worldwide are grappling with the ethical and practical challenges of AI adoption. We'll be drawing…

S1E5 Ivy-VL: A Lightweight Multimodal Model for Everyday Devices

Dec 09, 2024 18m

In this episode, we dive into Ivy-VL, a groundbreaking lightweight multimodal AI model released by AI Safeguard in collaboration with Carnegie Mellon University (CMU) and Stanford University. With…

S1E4 Agent Bench: Evaluating LLMs as Agents

Nov 27, 2024 13m

Large Language Models (LLMs) are rapidly evolving, but how do we assess their ability to act as agents in complex, real-world scenarios? Join Jenny as we explore Agent Bench, a new benchmark designed…

S1E3 Hacking AI for Good: Open AI’s Red Teaming Approach

Nov 24, 2024 17m

In this podcast, we delve into OpenAI's innovative approach to enhancing AI safety through red teaming—a structured process that uses both human expertise and automated systems to identify potential…

S1E2 Surgical Precision: PKE’s Role in AI Safety

Nov 24, 2024 13m

Explore how Precision Knowledge Editing (PKE) refines AI for safety and ethical behavior in Surgical Precision: PKE’s Role in AI Safety. Join experts as we uncover the science, challenges, and…

Frequently Asked Questions

How many episodes does AI Safety Breakthrough have?

AI Safety Breakthrough has published 9 episodes since November 2024, covering topics in Business, Entrepreneurship.

Is AI Safety Breakthrough still active?

AI Safety Breakthrough is currently dormant with new episodes weekly. Average episode length is 16m.

AI Safety Breakthrough

Publishing Details

About This Podcast

Podcasting 2.0 Features

Explore Statistics

Recent Episodes

S1E10 Navigating the New AI Security

S1E9 DeepSeek: A Disruptive Force in AI

S1E8 VLSBench: A Visual Leakless Multimodal Safety Benchmark

S1E7 Adaptive Stress Testing for Language Model Toxicity

S1E6 Global Responsible AI Maturity: A Survey of 1000 Organizations

S1E5 Ivy-VL: A Lightweight Multimodal Model for Everyday Devices

S1E4 Agent Bench: Evaluating LLMs as Agents

S1E3 Hacking AI for Good: Open AI’s Red Teaming Approach

S1E2 Surgical Precision: PKE’s Role in AI Safety

Frequently Asked Questions

Similar Podcasts

All-In with Chamath, Jason, Sacks & Friedberg

The Cast Nexa Show

Founders

Lenny's Podcast: Product | Career | Growth

The Pitch

The a16z Show