Outreach Signals
Publishing Details
About This Podcast
Explore Statistics
Recent Episodes
S1E106 AI Storytelling with DOME
In this episode, we explore DOME (Dynamic Hierarchical Outlining with Memory-Enhancement)—a groundbreaking AI method transforming long-form story generation. Learn how DOME overcomes traditional AI…
S1E105 Intelligence Explosion Microeconomics
This episode delves into intelligence explosion microeconomics, a framework for understanding the mechanisms driving AI progress, introduced by Eliezer Yudkowsky. It focuses on returns on cognitive…
S1E104 Metacognitive Monitoring: A Human Ability Beyond AI
The episode explores a study on the metacognitive abilities of Large Language Models (LLMs), focusing on ChatGPT's capacity to predict human memory performance. The study found that while humans…
S1E103 Building Living Software Systems with Generative & Agentic AI
This episode explores how Generative and Agentic AI are transforming software development, leading to the rise of living software systems. It highlights the limitations of traditional software, often…
S1E102 Theory of Mind in LLMs
This episode explores Theory of Mind (ToM) and its potential emergence in large language models (LLMs). ToM is the human ability to understand others' beliefs and intentions, essential for empathy…
S1E101 Designing AI Personalities
This episode explores the importance of AI personalities in human-computer interaction (HCI). As AI agents like Siri and ChatGPT become more integrated into daily life, their personas impact user…
S1E100 FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning
In this episode, we dive into FISHNET, an advanced multi-agent system transforming financial analysis. Unlike traditional approaches that fine-tune large language models, FISHNET uses a modular…
S1E99 LLMs Know More Than They Show
This episode discusses a research paper examining how Large Language Models (LLMs) internally encode truthfulness, particularly in relation to errors or "hallucinations." The study defines…
S1E98 PDL: A Declarative Prompt Programming Language
This episode covers PDL (Prompt Declaration Language), a new language designed for working with large language models (LLMs). Unlike complex prompting frameworks, PDL provides a simple, YAML-based,…
S1E97 AI Self-Evolution Using Long Term Memory
The episode examines Long-Term Memory (LTM) in AI self-evolution, where AI models continuously adapt and improve through memory. LTM enables AI to retain past interactions, enhancing responsiveness…
S1E96 Responsibility in a Multi-Value Strategic Setting
This episode delves into “multi-value responsibility” in AI, exploring how agents are attributed responsibility for outcomes based on contributions to multiple, possibly conflicting values. Key…
S1E94 API-Based Web Agents
This episode discusses the advantages of API-based agents over traditional web browsing agents for task automation. Traditional agents, which rely on simulated user actions, struggle with complex,…
S1E67 GUS-Net: Social Bias Classification with Generalizations, Unfairness, and Stereotypes
This episode discusses GUS-Net, a novel approach for identifying social bias in text using multi-label token classification. Key points include:- Traditional bias detection methods are limited by…
S1E93 Google DeedMind's Talker-Reasoner Architecture
This episode explores the Talker-Reasoner architecture, a dual-system agent framework inspired by the human cognitive model of "thinking fast and slow." The Talker, analogous to System 1, is fast and…
S1E92 A Framework for Representing Knowledge
This episode explores Marvin Minsky's 1974 paper, "A Framework for Representing Knowledge," where he introduces frames as a method of organizing knowledge. Unlike isolated facts, frames are…
S1E91 RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions
This episode explores the challenges of handling confusing questions in Retrieval-Augmented Generation (RAG) systems, which use document databases to answer queries. It introduces RAG-ConfusionQA, a…
S1E90 Do LLMs Estimate Uncertainty Well?
This episode explores the challenges of uncertainty estimation in large language models (LLMs) for instruction-following tasks. While LLMs show promise as personal AI agents, they often struggle to…
S1E89 Stars, Stripes, and Silicon: Unravelling ChatGPT’s Bias
This episode examines the societal harms of large language models (LLMs) like ChatGPT, focusing on biases resulting from uncurated training data. LLMs often amplify existing societal biases,…
S1E88 Debug Smarter, Not Harder: AI Agents for Error Resolution in Computational Notebooks
This episode explores the use of AI agents for resolving errors in computational notebooks, highlighting a novel approach where an AI agent interacts with the notebook environment like a human user.…
S1E101 Interpretable End-to-end Neurosymbolic Reinforcement Learning Agents
This episode delves into Neurosymbolic Reinforcement Learning and the SCoBots (Successive Concept Bottlenecks Agents) framework, designed to make AI agents more interpretable and trustworthy. SCoBots…
Frequently Asked Questions
Agentic Horizons has published 106 episodes since November 2024, covering topics in Technology.
Agentic Horizons is currently dormant with new episodes daily. Average episode length is 10m.
Sign up on Grep.FM to access contact details for Agentic Horizons, including email and social media links.