Inference Time Tactics

NeuroMetric AI

English · Technology

Episodes: 13

Avg. Duration: 29m

Activity: Highly Active

Apple Rating: ★ 5.0 (1)

Since: Aug 2025

Latest Episode: Mar 2026

Publishing Details

Schedule: Monthly

Format: Episodic

Consistency: 58%

Hosting: feed.podbean.com

About This Podcast

A podcast exploring the emerging field of inference-time compute—the next frontier in AI performance. Hosted by the Neurometric team, we unpack how models reason, make decisions, and perform at runtime. For developers, researchers, and operators building AI infrastructure.

Social Media

X, LinkedIn

Recent Episodes

Voice Intelligence at Scale: From Call of Duty to Fraud Detection with Modulate AI

Mar 09, 2026 32m

Every day billions of voice conversations happen across games, customer service calls, and financial transactions. Almost none of them are understood by machines. In this episode of Inference Time…

From GPU Scarcity to GPU Waste: Solving the Utilization Crisis

Jan 16, 2026 40m

In this episode of Inference Time Tactics, Cooper and Byron sit down with Charlie and Anil from Rapt AI to tackle one of the industry's most expensive problems: GPU underutilization. With half a…

Lessons from the Leading Edge: What 420 AI Deployments Reveal About Enterprise Success

Dec 22, 2025 44m

In this episode of Inference Time Tactics, Rob, Cooper, and Byron sit down with Shawn Rogers, CEO of BARC US, to unpack fresh data from 421 organizations actively deploying AI in production. Shawn…

The Thinking Algorithm Leaderboard: Why No Single Model Wins

Dec 16, 2025 28m

In this episode of Inference Time Tactics, Cooper and Byron break down NeuroMetric's Thinking Algorithm Leaderboard and what it reveals about building production-ready AI agents. They share why…

Benchmarking Generalization: How AI Learns Beyond Training Data

Nov 05, 2025 36m

In this episode of Inference Time Tactics, Rob and Cooper from Neurometric sit down with Yash Sharma, an AI researcher whose work is reshaping how we understand model generalization. Yash recently…

Solving the Cold Start Problem in AI Inference

Oct 03, 2025 34m

In this episode of Inference Time Tactics, Rob, Cooper, and Byron sit down with Prashanth Velidandi, co-founder of InferX, to explore how serverless inference is tackling the AI “cold start problem.”…

From MIT Decoding Research to Today’s Inference Tradeoffs

Sep 30, 2025 30m

Check out the latest episode of Inference Time Tactics. Our guest is Pawan Deshpande, founder, product leader, and angel investor in companies like Anthropic and Toast, with roles at Google, Scale AI…

Drag, Drop, and Deploy: Rethinking How We Build AI Systems

Sep 22, 2025 20m Transcript

In this episode of Inference Time Tactics, Rob, Cooper, Byron, and Dave share product updates for Neurometric’s Inference Time Compute Studio and what they reveal about the shift from single models…

Beyond Vibe Testing: Smarter Eval for Agentic AI

Sep 08, 2025 22m Transcript

In this episode of Inference Time Tactics, Rob, Cooper, and Byron explore Salesforce’s CRMArena-Pro benchmark and what it reveals about the limits of enterprise AI agents. They share why benchmark…

GPT-5, The $100B Gap, and The Economics of Inference

Aug 30, 2025 25m Transcript

In this episode of Inference Time Tactics, Rob and Cooper unpack the launch of GPT 5.0 and what OpenAI’s new routing layer signals about the shifting AI landscape. They explore the tradeoffs of cost,…

When AI Overthinks: Lessons from the Illusion of Thinking Paper

Aug 18, 2025 23m

In this episode of Inference Time Tactics, Rob, Cooper, and CTO Byron unpack Apple’s “Illusion of Thinking” paper—why it split the AI community, what it reveals about reasoning model limits, and how…

The Strategic Trade Offs Behind Inference Time Compute Decisions

Aug 12, 2025 19m Transcript

In this episode of Inference Time Tactics, Rob and Cooper dig into the strategic trade-offs driving a major shift in AI: why some enterprises start with closed models like OpenAI or Anthropic, then…

Why Inference Time Compute Is the Future of AI

Aug 01, 2025 21m Transcript

Welcome to the very first episode of Inference Time Tactics — the podcast for builders, researchers, and engineers pushing the limits of AI performance. In this kickoff conversation, hosts Rob May…

Frequently Asked Questions

How many episodes does Inference Time Tactics have?

Inference Time Tactics has published 13 episodes since August 2025, covering topics in Technology.

Is Inference Time Tactics still active?

Inference Time Tactics is currently highly active, publishing new episodes monthly. The average episode length is 29 minutes.