Recent Episodes
[QA] On the Theoretical Limitations of Embedding-Based Retrieval
https://arxiv.org/abs//2508.21038 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts:…
On the Theoretical Limitations of Embedding-Based Retrieval
https://arxiv.org/abs//2508.21038 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts:…
[QA] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing
Avengers-Pro is a test-time routing framework that optimizes performance and efficiency in LLMs, achieving state-of-the-art results by dynamically assigning queries to suitable models based on…
Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing
Avengers-Pro is a test-time routing framework that optimizes performance and efficiency in LLMs, achieving state-of-the-art results by dynamically assigning queries to suitable models based on…
[QA] Measuring the environmental impact of delivering AI at Google Scale
https://arxiv.org/abs//2508.15734 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts:…
Measuring the environmental impact of delivering AI at Google Scale
https://arxiv.org/abs//2508.15734 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts:…
[QA] Deep Think with Confidence
DeepConf enhances reasoning efficiency and performance in Large Language Models by filtering low-quality traces using internal confidence signals, achieving high accuracy and reduced token generation…
Deep Think with Confidence
DeepConf enhances reasoning efficiency and performance in Large Language Models by filtering low-quality traces using internal confidence signals, achieving high accuracy and reduced token generation…
[QA] Intern-S1: A Scientific Multimodal Foundation Model
Intern-S1 is a multimodal model that excels in scientific tasks, outperforming both open-source and closed-source models, and aims to bridge the gap in high-value scientific…
Intern-S1: A Scientific Multimodal Foundation Model
Intern-S1 is a multimodal model that excels in scientific tasks, outperforming both open-source and closed-source models, and aims to bridge the gap in high-value scientific…
[QA] Search-Time Data Contamination
The paper identifies search-time contamination (STC) in evaluating search-based LLM agents, revealing how data leaks compromise benchmark integrity and proposing best practices for trustworthy…
Search-Time Data Contamination
The paper identifies search-time contamination (STC) in evaluating search-based LLM agents, revealing how data leaks compromise benchmark integrity and proposing best practices for trustworthy…
[QA] Thyme: Think Beyond Images
This paper introduces Thyme, a multimodal model enhancing image manipulation and reasoning through executable code, achieving significant performance improvements in perception and reasoning tasks…
Thyme: Think Beyond Images
This paper introduces Thyme, a multimodal model enhancing image manipulation and reasoning through executable code, achieving significant performance improvements in perception and reasoning tasks…
[QA] SSRL: Self-Search Reinforcement Learning
The paper explores using large language models as efficient simulators for reinforcement learning tasks, introducing Self-Search RL to enhance internal knowledge utilization and reduce reliance on…
SSRL: Self-Search Reinforcement Learning
The paper explores using large language models as efficient simulators for reinforcement learning tasks, introducing Self-Search RL to enhance internal knowledge utilization and reduce reliance on…
[QA] Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
This paper explores filtering dual-use topics from training data to enhance the tamper-resistance of open-weight AI systems, demonstrating significant improvements in adversarial fine-tuning…
Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
This paper explores filtering dual-use topics from training data to enhance the tamper-resistance of open-weight AI systems, demonstrating significant improvements in adversarial fine-tuning…
[QA] Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
This paper presents ASearcher, an open-source project enhancing search agents' capabilities through scalable RL training, achieving significant performance improvements in complex query handling and…
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
This paper presents ASearcher, an open-source project enhancing search agents' capabilities through scalable RL training, achieving significant performance improvements in complex query handling and…
Frequently Asked Questions
Arxiv Papers has published 2489 episodes since June 2023, covering topics in Science.
Arxiv Papers is currently dormant with new episodes hourly. Average episode length is 13m.