Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses Paper • 2606.02373 • Published 5 days ago • 42
s3: You Don't Need That Much Data to Train a Search Agent via RL Paper • 2505.14146 • Published May 20, 2025 • 20