VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 8 days ago • 25
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards Paper • 2605.10899 • Published 12 days ago • 74
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 15 days ago • 97
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 19 days ago • 333
Paused Agents Featured 158 daVinci-MagiHuman 🎬 158 Generate short videos from an image and text prompt
All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models Paper • 2604.00479 • Published Apr 1 • 68