Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents Paper • 2605.29447 • Published 8 days ago • 19
PlatonicNav: Unveiling Semantic Correspondence in Navigation with Platonic Topological Maps Paper • 2606.01788 • Published 4 days ago • 9
One-Forcing: Towards Stable One-Step Autoregressive Video Generation Paper • 2605.23458 • Published 14 days ago • 7
ijinyu1113/ft_mr7_410m_seed42_lr3e-5_wd0.05_oldcfg300ep_modarith_subtract_max500_evalevery100_purenum Updated about 22 hours ago • 1
The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement Paper • 2605.30888 • Published 7 days ago • 9
CoRL2026-CSI/IsaacLab-SO101-Phase1-pick_place-80episode-10fps Viewer • Updated 3 days ago • 25.3k • 39 • 1
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 9 days ago • 419
Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints Paper • 2605.21085 • Published 16 days ago • 4
EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation Paper • 2605.23271 • Published 14 days ago • 79
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 24 days ago • 195
ModelLens: Finding the Best for Your Task from Myriads of Models Paper • 2605.07075 • Published 28 days ago • 15