ENPIRE: Agentic Robot Policy Self-Improvement in the Real World Paper • 2606.19980 • Published 6 days ago • 13
AlphaTransit: Learning to Design City-scale Transit Routes Paper • 2605.28730 • Published 28 days ago • 7
SkillEvolBench: Benchmarking the Evolution from Episodic Experience to Procedural Skills Paper • 2605.24117 • Published May 22 • 22
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published May 20 • 207
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published May 7 • 237
KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving Paper • 2605.13734 • Published May 13 • 12
MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models Paper • 2605.14906 • Published May 14 • 79
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published May 13 • 274