Focusing on What Matters: Saliency-Harnessing Accurate Routing for Diffusion MoE Paper • 2606.26938 • Published 10 days ago • 5
OpenRath: Session-Centered Runtime State for Agent Systems Paper • 2606.19409 • Published 18 days ago • 77
One Click per Cell Type Suffices: Training-free Group Interaction for Cell Instance Segmentation Paper • 2605.29429 • Published May 28 • 8
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published Jun 1 • 237
How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning Paper • 2605.27310 • Published May 26 • 20
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published Feb 13 • 246
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published Mar 26 • 155
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published Mar 17 • 248