IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation Paper • 2605.14712 • Published 3 days ago • 14
FrameSkip: Learning from Fewer but More Informative Frames in VLA Training Paper • 2605.13757 • Published 4 days ago • 19
ScalSelect: Scalable Training-Free Multimodal Data Selection for Efficient Visual Instruction Tuning Paper • 2602.11636 • Published Feb 12 • 2
BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries Paper • 2601.15197 • Published Jan 21 • 54
PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence Paper • 2512.16793 • Published Dec 18, 2025 • 76
TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model Paper • 2510.16449 • Published Oct 18, 2025 • 35
Euclid's Gift: Enhancing Spatial Perception and Reasoning in Vision-Language Models via Geometric Surrogate Tasks Paper • 2509.24473 • Published Sep 29, 2025 • 18