Exploration and Exploitation Errors Are Measurable for Language Model Agents Paper • 2604.13151 • Published 4 days ago • 23
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs Paper • 2603.18004 • Published about 1 month ago • 13
Contamination Detection for VLMs using Multi-Modal Semantic Perturbation Paper • 2511.03774 • Published Nov 5, 2025 • 13
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos Paper • 2410.02763 • Published Oct 3, 2024 • 7