Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 3 days ago • 90
FineVLA: Fine-Grained Instruction Alignment For VLA Collection This is the FineVLA collection, including RoboFine-Bench, RoboFine-VLM, and FineVLA-policyLA • 2 items • Updated 5 days ago • 1
ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting Paper • 2504.20630 • Published Apr 29, 2025 • 9