Qwen/Qwen-Image-Bench
Image-Text-to-Text • 27B • Updated • 202 • 27
None defined yet.
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments
CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents