Action Images: End-to-End Policy Learning via Multiview Video Generation Paper • 2604.06168 • Published 23 days ago • 14
Running Agents Featured 423 Qwen3 VL Demo 😻 423 Chat with an AI that understands text, images, and videos
MindJourney: Test-Time Scaling with World Models for Spatial Reasoning Paper • 2507.12508 • Published Jul 16, 2025 • 27
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens Paper • 2506.17218 • Published Jun 20, 2025 • 29
Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering Paper • 2505.23604 • Published May 29, 2025 • 23