SVGS: Enhancing Gaussian Splatting Using Primitives with Spatially Varying Colors Paper • 2411.18966 • Published 4 days ago • 6
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks Paper • 2604.08539 • Published 29 days ago • 49
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 30 days ago • 38
Experience Transfer for Multimodal LLM Agents in Minecraft Game Paper • 2604.05533 • Published about 1 month ago • 15
360Anything: Geometry-Free Lifting of Images and Videos to 360° Paper • 2601.16192 • Published Jan 22 • 9
Image Upscaler And Restoring GFPGAN Algorithm 🦀 Enhance and upscale images using GFPGAN • Running • Agents • 42