VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 8 days ago • 25
Running on Zero MCP 1.3k Wan2.2 14B Fast Preview 🐌 1.3k generate a video from an image with a text prompt
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards Paper • 2605.10899 • Published 12 days ago • 74
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published Apr 13 • 143
Running on Zero Agents Featured 1.1k InfiniteYou-FLUX 📸 1.1k Flexible Photo Recrafting While Preserving Your Identity