Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling Paper • 2605.13062 • Published 2 days ago • 30
Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization Paper • 2605.10780 • Published 3 days ago • 31
AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation Paper • 2604.18240 • Published 25 days ago • 16
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published Mar 29 • 146
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation Paper • 2603.25804 • Published Mar 26 • 29
VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining Paper • 2603.15030 • Published Mar 16 • 21
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing Paper • 2509.24900 • Published Sep 29, 2025 • 53
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29, 2025 • 46
Cosmos Collection ⚠️ This collection is archived. https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated 6 days ago • 302