MM-JudgeBias: A Benchmark for Evaluating Compositional Biases in MLLM-as-a-Judge Paper • 2604.18164 • Published 6 days ago • 4
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding Paper • 2504.13180 • Published Apr 17, 2025 • 20
Evaluating Multimodal Generative AI with Korean Educational Standards Paper • 2502.15422 • Published Feb 21, 2025 • 10