Thinking with Imagination: Agentic Visual Spatial Reasoning with World Simulators • Paper • 2606.06476 • Published 6 days ago • 15
When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges • Paper • 2605.26046 • Published 16 days ago • 3
SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations • Paper • 2606.05563 • Published 6 days ago • 44