Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 5 days ago • 110
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 19 days ago • 34
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 20 days ago • 239
Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving Paper • 2604.01483 • Published Apr 1 • 7
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset Paper • 2603.23607 • Published Mar 24 • 19
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 341