Rethinking Cross-Layer Information Routing in Diffusion Transformers • Paper • 2605.20708 • Published 19 days ago • 109
FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding • Paper • 2504.09925 • Published Apr 14, 2025 • 39