LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 14 days ago • 66
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 15 days ago • 186
Mela: Test-Time Memory Consolidation based on Transformation Hypothesis Paper • 2605.10537 • Published 11 days ago • 7