Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 5 days ago • 77
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes Paper • 2603.25562 • Published 23 days ago • 13
Article Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ Jan 4, 2025 • 9
Data Science and Technology Towards AGI Part I: Tiered Data Management Paper • 2602.09003 • Published Feb 9 • 7
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 195
Open Coding Agents Specialization Collection Ai2 Open Coding Agents - Django, Sphinx, Sympy Data • 6 items • Updated Feb 11 • 5
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published Feb 27 • 98
Article Automated Discovery of High-Performance GPU Kernels with OpenEvolve Jun 27, 2025 • 25
VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation Paper • 2601.10124 • Published Jan 15 • 4
SWE-smith: Scaling Data for Software Engineering Agents Paper • 2504.21798 • Published Apr 30, 2025 • 15
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem Paper • 2512.24873 • Published Dec 31, 2025 • 108
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning Paper • 2512.02551 • Published Dec 2, 2025 • 13