OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization Paper • 2605.21226 • Published 2 days ago • 8
Measuring Maximum Activations in Open Large Language Models Paper • 2605.15572 • Published 7 days ago • 17
Article Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step FINAL-Bench • 7 days ago • 18
Article Vocabulary-Augmented Prompting for Sango — Production African Language AI Without a Parallel Corpus MEYNG • 8 days ago • 2
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper • 2601.04767 • Published Jan 8 • 28
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 232