Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination
Paper • 2605.31058 • Published • 2
None defined yet.
Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards