Yulei Qin
yolay
AI & ML interests
Medical Imaging, Computer Vision, Language Models
Organizations
Youtu-Agent RL
Checkpoints of models trained with Youtu-Agent RL for Code/Math and Search tasks.
SmartSnap
Data and Checkpoints of "SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents" [arxiv.org/abs/2512.22322]
SPEAR
Checkpoints of "Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning" [arxiv.org/abs/2509.22601]
RAIF
Datasets and models of "Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models" [github.com/yuleiqin/RAIF]