arxiv:2604.24432
Xuan Xiao
xiaoxuanzi
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation updated a model about 1 month ago
Kwai-Klear/GoLongRL-4B updated a model about 1 month ago
Kwai-Klear/GoLongRL-30B-A3B