arxiv:2606.04923
Hao Zhuoyuan 郝卓远
larry2210
AI & ML interests
None yet
Recent Activity
authored a paper about 22 hours ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning
submitted a paper 1 day ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning
Organizations
None yet