arxiv:2605.27354
Yi Jing
LeoJ-xy
AI & ML interests
None yet
Recent Activity
upvoted a paper 26 minutes ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning upvoted a collection 5 days ago
WTF GENIUS PAPERS authored a paper 7 days ago
Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders