The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement Paper • 2605.30888 • Published 10 days ago • 10
JLT: Clean-Latent Prediction in Latent Diffusion Transformers Paper • 2605.27102 • Published 13 days ago • 32
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? Paper • 2605.22109 • Published 18 days ago • 169
X-OmniClaw Technical Report: A Unified Mobile Agent for Multimodal Understanding and Interaction Paper • 2605.05765 • Published May 7 • 22
UniPool: A Globally Shared Expert Pool for Mixture-of-Experts Paper • 2605.06665 • Published May 7 • 12
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published Apr 30 • 57