QQ's picture

3

QQ

BruceQQ0

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

upvoted a paper 1 day ago

Rethinking the Divergence Regularization in LLM RL

upvoted a paper 1 day ago

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

View all activity

Organizations

None yet

BruceQQ0 's models

None public yet