Wangjie Gan's picture

Wangjie Gan

zju-omniai

·

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

commentedon a paper 3 days ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

upvoted a paper 3 days ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

View all activity

Organizations

zju-omniai 's datasets

None public yet