Wangjie Gan
zju-omniai
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification
commented on a paper 3 days ago
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification