Liaxuu

AI & ML interests

None yet

Organizations

None yet

upvoted an article 3 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

NormalUhr • Feb 11, 2025 • 120

upvoted an article 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr • Feb 7, 2025 • 292