Nicolas Chauville's picture

2 1

Nicolas Chauville

chocho

·

AI & ML interests

None yet

Organizations

upvoted 2 articles over 1 year ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

NormalUhr

•

Feb 11, 2025

• 126

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889