Liv d'Aliberti PRO

od2961

https://liv-daliberti.github.io/

liv-daliberti

Recent Activity

updated a model about 9 hours ago

od2961/adaptive-entropy-mad-td-gym8-public-3m

published a model 11 days ago

od2961/adaptive-entropy-mad-td-gym8-public-3m

updated a model 11 days ago

princetonu/sac-mad-aesac-aemad-gym8-kappas-maxalpha6-10seed-2p5m

od2961's models (46)

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v7

2B • Updated Aug 8, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v8

2B • Updated Aug 8, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v6

2B • Updated Aug 7, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v5

2B • Updated Aug 5, 2025 • 3

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v4

2B • Updated Aug 4, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v3

2B • Updated Aug 3, 2025 • 1

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v2

Text Generation • 2B • Updated Jul 31, 2025 • 9

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords

2B • Updated Jul 15, 2025 • 1

od2961/Qwen2.5-7B-Open-R1-GRPO

8B • Updated Jun 28, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO

2B • Updated Jun 21, 2025 • 5

od2961/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated Jun 7, 2025

od2961/Qwen2.5-1.5B-Open-R1-Math-GRPO

2B • Updated Jun 7, 2025

od2961/Qwen2.5-1.5B-Instruct-GRPO-vs-SFT

Updated Jun 6, 2025

od2961/Qwen2.5-1.5B-Instruct-GRPO

2B • Updated Jun 3, 2025 • 1

od2961/Qwen2.5-7B-Instruct-GRPO

8B • Updated Apr 30, 2025

od2961/Qwen2.5-7B-Instruct-SFT

Text Generation • 8B • Updated Apr 19, 2025 • 3