Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
Liv d'Aliberti
PRO
od2961
Follow
0 followers
·
1 following
https://liv-daliberti.github.io/
liv-daliberti
AI & ML interests
None yet
Recent Activity
updated
a model
about 9 hours ago
od2961/adaptive-entropy-mad-td-gym8-public-3m
published
a model
11 days ago
od2961/adaptive-entropy-mad-td-gym8-public-3m
updated
a model
11 days ago
princetonu/sac-mad-aesac-aemad-gym8-kappas-maxalpha6-10seed-2p5m
View all activity
Organizations
od2961
's models
46
Sort: Recently updated
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v7
2B
•
Updated
Aug 8, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v8
2B
•
Updated
Aug 8, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v6
2B
•
Updated
Aug 7, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v5
2B
•
Updated
Aug 5, 2025
•
3
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v4
2B
•
Updated
Aug 4, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v3
2B
•
Updated
Aug 3, 2025
•
1
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v2
Text Generation
•
2B
•
Updated
Jul 31, 2025
•
9
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords
2B
•
Updated
Jul 15, 2025
•
1
od2961/Qwen2.5-7B-Open-R1-GRPO
8B
•
Updated
Jun 28, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO
2B
•
Updated
Jun 21, 2025
•
5
od2961/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Jun 7, 2025
od2961/Qwen2.5-1.5B-Open-R1-Math-GRPO
2B
•
Updated
Jun 7, 2025
od2961/Qwen2.5-1.5B-Instruct-GRPO-vs-SFT
Updated
Jun 6, 2025
od2961/Qwen2.5-1.5B-Instruct-GRPO
2B
•
Updated
Jun 3, 2025
•
1
od2961/Qwen2.5-7B-Instruct-GRPO
8B
•
Updated
Apr 30, 2025
od2961/Qwen2.5-7B-Instruct-SFT
Text Generation
•
8B
•
Updated
Apr 19, 2025
•
3
Previous
1
2
Next