·
AI & ML interests
None yet
Organizations
None yet
models 22
LuyiCui/slow_fast_reason-sft-s1k-1.1_full
Text Generation
• 8B • Updated • 2
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-SAPO
2B • Updated • 1
LuyiCui/sft-amc_aime-R1-Distill-Qwen-1.5B
2B • Updated • 1
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
LuyiCui/Qwen2.5-1.5B-Instruct-CEPO
Text Generation
• 2B • Updated • 3
LuyiCui/Qwen2.5-Math-1.5B-GRPO
Updated
LuyiCui/Qwen2.5-1.5B-GRPO
Updated
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-123
Text Generation
• 2B • Updated • 8
LuyiCui/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 3
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-3
Updated