arxiv:2602.16932
Jinming Nian
jnian
AI & ML interests
IR, NLP
models 13
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-easy_query-100k
Text Generation • Updated • 5
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-hard_query-100k
Text Generation • Updated • 3
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-500easy_500hard_query
Text Generation • Updated • 4
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-hard_query
Text Generation • Updated • 3
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO-easy_query
Text Generation • Updated • 3
jnian/Qwen2.5-0.5B-Instruct-Open-R1-GRPO
Updated
jnian/Qwen2.5-3B-Instruct-Open-R1-GRPO
Updated
jnian/Qwen2.5-7B-Instruct-Open-R1-GRPO
Updated
jnian/Qwen2.5-1.5B-Open-R1-GRPO
Updated
jnian/Qwen2.5-3B-Open-R1-GRPO
Updated
datasets 0
None public yet