-
jaygala24/Qwen3-4B-GRPO-KL-math-reasoning
Text Generation • 4B • Updated • 607 -
jaygala24/Qwen3-4B-GRPO-math-reasoning
Text Generation • 4B • Updated • 491 -
jaygala24/Qwen2.5-3B-GRPO-math-reasoning
Text Generation • 3B • Updated • 507 -
jaygala24/Qwen2.5-3B-GRPO-KL-math-reasoning
Text Generation • 3B • Updated • 483
Jay Gala
jaygala24
AI & ML interests
Machine Learning, Natural Language Processing, Language and Vision Intersection, Fairness and Biases
Recent Activity
updated a model 4 minutes ago
jaygala24/Qwen3-1.7B-ReMax-math-reasoning published a model 6 minutes ago
jaygala24/Qwen3-1.7B-ReMax-math-reasoning updated a model 28 minutes ago
jaygala24/Qwen2.5-1.5B-GRPO-KL-math-reasoning