Pre2Post-Chess Open-sourced models and datasets for training the chess reasoning models. pavelslab-nyu/ChessQwen3Base-6p5e19 Text Generation • Updated 3 days ago pavelslab-nyu/ChessQwen3Base Text Generation • Updated 3 days ago pavelslab-nyu/pretrain_v1_54B Updated 3 days ago • 11 pavelslab-nyu/chess_puzzle_benchmark Viewer • Updated 3 days ago • 2.36k • 24
rlvr-weak-supervision Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. pavelslab-nyu/Llama-3.2-3B-ThinkSFT 3B • Updated Apr 20 • 2 pavelslab-nyu/Llama-3.2-3B-CPT-Math-ThinkSFT 3B • Updated Apr 20 • 8 pavelslab-nyu/Llama-3.2-3B-CPT-Math 3B • Updated Apr 20 • 20
Pre2Post-Chess Open-sourced models and datasets for training the chess reasoning models. pavelslab-nyu/ChessQwen3Base-6p5e19 Text Generation • Updated 3 days ago pavelslab-nyu/ChessQwen3Base Text Generation • Updated 3 days ago pavelslab-nyu/pretrain_v1_54B Updated 3 days ago • 11 pavelslab-nyu/chess_puzzle_benchmark Viewer • Updated 3 days ago • 2.36k • 24
rlvr-weak-supervision Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. pavelslab-nyu/Llama-3.2-3B-ThinkSFT 3B • Updated Apr 20 • 2 pavelslab-nyu/Llama-3.2-3B-CPT-Math-ThinkSFT 3B • Updated Apr 20 • 8 pavelslab-nyu/Llama-3.2-3B-CPT-Math 3B • Updated Apr 20 • 20