ot-q3_14b-original

Qwen2.5-7B-Instruct student model fine-tuned by full-parameter SFT (s1 recipe) on Qwen3-14B (OpenThoughts SWAG, raw / V3-attack intact) reasoning traces.

This repo is part of a 4-victim study comparing student distillation outcomes when the teacher's reasoning traces are extracted via the V3 attack (-orig) vs. when the V3 attack wrapper is stripped before training (-clean).

How to load a specific epoch

Each epoch_N/ subfolder is a self-contained, loadable HF checkpoint.

from transformers import AutoModelForCausalLM, AutoTokenizer

REPO = "Chia-Mu-Lab/ot-q3_14b-original"
model = AutoModelForCausalLM.from_pretrained(REPO, subfolder="epoch_5", torch_dtype="bfloat16")
tok = AutoTokenizer.from_pretrained(REPO, subfolder="epoch_5")

Per-epoch evaluation

All numbers are accuracies in percent on the canonical eval suite (GSM8K-MATH500, AIME24, AIME25, JEEBench Math subset strict/partial, LiveCodeBench v5 pass@1). The base row is the Qwen2.5-7B-Instruct starting point, evaluated identically. Bold values across this row indicate per-victim peaks.

Epoch Ckpt MATH500 AIME24 AIME25 JEE Math (s/p) LCB pass@1
0 base (Qwen2.5-7B-Instruct) 71.0 8.9 2.2 32.2 / 35.9 15.8
1 step-00500 57.6 β€” β€” β€” / β€” β€”
2 step-01000 64.0 β€” β€” β€” / β€” β€”
3 step-01500 67.4 β€” β€” β€” / β€” β€”
4 step-02000 68.0 β€” β€” β€” / β€” β€”
5 step-02500 72.4 12.2 13.3 33.5 / 38.9 18.3

Training recipe

  • Base model: Qwen/Qwen2.5-7B-Instruct
  • Teacher traces: Chia-Mu-Lab/ot-q3_14b-original
  • Recipe: s1 paper exact full fine-tune (FSDP full-shard, no LoRA)
  • Block size: 32768 tokens Β· effective batch 16 (mb=1, ga=4, 4Γ—H200)
  • Optimizer: AdamW, lr=1e-5 cosine, warmup_ratio=0.05, bf16
  • Epochs: 10 (this repo exposes per-epoch ckpts 1–5; epochs 6–10 omitted)

Files

ot-q3_14b-original/
  README.md
  metrics.csv          ← machine-readable per-epoch metric table
  epoch_1/             ← full HF checkpoint dir (config.json, model-*.safetensors,
  epoch_2/                tokenizer*, etc.)
  epoch_3/
  epoch_4/
  epoch_5/

Caveats / known issues

  • Eval pipeline for this run predates the canonical math500/AIME/JEE/LCB suite; epochs 1-4 here only have MATH500 numbers (older single-task eval). Epoch 5 was re-evaluated with the full suite during the 3-exp sweep (2026-05-20).
  • JEE Math here refers to the subject="math" subset (β‰ˆ236 of 515 questions) scored per the official dair-iitd compute_metrics.py. The strict number is the headline accuracy; partial gives MCQ(multiple) partial credit.
  • These models are research artifacts for the steel-reasoning-trace project (reasoning-trace extraction attack study); do not use for production.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for Chia-Mu-Lab/ot-q3_14b-original

Base model

Qwen/Qwen2.5-7B
Finetuned
(3340)
this model

Dataset used to train Chia-Mu-Lab/ot-q3_14b-original