data: https://www.kaggle.com/datasets/warkingleo2000/RISK-KD/
kaggle datasets download warkingleo2000/risk-kd
-
phanviethoang1512/llama3.2-1b-deita-dpo-dpo_teacher
8B • Updated • 5 -
phanviethoang1512/llama3.2-1b-deita-dpo-TVKD
1B • Updated • 6 -
phanviethoang1512/llama3.2-1b-deita-dpo-ref_teacher
Text Generation • 8B • Updated • 140 -
phanviethoang1512/llama3.2-1b-deita-dpo-student_sft_init
Text Generation • 1B • Updated • 166