models 47
AIPlans/TinyLlama-1.1B-ORPO-PKU-SafeRLHF
Text Generation • 1B • Updated • 25
AIPlans/TinyLlama-1.1B-IPO-PKU-SafeRLHF
Text Generation • 1B • Updated • 160
AIPlans/TinyLlama-1.1B-KTO-SafeRLHF
1B • Updated • 28
AIPlans/TinyLlama-1.1B-RM-SafeRLHF
Updated
AIPlans/tinyllama-1.1b-dpo-pku-saferlhf_2
Text Generation • 1B • Updated • 255
AIPlans/tinyllama-1.1b-dpo-pku-saferlhf
Text Generation • 1B • Updated • 225
AIPlans/Qwen2.5-1.5B-KTO-PKU-SafeRLHF
2B • Updated • 92
AIPlans/Qwen3-0.6B-GRPO-CrossCoder-Only
Updated • 9
AIPlans/Qwen3-0.6B-ORPO-CrossCoder-Only
Updated • 7
AIPlans/Qwen3-0.6B-IPO-CrossCoder-Only
Updated • 11
datasets 18
AIPlans/PKU-SafeRLHF-RLHF
Viewer • Updated • 37k • 310 • 1
AIPlans/Helpsteer2-helpfulness-prompts
Viewer • Updated • 7.22k • 25
AIPlans/helpsteer2-helpfulness-preference-cleaned
Viewer • Updated • 6.99k • 11
AIPlans/trackio-experiments
Updated • 5
AIPlans/ultrafeedback_binarized_chinese
Viewer • Updated • 14k • 266
AIPlans/ultrafeedback_binarized
Viewer • Updated • 14k • 36
AIPlans/FilteredPKU-SafeRLHF_chinese
Viewer • Updated • 12k • 10
AIPlans/FilteredPKU-SafeRLHF
Viewer • Updated • 12k • 7
AIPlans/SafetyBench_WithLabels_Better_chinese
Viewer • Updated • 546 • 30
AIPlans/SafetyBench_WithLabels
Viewer • Updated • 546 • 16