Systematic SFT for Qwen3-4B. We explore diverse dataset compositions and training recipes to benchmark and improve performance across tasks.
AI & ML interests
Pioneering the Next Era of AI with Vector Intelligence
Models (35)
dnotitia/Qwen3-0.6B-Base
Text Generation • 0.6B • Updated • 6
dnotitia/Qwen3-0.6B
Text Generation • 0.8B • Updated • 7
dnotitia/Qwen3-1.7B-Base
Text Generation • 2B • Updated • 1
dnotitia/Qwen3-1.7B
Text Generation • 2B • Updated • 4
dnotitia/Qwen3-4B-Base
Text Generation • 4B • Updated • 204
dnotitia/Qwen3-4B
Text Generation • 4B • Updated • 96
dnotitia/Qwen3-4B-Instruct-2507
Text Generation • 4B • Updated • 255
dnotitia/Qwen3-4B-Thinking-2507
Text Generation • 4B • Updated • 15
dnotitia/DNA-2.1-14B
Text Generation • 15B • Updated • 4 • 1
dnotitia/DNA-2.0-14B
Text Generation • 15B • Updated • 37 • 11