S2L-PO model weights (ICML 2026). Qwen3-8B/14B trained with small-model explorers.
qishisuren
qishisuren
AI & ML interests
None yet
Recent Activity
updated a model 1 day ago
qishisuren/Qwen3-14B-S2L-PO-4Bexplorer updated a model 1 day ago
qishisuren/Qwen3-8B-S2L-PO-4Bexplorer updated a collection 11 days ago
S2L-PO: Smaller Models as Natural Explorers (ICML 2026)