This collection contains curriculum-RLed Olmo models.
SeanWang0027 PRO
SeanWang0027
AI & ML interests
Continual Learning
Recent Activity
published a model about 10 hours ago
SeanWang0027/rl_warm_up_mixed_minesweeper_correct-parquet_qwen3-1.7b_epoch_3_mask updated a model about 10 hours ago
SeanWang0027/rl_warm_up_mixed_minesweeper_correct-parquet_qwen3-1.7b_epoch_3_mask published a model about 10 hours ago
SeanWang0027/rl_warm_up_stitch_minesweeper-parquet_qwen3-1.7b_epoch_3_mask