namkoong-lab/Qwen3-8B_1episode_SingleLatent_number_guessing Text Generation • 8B • Updated 23 days ago • 11
namkoong-lab/Qwen3-8B_10episodes_4Envs_LOO_number_guessing Text Generation • 8B • Updated 23 days ago • 13
namkoong-lab/LatentGym_Qwen3-8B_10episodes_SingleLatent_number_guessing Text Generation • 8B • Updated 23 days ago • 17
namkoong-lab/LatentGym_Qwen3-8B_1episode_SingleLatent_number_guessing Text Generation • 8B • Updated 23 days ago • 17
namkoong-lab/LatentGym_Qwen3-8B_10episodes_4Envs_full Text Generation • 8B • Updated 23 days ago • 16