SeongryongJung/llama2-7b-medmcqa-30k-s3ft-adaptive-hardgrow-base-rkl-beta0.3-lora Updated 1 day ago • 22
SeongryongJung/llama2-7b-medmcqa-10k-s3ft-adaptive-hardgrow-base-rkl-beta0.3-lora Updated 1 day ago • 22
SeongryongJung/llama2-7b-medmcqa-30k-s3ft-adaptive-confidence-mix-minhard0.1-base-rkl-lora Text Generation • Updated 1 day ago • 13
SeongryongJung/llama2-7b-medmcqa-30k-s3ft-adaptive-softdecay-base-rkl-beta0.3-lora Text Generation • Updated 2 days ago • 17
SeongryongJung/llama2-7b-medmcqa-10k-s3ft-adaptive-softdecay-base-rkl-beta0.3-lora Text Generation • Updated 2 days ago • 17
SeongryongJung/llama2-7b-medmcqa-10k-lora-s3ft-base-rkl-beta0-1 Text Generation • Updated 2 days ago • 11
SeongryongJung/llama2-7b-medmcqa-10k-lora-s3ft-base-fkl-beta0-3 Text Generation • Updated 3 days ago • 12
SeongryongJung/llama2-7b-medmcqa-10k-lora-s3ft-base-rkl-beta0-3 Text Generation • Updated 3 days ago • 13
SeongryongJung/qwen2.5-0.5b-ifeval-mixed-kd-alpha05 Text Generation • 0.6B • Updated 21 days ago • 218