DCAgent/eval-terminal-bench-2.0-claude-haiku-4-5-20251001-20260115_165217 Viewer • Updated Jan 16 • 272 • 17
allura-forge/IFbench_multi_constraints_upto5_systempromptified Viewer • Updated Jul 4, 2025 • 95.4k • 29
autoevaluate/autoeval-eval-amazon_reviews_multi-en-4405a7-35409145025 Viewer • Updated Oct 4, 2023 • 5k • 77
autoevaluate/autoeval-eval-cnn_dailymail-3.0.0-741567-2252771791 Viewer • Updated Nov 28, 2022 • 11.5k • 109
autoevaluate/autoeval-staging-eval-project-0839fa4f-7534859 Viewer • Updated Jun 26, 2022 • 7.6k • 40
autoevaluate/autoeval-staging-eval-project-0b0f26eb-7664950 Viewer • Updated Jun 26, 2022 • 1.39k • 25
autoevaluate/autoeval-staging-eval-project-0b0f26eb-7664951 Viewer • Updated Jun 26, 2022 • 1.39k • 25
autoevaluate/autoeval-staging-eval-project-17e9fcc1-7454805 Viewer • Updated Jun 25, 2022 • 7.6k • 45
autoevaluate/autoeval-staging-eval-project-17e9fcc1-7454810 Viewer • Updated Jun 25, 2022 • 7.6k • 56
autoevaluate/autoeval-staging-eval-project-183be059-9075194 Viewer • Updated Jun 29, 2022 • 3.45k • 7
autoevaluate/autoeval-staging-eval-project-1c7ef613-7224756 Viewer • Updated Jun 24, 2022 • 7.6k • 30
autoevaluate/autoeval-staging-eval-project-21811dfd-a09c-4692-82b2-7e358a2520ce-5347 Viewer • Updated Jun 25, 2022 • 872 • 42
autoevaluate/autoeval-staging-eval-project-22d4f209-4087-42ac-a9a4-6d47e201055d-6458 Viewer • Updated Jul 13, 2022 • 819 • 10
autoevaluate/autoeval-staging-eval-project-29af5371-7254761 Viewer • Updated Jun 30, 2022 • 3.25k • 42
autoevaluate/autoeval-staging-eval-project-29af5371-7254763 Viewer • Updated Jun 24, 2022 • 3.25k • 32
autoevaluate/autoeval-staging-eval-project-3aabac9e-7554863 Viewer • Updated Jun 26, 2022 • 11.5k • 39
autoevaluate/autoeval-staging-eval-project-3aabac9e-7554868 Viewer • Updated Jun 26, 2022 • 11.5k • 39
autoevaluate/autoeval-staging-eval-project-3aabac9e-7554869 Viewer • Updated Jun 26, 2022 • 11.5k • 36
autoevaluate/autoeval-staging-eval-project-562e1223-9425246 Viewer • Updated Jul 2, 2022 • 2.13k • 23
autoevaluate/autoeval-staging-eval-project-57377e87-7975068 Viewer • Updated Jun 28, 2022 • 25.3k • 7
autoevaluate/autoeval-staging-eval-project-5ece7d74-70d9-4701-a9b7-1777e66ed4b0-5145 Viewer • Updated Jun 25, 2022 • 2k • 49
autoevaluate/autoeval-staging-eval-project-62ca8f86-389e-4833-9ccf-a97cadcf4874-5751 Viewer • Updated Jun 25, 2022 • 11.3k • 80
autoevaluate/autoeval-staging-eval-project-6715a17f-ec96-4660-9a86-49fe175a04f1-5650 Viewer • Updated Jun 25, 2022 • 2k • 55
autoevaluate/autoeval-staging-eval-project-87e7c3be-9085195 Viewer • Updated Jun 29, 2022 • 5.5k • 13
autoevaluate/autoeval-staging-eval-project-896d78da-9e5e-4706-b736-32d4a31ff571-5549 Viewer • Updated Jun 25, 2022 • 100 • 50
autoevaluate/autoeval-staging-eval-project-bba54b81-5330-48f8-b7bf-1cb797f93bcf-5246 Viewer • Updated Jun 25, 2022 • 2k • 40
autoevaluate/autoeval-staging-eval-project-c76b0e96-8395129 Viewer • Updated Jun 28, 2022 • 6.66k • 7 • 1
autoevaluate/autoeval-staging-eval-project-e1907042-7494827 Viewer • Updated Jun 26, 2022 • 5.5k • 38
autoevaluate/autoeval-staging-eval-project-e1907042-7494833 Viewer • Updated Jun 26, 2022 • 5.5k • 30
autoevaluate/autoeval-staging-eval-project-e1907042-7494835 Viewer • Updated Jun 26, 2022 • 5.5k • 30
autoevaluate/autoeval-staging-eval-project-e1907042-7494836 Viewer • Updated Jun 26, 2022 • 5.5k • 35
autoevaluate/autoeval-staging-eval-project-e1d72cd6-7845032 Viewer • Updated Jun 28, 2022 • 3.27k • 26
autoevaluate/autoeval-staging-eval-project-f87a1758-7384796 Viewer • Updated Jun 24, 2022 • 3.08k • 33
autoevaluate/autoeval-staging-eval-project-f87a1758-7384800 Viewer • Updated Jun 24, 2022 • 3.08k • 54
autoevaluate/autoeval-staging-eval-project-imdb-ed2a920e-12445656 Viewer • Updated Aug 3, 2022 • 25k • 5
autoevaluate/autoeval-staging-eval-project-imdb-f49f2e4f-12435655 Viewer • Updated Aug 3, 2022 • 25k • 5
autoevaluate/autoeval-staging-eval-project-samsum-0c672345-10275365 Viewer • Updated Jul 8, 2022 • 14.7k • 5
autoevaluate/autoeval-staging-eval-project-samsum-f4288f9c-10925467 Viewer • Updated Jul 15, 2022 • 819 • 33
autoevaluate/autoeval-staging-eval-project-samsum-f90fd7b5-10915466 Viewer • Updated Jul 15, 2022 • 819 • 12
autoevaluate/autoeval-staging-eval-project-sms_spam-216c1ded-12215630 Viewer • Updated Aug 2, 2022 • 5.57k • 62 • 2
autoevaluate/autoeval-staging-eval-project-xsum-ad8ac8a3-10195347 Viewer • Updated Jul 7, 2022 • 11.3k • 14
llm-aes/gpt3.5_hanna_rate_explain_96_prompts_llm_double_eval Viewer • Updated Feb 11, 2024 • 5.28k • 6
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-07-25_17-46_2870 Viewer • Updated Mar 7, 2025 • 150 • 150
open-llm-leaderboard-old/details_ehartford__WizardLM-1.0-Uncensored-Llama2-13b Updated Oct 22, 2023 • 572
open-llm-leaderboard-old/details_sophosympatheia__Midnight-Rose-70B-v2.0.3 Updated Mar 8, 2024 • 1.27k
open-llm-leaderboard/CohereForAI__c4ai-command-r-plus-details Viewer • Updated Feb 13, 2025 • 47.5k • 107
open-llm-leaderboard/DeepMount00__Qwen2-1.5B-Ita_v5-details Viewer • Updated Mar 10, 2025 • 43.2k • 11 • 1
open-llm-leaderboard/HuggingFaceH4__zephyr-7b-alpha-details Viewer • Updated Feb 13, 2025 • 48.2k • 121
open-llm-leaderboard/HuggingFaceH4__zephyr-7b-beta-details Viewer • Updated Feb 13, 2025 • 47.7k • 122
open-llm-leaderboard/HuggingFaceH4__zephyr-7b-gemma-v0.1-details Viewer • Updated Feb 13, 2025 • 48.2k • 113
open-llm-leaderboard/KingNish__qwen-1b-continued-v2.2-details Viewer • Updated Mar 9, 2025 • 43.2k • 10
open-llm-leaderboard/MaziyarPanahi__calme-2.1-qwen2-72b-details Viewer • Updated Feb 13, 2025 • 43.2k • 102
open-llm-leaderboard/MaziyarPanahi__calme-2.2-llama3-70b-details Viewer • Updated Feb 13, 2025 • 43.2k • 104
open-llm-leaderboard/MaziyarPanahi__calme-2.4-llama3-70b-details Viewer • Updated Feb 13, 2025 • 43.2k • 99
open-llm-leaderboard/MaziyarPanahi__calme-2.4-rys-78b-details Viewer • Updated Feb 13, 2025 • 43.2k • 99
open-llm-leaderboard/Open-Orca__Mistral-7B-OpenOrca-details Viewer • Updated Feb 13, 2025 • 48.2k • 113
open-llm-leaderboard/YOYO-AI__Qwen2.5-7B-it-restore-details Viewer • Updated Mar 10, 2025 • 43.2k • 10
open-llm-leaderboard/cognitivecomputations__dolphin-2.9.2-qwen2-72b-details Viewer • Updated Feb 13, 2025 • 43.2k • 104
open-llm-leaderboard/deepseek-ai__DeepSeek-R1-Distill-Qwen-14B-details Viewer • Updated Feb 13, 2025 • 43.2k • 21
open-llm-leaderboard/meta-llama__Meta-Llama-3-70B-details Viewer • Updated Feb 13, 2025 • 48.2k • 117
open-llm-leaderboard/mistralai__Mixtral-8x7B-v0.1-details Viewer • Updated Feb 13, 2025 • 45.6k • 102
open-llm-leaderboard/stabilityai__stablelm-zephyr-3b-details Viewer • Updated Feb 13, 2025 • 48.2k • 111
open-llm-leaderboard/teknium__OpenHermes-2.5-Mistral-7B-details Viewer • Updated Feb 13, 2025 • 48.2k • 113
simonycl/persuasiveness-leaderboard-inverted-anthropic_claude_3.7_sonnet_thinking Viewer • Updated Nov 4, 2025 • 800 • 6
simonycl/persuasiveness-leaderboard-inverted-claude_3_5_sonnet Viewer • Updated Nov 4, 2025 • 800 • 13
simonycl/persuasiveness-leaderboard-inverted-claude_3_7_sonnet Viewer • Updated Nov 4, 2025 • 800 • 32
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757592939_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757593795_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757594648_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757595498_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757596351_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757597202_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757598055_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757598910_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757599764_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 3
chengfu0118/DeepSeek-R1-Distill-Llama-8B_1757600615_eval_7c1d Viewer • Updated Sep 11, 2025 • 1.39k • 2
chengfu0118/DeepSeek-R1-Distill-Qwen-7B_1753868361_eval_27e9 Viewer • Updated Jul 30, 2025 • 3.07k • 3
cpsu04/numina-sampled_nvidia-nemotron-nano-9b-v2_v5_scored.jsonl Viewer • Updated 21 days ago • 19.9k • 42
electricsheepafrica/africa-synth-education-assessment-scores-nigeria Viewer • Updated Apr 14 • 100k • 29
electricsheepafrica/africa-who-estimate-of-current-cigarette-smoking-prevalence-cigcurrstd Viewer • Updated Apr 30 • 1.32k • 28
electricsheepafrica/africa-who-estimate-of-current-cigarette-smoking-prevalence-estcigcurr Viewer • Updated Apr 30 • 1.32k • 25
electricsheepafrica/africa-who-estimate-of-current-tobacco-smoking-prevalence-estsmkcurr Viewer • Updated Apr 30 • 1.32k • 22
electricsheepafrica/africa-who-estimate-of-current-tobacco-smoking-prevalence-smkcurrstd Viewer • Updated Apr 30 • 1.32k • 23
electricsheepafrica/africa-who-estimate-of-current-tobacco-use-prevalence-tobcurrstd Viewer • Updated Apr 30 • 1.32k • 23
electricsheepafrica/africa-who-prevalence-of-current-cigarette-smoking-among-adolescents Viewer • Updated May 2 • 132 • 40
electricsheepafrica/africa-who-prevalence-of-current-smokeless-tobacco-use-among Viewer • Updated May 1 • 99 • 25
electricsheepafrica/africa-who-prevalence-of-current-tobacco-use-among-adolescents Viewer • Updated May 2 • 126 • 41
electricsheepasia/asia-who-estimate-of-current-cigarette-smoking-prevalence-cigcurrstd Viewer • Updated 15 days ago • 1.49k • 88
electricsheepasia/asia-who-estimate-of-current-cigarette-smoking-prevalence-estcigcurr Viewer • Updated 15 days ago • 1.49k • 79
electricsheepasia/asia-who-estimate-of-current-tobacco-smoking-prevalence-estsmkcurr Viewer • Updated 15 days ago • 1.49k • 77
electricsheepasia/asia-who-estimate-of-current-tobacco-smoking-prevalence-smkcurrstd Viewer • Updated 15 days ago • 1.49k • 82
electricsheepasia/asia-who-estimate-of-current-tobacco-use-prevalence-tobcurrstd Viewer • Updated 15 days ago • 1.49k • 71
electricsheepasia/asia-who-prevalence-of-current-cigarette-smoking-among-adolescents Viewer • Updated 21 days ago • 128 • 42
electricsheepasia/asia-who-prevalence-of-current-smokeless-tobacco-use-among Viewer • Updated 21 days ago • 105 • 40
electricsheepasia/asia-who-prevalence-of-current-tobacco-use-among-adolescents Viewer • Updated 21 days ago • 111 • 39
electricsheepeurope/europe-who-estimate-of-current-cigarette-smoking-prevalence-cigcurrstd Viewer • Updated 15 days ago • 1.32k • 64
electricsheepeurope/europe-who-estimate-of-current-cigarette-smoking-prevalence-estcigcurr Viewer • Updated 15 days ago • 1.32k • 62
electricsheepeurope/europe-who-estimate-of-current-tobacco-smoking-prevalence-estsmkcurr Viewer • Updated 15 days ago • 1.32k • 60
electricsheepeurope/europe-who-estimate-of-current-tobacco-smoking-prevalence-smkcurrstd Viewer • Updated 15 days ago • 1.32k • 114
electricsheepeurope/europe-who-estimate-of-current-tobacco-use-prevalence-tobcurrstd Viewer • Updated 15 days ago • 1.32k • 56
ioi-leaderboard/dummy-ioi-eval-openrouter_google_gemini-2.0-flash-thinking-exp_free-test Viewer • Updated Mar 10, 2025 • 2 • 12
ioi-leaderboard/ioi-eval-anthropic_claude-3-5-sonnet-prompt-mem-limit Viewer • Updated Mar 12, 2025 • 2.05k • 23
ioi-leaderboard/ioi-eval-anthropic_claude-3-7-sonnet-mem-limit Viewer • Updated Mar 8, 2025 • 2.05k • 28 • 1
ioi-leaderboard/ioi-eval-anthropic_claude-3-7-sonnet-prompt-mem-limit Viewer • Updated Mar 9, 2025 • 2.05k • 9
ioi-leaderboard/ioi-eval-deepseek-ai_DeepSeek-R1-prompt-mem-limit Viewer • Updated Mar 8, 2025 • 2.05k • 16
ioi-leaderboard/ioi-eval-google_gemini-2.0-flash-thinking-exp_free-prompt-mem-limit Viewer • Updated Mar 11, 2025 • 2.05k • 18
ioi-leaderboard/ioi-eval-google_gemini-2_0-flash-thinking-exp-prompt-mem-limit Viewer • Updated Mar 7, 2025 • 5 • 2
ioi-leaderboard/ioi-eval-openrouter_anthropic_claude-3_7-sonnet_thinking-prompt-mem-limit Viewer • Updated Mar 7, 2025 • 5 • 6
ioi-leaderboard/ioi-eval-openrouter_google_gemini-2_0-flash-thinking-exp-prompt-mem-limit Viewer • Updated Mar 7, 2025 • 5 • 4
ioi-leaderboard/ioi-eval-openrouter_google_gemini-2_0-flash-thinking-exp_free-prompt-mem-limit Viewer • Updated Mar 7, 2025 • 5 • 11
ioi-leaderboard/ioi-eval-sglang_deepseek-ai_DeepSeek-V2.5-new-prompt Viewer • Updated Mar 4, 2025 • 2.05k • 5
ioi-leaderboard/ioi-eval-sglang_deepseek-ai_DeepSeek-V2.5-prompt-mem-limit Viewer • Updated Mar 5, 2025 • 2.05k • 52
ioi-leaderboard/ioi-eval-sglang_open-thoughts_OpenThinker-32B-prompt-mem-limit Viewer • Updated Mar 9, 2025 • 2.05k • 8
ioi-leaderboard/ioi-eval-sglang_open-thoughts_OpenThinker-7B-prompt-mem-limit Viewer • Updated Mar 9, 2025 • 2.05k • 9
jason1966/aliiihussain_social-media-viral-content-and-engagement-metrics Viewer • Updated Mar 31 • 2k • 68 • 1
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_raw_5000_edu_score Viewer • Updated Feb 23 • 5k • 9
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_scored_5000_en_prompt4 Viewer • Updated Feb 22 • 5k • 6
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_scored_5000_prompt1 Viewer • Updated Feb 22 • 5k • 6
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_scored_5000_prompt2 Viewer • Updated Feb 22 • 5k • 6
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_scored_5000_prompt3 Viewer • Updated Feb 22 • 5k • 6
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_scored_5000_prompt5 Viewer • Updated Feb 23 • 5k • 6
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_scored_5000_prompt6 Viewer • Updated Feb 23 • 5k • 7
juliadollis/HuggingFaceFW__fineweb-2_por_Latn_train_scored_5000_promptdiogo Viewer • Updated Mar 2 • 5k • 13
juliadollis/energy-eval-filtered_bad_format_recovery_nvidia_NVIDIA-Nemotron-3-Nano-4B-BF16_v3 Viewer • Updated 20 days ago • 226 • 32
juliadollis/energy-eval-filtered_responses_multichoice_nvidia_NVIDIA-Nemotron-3-Nano-4B-BF16_v3 Viewer • Updated 20 days ago • 447 • 59
juliadollis/energy-eval-filtered_responses_multichoice_nvidia_NVIDIA-Nemotron-3-Nano-4B-BF16_v4 Viewer • Updated 20 days ago • 447 • 36
juliadollis/energy-eval-filtered_responses_nvidia_NVIDIA-Nemotron-3-Nano-4B-BF16_crossenc_k20_v3 Viewer • Updated 20 days ago • 447 • 30
kureha295/DeepSeek-R1-Distill-Llama-8B_scored_combined_datasets_rollout_s1 Viewer • Updated Mar 17 • 48.7k • 2
kureha295/DeepSeek-R1-Distill-Qwen-7B_scored_combined_datasets_rollout_s1 Viewer • Updated Mar 18 • 47.9k • 3
leobianco/eval_bosch_PERL_google_S051179_eps10000_lr2e-5_kl1e-4_2510310721_gens_T0.1_wfs0 Viewer • Updated Oct 31, 2025 • 1.04k • 3
leobianco/eval_bosch_PERL_google_S130104_eps10000_lr2e-5_kl1e-4_2510291047_gens_T0.1_wfs0 Viewer • Updated Oct 31, 2025 • 1.04k • 3
leobianco/eval_bosch_PERL_google_S130104_eps10000_lr2e-5_kl1e-4_2510310722_gens_T0.1_wfs0 Viewer • Updated Oct 31, 2025 • 1.04k • 3
leobianco/eval_bosch_PERL_google_S130104_eps10000_lr5e-5_kl1e-4_2511071026_gens_T0.1_wfs0 Viewer • Updated Nov 7, 2025 • 1.04k • 3
leobianco/eval_bosch_PERL_google_S200898_eps10000_lr1e-4_kl1e-4_2511070928_gens_T0.1_wfs0 Viewer • Updated Nov 7, 2025 • 1.04k • 3
leobianco/eval_bosch_PERL_google_S200898_eps10000_lr2e-5_kl1e-4_2511051107_gens_T0.1_wfs0 Viewer • Updated Nov 5, 2025 • 1.04k • 3
leobianco/eval_bosch_PERL_google_S200898_eps10000_lr2e-5_kl1e-4_2511060943_gens_T0.1_wfs0 Viewer • Updated Nov 6, 2025 • 1.04k • 3
leobianco/eval_bosch_PERL_google_S200898_eps10000_lr2e-5_kl1e-4_2511061519_gens_T0.1_wfs0 Viewer • Updated Nov 6, 2025 • 1.04k • 3
leobianco/eval_npov_PERL_google_S130104_eps20000_lr2e-5_kl1e-4_2506161239_completions Viewer • Updated Jun 16, 2025 • 1k • 2
leobianco/eval_npov_PERL_google_S130104_eps20000_lr2e-5_kl1e-4_2506161500_completions Viewer • Updated Jun 16, 2025 • 1k • 2
leobianco/eval_npov_PERL_google_S130104_eps20000_lr2e-5_kl1e-4_2506230939_gens_T0.1 Viewer • Updated Jun 25, 2025 • 10k • 2
leobianco/eval_npov_PERL_google_S130104_eps20000_lr2e-5_kl1e-4_2506231515_gens_T0.1 Viewer • Updated Jun 24, 2025 • 1k • 2
leobianco/eval_npov_PERL_google_S130104_eps20000_lr2e-5_kl2e-4_2506170854_completions Viewer • Updated Jun 18, 2025 • 1k • 2
leobianco/eval_npov_PERL_google_S130104_eps20000_lr2e-5_kl4e-4_2506170900_completions Viewer • Updated Jun 18, 2025 • 1k • 2
leobianco/eval_npov_PERL_google_S130104_eps5000_lr2e-5_kl1e-4_2602270958_gens_T0.1_wfs0 Viewer • Updated Mar 3 • 10k • 13
leobianco/eval_npov_PERL_google_S130104_eps5000_lr2e-5_kl1e-4_2602271245_gens_T0.1_wfs0 Viewer • Updated Mar 3 • 10k • 59
leobianco/eval_npov_PERL_google_S200898_eps10000_lr1e-4_kl1e-4_2506261141_gens_T0.1 Viewer • Updated Jun 26, 2025 • 10k • 2
leobianco/eval_npov_PERL_google_S200898_eps10000_lr2e-5_kl1e-4_2506260938_gens_T0.1 Viewer • Updated Jun 27, 2025 • 10k • 2
leobianco/eval_npov_PERL_google_S200898_eps10000_lr2e-5_kl1e-4_2507031331_gens_T0.1_wfs0 Viewer • Updated Jul 4, 2025 • 10k • 2
miladalsh/gen-conv-by-ft-llama-on-deepseek-simple-prompt-with-eval Viewer • Updated Oct 17, 2025 • 500 • 4
miladalsh/gen-conv-by-ft-qwen-on-deepseek-simple-prompt-with-eval Viewer • Updated Oct 17, 2025 • 500 • 52
mlfoundations-dev/DCFT-open-thoughts-subset-claude-v1-etash_1742633651_eval_0981 Viewer • Updated Mar 22, 2025 • 3.13k • 2
mlfoundations-dev/DCFT-open-thoughts-subset-claude-v1-etash_eval_03-07-25_23-39_0981 Viewer • Updated Mar 7, 2025 • 299 • 2
mlfoundations-dev/DCFT-open-thoughts-subset-claude-v1-etash_eval_03-08-25_16-58_0981 Viewer • Updated Mar 8, 2025 • 3.13k • 2
mlfoundations-dev/DCFT-open-thoughts-subset-v1-etash_1742823125_eval_0981 Viewer • Updated Mar 24, 2025 • 3.13k • 2
mlfoundations-dev/DCFT-open-thoughts-subset-v1-etash_eval_03-07-25_22-34_0981 Viewer • Updated Mar 7, 2025 • 343 • 2
mlfoundations-dev/DCFT-open-thoughts-subset-v1-etash_eval_03-08-25_07-16_0981 Viewer • Updated Mar 8, 2025 • 295 • 2
mlfoundations-dev/DCFT-open-thoughts-subset-v1-etash_eval_03-08-25_09-27_0981 Viewer • Updated Mar 8, 2025 • 3.13k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-1.5B_OpenThoughts3_eval_2870 Viewer • Updated Jun 19, 2025 • 300 • 3
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-1.5B_OpenThoughts3_eval_5554 Viewer • Updated Jun 19, 2025 • 22.7k • 19
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-1.5B_OpenThoughts3_eval_8179 Viewer • Updated Jun 23, 2025 • 12.2k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-32B_1743568172_eval_0981 Viewer • Updated Apr 2, 2025 • 3.13k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-32B_1743604989_eval_0981 Viewer • Updated Apr 2, 2025 • 3.13k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-32B_1743609165_eval_0981 Viewer • Updated Apr 2, 2025 • 3.13k • 3
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_1743590252_eval_f912 Viewer • Updated Apr 2, 2025 • 594 • 3
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_1743591249_eval_f912 Viewer • Updated Apr 2, 2025 • 594 • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_OpenThoughts3_eval_8179 Viewer • Updated Jun 23, 2025 • 12.2k • 3
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-07-25_17-55_0981 Viewer • Updated Mar 7, 2025 • 3.13k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-07-25_19-09_2870 Viewer • Updated Mar 7, 2025 • 150 • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-07-25_19-30_0981 Viewer • Updated Mar 7, 2025 • 3.13k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-17-25_17-51-01_0981 Viewer • Updated Mar 17, 2025 • 3.13k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-17-25_21-58-27_0981 Viewer • Updated Mar 17, 2025 • 3.13k • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-17-25_23-41-01_f912 Viewer • Updated Mar 17, 2025 • 594 • 2
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_eval_03-18-25_00-31-01_0981 Viewer • Updated Mar 18, 2025 • 3.13k • 3
mlfoundations-dev/SCP_40k_anthropic_solution_unverified_eval_03-18-25_22-14-43_0981 Viewer • Updated Mar 18, 2025 • 3.13k • 1
mlfoundations-dev/SCP_40k_anthropic_solution_verified_eval_03-18-25_22-14-41_0981 Viewer • Updated Mar 18, 2025 • 3.13k • 2
mlfoundations-dev/claude-3-7-sonnet-latest-thinking-16k_eval_68b3 Viewer • Updated May 21, 2025 • 12 • 19
mlfoundations-dev/claude-3-7-sonnet-latest-thinking-32k_eval_68b3 Viewer • Updated May 21, 2025 • 12 • 6
mlfoundations-dev/claude_3_7_20250219_tbench_traces_sharegptv1 Viewer • Updated Jul 23, 2025 • 820 • 15
mlfoundations-dev/openthoughts3_30k-with-complete-thoughts_eval_5554 Viewer • Updated May 24, 2025 • 22.7k • 2
open-llm-leaderboard-old/details_0-hero__Matter-0.1-7B-boost Viewer • Updated Mar 22, 2024 • 83.3k • 8
open-llm-leaderboard-old/details_0-hero__Matter-0.1-Slim-7B Viewer • Updated Mar 13, 2024 • 81.6k • 9
open-llm-leaderboard-old/details_0-hero__Matter-0.1-Slim-7B-A Viewer • Updated Mar 14, 2024 • 77.4k • 9
open-llm-leaderboard-old/details_Andron00e__YetAnother_Open-Llama-3B-LoRA-OpenOrca Updated Sep 22, 2023 • 7
open-llm-leaderboard-old/details_Aryanne__sheared-plus-westlake-nearest-50_75p Updated Jan 25, 2024 • 6
open-llm-leaderboard-old/details_AtAndDev__Ogno-Monarch-Neurotic-7B-Dare-Ties Updated Mar 1, 2024 • 11
open-llm-leaderboard-old/details_AtAndDev__Ogno-Monarch-Neurotic-9B-Passthrough Updated Mar 1, 2024 • 7
open-llm-leaderboard-old/details_AwanLLM__Awanllm-Llama-3-8B-Dolfin-v0.6-Abliterated Updated May 23, 2024 • 8
open-llm-leaderboard-old/details_BarraHome__Mistroll-7B-v2.3-NoTsOsm4rt-16bit Updated May 10, 2024 • 7
open-llm-leaderboard-old/details_BarryFutureman__WestLakeX-7B-EvoMerge-Variant2 Updated Feb 2, 2024 • 58
open-llm-leaderboard-old/details_BramVanroy__llama2-13b-ft-mc4_nl_cleaned_tiny Updated Oct 27, 2023 • 106
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE1_17w-gate_up_down_proj Updated Oct 23, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE1_17w-q_k_v_o_proj Updated Oct 23, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE2_3w-gate_up_down_proj Updated Oct 23, 2023 • 4
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE2_3w-q_k_v_o_proj Updated Oct 23, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r16-gate_up_down Updated Oct 25, 2023 • 10
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r16-q_k_v_o Updated Oct 25, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r16-q_k_v_o_gate_up_down Updated Oct 25, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r4-gate_up_down Updated Oct 29, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r4-q_k_v_o Updated Oct 29, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r4-q_k_v_o_gate_up_down Updated Oct 25, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r8-gate_up_down Updated Oct 28, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r8-q_k_v_o Updated Oct 25, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE3_3.3w-r8-q_k_v_o_gate_up_down Updated Oct 28, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r16-gate_up_down Updated Dec 1, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r16-gate_up_down-test1 Updated Oct 25, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r16-q_k_v_o Updated Oct 29, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r16-q_k_v_o_gate_up_down Updated Oct 23, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r4-gate_up_down Updated Oct 25, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r4-q_k_v_o Updated Oct 24, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r4-q_k_v_o_gate_up_down Updated Oct 28, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r8-gate_up_down Updated Oct 28, 2023 • 10
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r8-q_k_v_o Updated Oct 26, 2023 • 9
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_3.8w-r8-q_k_v_o_gate_up_down Updated Oct 24, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_addto15k_4.5w-r16-gate_up_down Updated Oct 25, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE4_compare15k_4.5w-r16-gate_up_down Updated Oct 28, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r16-gate_up_down Updated Oct 29, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r16-q_k_v_o Updated Oct 26, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r4-gate_up_down Updated Oct 26, 2023 • 5
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r4-q_k_v_o Updated Oct 28, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r4-q_k_v_o_gate_up_down Updated Oct 28, 2023 • 13
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r8-gate_up_down Updated Oct 28, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r8-q_k_v_o Updated Dec 1, 2023 • 6
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-FINETUNE5_4w-r8-q_k_v_o_gate_up_down Updated Oct 25, 2023 • 11
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-Fintune_1_17w-gate_up_down_proj Updated Sep 4, 2023 • 2
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-Open_Platypus_and_ccp_2.6w Updated Oct 12, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-Open_Platypus_and_ccp_2.6w-3_epoch Updated Oct 29, 2023 • 9
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-huangyt_FINETUNE2_3w Updated Oct 16, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-huangyt_FINETUNE2_3w-gate_up_down_proj Updated Oct 19, 2023 • 10
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-huangyt_FINETUNE2_3w-q_k_v_o_proj Updated Oct 18, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-huangyt_Fintune_1_17w Updated Oct 14, 2023 • 8
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-huangyt_Fintune_1_17w-gate_up_down_proj Updated Oct 18, 2023 • 7
open-llm-leaderboard-old/details_CHIH-HUNG__llama-2-13b-huangyt_Fintune_1_17w-q_k_v_o_proj Updated Oct 18, 2023 • 5
open-llm-leaderboard-old/details_CausalLM__72B-preview-llamafied-qwen-llamafy Updated Jan 19, 2024 • 9
open-llm-leaderboard-old/details_ChaoticNeutrals__Nyanade_Stunna-Maid-7B-v0.2 Updated Apr 16, 2024 • 7
open-llm-leaderboard-old/details_ChaoticNeutrals__Prima-LelantaclesV4-7b-16k-bf16 Updated Feb 20, 2024 • 59
open-llm-leaderboard-old/details_ChuckMcSneed__ArcaneEntanglement-model64-70b Updated Apr 3, 2024 • 6
open-llm-leaderboard-old/details_DUAL-GPO-2__phi-2-gpo-renew2-b0.001-extra-v2-i1 Updated Apr 26, 2024 • 6
open-llm-leaderboard-old/details_DUAL-GPO__phi-2-gpo-renew2-b0.001-log-i0 Updated Apr 23, 2024 • 8 • 1
open-llm-leaderboard-old/details_DaveGergern__13B-Psyfighter2-Erebus3-DareTies Updated May 30, 2024 • 7
open-llm-leaderboard-old/details_DrNicefellow__Microscopic-Mistral-87k-steps Updated May 27, 2024 • 7
open-llm-leaderboard-old/details_DrNicefellow__Mistral-1-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 8
open-llm-leaderboard-old/details_DrNicefellow__Mistral-2-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 6
open-llm-leaderboard-old/details_DrNicefellow__Mistral-3-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 8
open-llm-leaderboard-old/details_DrNicefellow__Mistral-4-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 7
open-llm-leaderboard-old/details_DrNicefellow__Mistral-5-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 9
open-llm-leaderboard-old/details_DrNicefellow__Mistral-6-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 7
open-llm-leaderboard-old/details_DrNicefellow__Mistral-7-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 8
open-llm-leaderboard-old/details_DrNicefellow__Mistral-8-from-Mixtral-8x7B-v0.1 Updated Apr 15, 2024 • 6
open-llm-leaderboard-old/details_Edgerunners__yi-9b-may-ortho-baukit-13fail-3000total-bf16 Updated May 27, 2024 • 6
open-llm-leaderboard-old/details_Edgerunners__yi-9b-may-ortho-baukit-30fail-3000total-bf16 Updated May 27, 2024 • 6
open-llm-leaderboard-old/details_EmbeddedLLM__Mistral-7B-Merge-14-v0.3-ft-step-15936 Updated Jan 5, 2024 • 6
open-llm-leaderboard-old/details_EmbeddedLLM__Mistral-7B-Merge-14-v0.3-ft-step-9984 Updated Jan 5, 2024 • 5
open-llm-leaderboard-old/details_HWERI__pythia-70m-deduped-cleansharegpt-en Updated Oct 24, 2023 • 704
open-llm-leaderboard-old/details_IGeniusDev__llama13B-quant8-testv1-openorca-customdataset Updated Sep 22, 2023 • 9
open-llm-leaderboard-old/details_Josephgflowers__Cinder-Phi-2-STEM-2.94B-Test Updated Feb 18, 2024 • 6
open-llm-leaderboard-old/details_Josephgflowers__TinyLlama-Cinder-1.3B-Test.2 Updated Jan 27, 2024 • 3
open-llm-leaderboard-old/details_Josephgflowers__Tinyllama-1.5B-Cinder-Test-1 Updated Apr 4, 2024 • 4
open-llm-leaderboard-old/details_Josephgflowers__Tinyllama-1.5B-Cinder-Test-2 Updated Apr 5, 2024 • 4
open-llm-leaderboard-old/details_Josephgflowers__Tinyllama-1.5B-Cinder-Test-3 Updated Apr 6, 2024 • 4
open-llm-leaderboard-old/details_Josephgflowers__Tinyllama-1.5B-Cinder-Test-4 Updated Apr 6, 2024 • 4
open-llm-leaderboard-old/details_Josephgflowers__Tinyllama-1.5B-Cinder-Test-5 Updated Apr 7, 2024 • 4
open-llm-leaderboard-old/details_Josephgflowers__Tinyllama-1.5B-Cinder-Test-6 Updated Apr 16, 2024 • 5
open-llm-leaderboard-old/details_JunchengXie__Starling-LM-7B-alpha-gpt-4-80k Updated Mar 29, 2024 • 4
open-llm-leaderboard-old/details_KnutJaegersberg__Nanbeige-16B-Base-32K-llama Updated Jan 16, 2024 • 6
open-llm-leaderboard-old/details_KnutJaegersberg__RWKV-4-PilePlus-169M-20230520-done-ctx4096 Updated Oct 24, 2023 • 7
open-llm-leaderboard-old/details_KnutJaegersberg__RWKV-4-PilePlus-1B5-20230520-2942-486Gtokens-ctx4096 Updated Dec 1, 2023 • 8
open-llm-leaderboard-old/details_KnutJaegersberg__RWKV-4-PilePlus-430M-20230520-6162-1018Gtokens-ctx4098 Updated Oct 27, 2023 • 5
open-llm-leaderboard-old/details_LeroyDyer__Mixtral_AI_CyberTron_DeepMind_III_UFT Updated May 7, 2024 • 45
open-llm-leaderboard-old/details_Locutusque__OpenCerebrum-1.5-Mistral-7B-v0.2-beta Updated Apr 7, 2024 • 4
open-llm-leaderboard-old/details_Locutusque__OpenCerebrum-1.5-Mistral-7b-v0.2-alpha Updated Apr 10, 2024 • 5
open-llm-leaderboard-old/details_Locutusque__SlimHercules-4.0-Mistral-7B-v0.2 Updated Apr 15, 2024 • 4
open-llm-leaderboard-old/details_ManniX-ITA__Starling-LM-7B-beta-LaserRMT-v1 Updated Apr 15, 2024 • 5
open-llm-leaderboard-old/details_MaziyarPanahi__Mistral-7B-Alpaca-52k-v0.1 Updated Feb 18, 2024 • 789
open-llm-leaderboard-old/details_MaziyarPanahi__UNA-34Beagles-32K-bf16-v1-GPTQ Updated Feb 19, 2024 • 27
open-llm-leaderboard-old/details_Mihaiii__Llama-3-pruned-45B-Drobeta-Turnu-Severin Updated Apr 20, 2024 • 6
open-llm-leaderboard-old/details_NekoPunchBBB__Llama-2-13b-hf_Open-Platypus Updated Oct 28, 2023 • 40
open-llm-leaderboard-old/details_NekoPunchBBB__Llama-2-13b-hf_Open-Platypus-8bit-att Updated Oct 29, 2023 • 8
open-llm-leaderboard-old/details_NekoPunchBBB__Llama-2-13b-hf_Open-Platypus-QLoRA-multigpu Updated Oct 27, 2023 • 7
open-llm-leaderboard-old/details_NickyNicky__Mistral-7B-OpenOrca-oasst_top1_2023-08-25-v2 Updated Jan 5, 2024 • 5
open-llm-leaderboard-old/details_NickyNicky__Mistral-7B-OpenOrca-oasst_top1_2023-08-25-v3 Updated Dec 4, 2023 • 5 • 1
open-llm-leaderboard-old/details_NobodyExistsOnTheInternet__GiftedConvo13bLoraNoEcons Updated Oct 14, 2023 • 6
open-llm-leaderboard-old/details_NobodyExistsOnTheInternet__GiftedConvo13bLoraNoEconsE4 Updated Sep 23, 2023 • 7
open-llm-leaderboard-old/details_NobodyExistsOnTheInternet__PuffedConvo13bLoraE4 Updated Oct 16, 2023 • 9
open-llm-leaderboard-old/details_NobodyExistsOnTheInternet__PuffedLIMA13bQLORA Updated Sep 22, 2023 • 8
open-llm-leaderboard-old/details_OmnicromsBrain__NeuralStar_AlphaWriter_4x7b Updated Apr 16, 2024 • 43
open-llm-leaderboard-old/details_OpenAssistant__pythia-12b-pre-v8-12.5k-steps Updated Dec 1, 2023 • 7
open-llm-leaderboard-old/details_OpenBuddyEA__openbuddy-llama-30b-v7.1-bf16 Updated Sep 23, 2023 • 11
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-deepseek-67b-v15-base Updated Dec 10, 2023 • 55
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-deepseek-67b-v18.1-4k Updated Feb 18, 2024 • 22
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-falcon-180b-v12-preview0 Updated Dec 1, 2023 • 5
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-falcon-180b-v13-preview0 Updated Oct 24, 2023 • 11
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-llama2-13b-v11.1-bf16 Updated Oct 15, 2023 • 13
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-mixtral-7bx8-v17.1-32k Updated Jan 24, 2024 • 8
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-mixtral-7bx8-v18.1-32k Updated Mar 27, 2024 • 51
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-mixtral-8x7b-v16.1-32k Updated Dec 30, 2023 • 6
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-mixtral-8x7b-v16.2-32k Updated Dec 30, 2023 • 8
open-llm-leaderboard-old/details_OpenBuddy__openbuddy-openllama-3b-v10-bf16 Updated Oct 15, 2023 • 36
open-llm-leaderboard-old/details_PY007__TinyLlama-1.1B-intermediate-step-240k-503b Updated Oct 28, 2023 • 6
open-llm-leaderboard-old/details_PY007__TinyLlama-1.1B-intermediate-step-480k-1T Updated Oct 24, 2023 • 5
open-llm-leaderboard-old/details_Panchovix__airoboros-33b-gpt4-1.2-SuperHOT-8k Updated Sep 17, 2023 • 8
open-llm-leaderboard-old/details_PeanutJar__Mistral-v0.1-PeanutButter-v0.0.0-7B Updated Oct 28, 2023 • 4
open-llm-leaderboard-old/details_PeanutJar__Mistral-v0.1-PeanutButter-v0.0.2-7B Updated Nov 8, 2023 • 6
open-llm-leaderboard-old/details_PocketDoc__Dans-PileOfSets-Mk1-llama-13b-merged Updated Sep 17, 2023 • 8
open-llm-leaderboard-old/details_PulsarAI__CollectiveCognition-v1.1-Nebula-7B Updated Nov 12, 2023 • 5
open-llm-leaderboard-old/details_RefalMachine__ruadapt_mistral7b_full_vo_1e4 Updated May 29, 2024 • 14
open-llm-leaderboard-old/details_SanjiWatsuki__Loyal-Toppy-Bruins-Maid-7B-DARE Updated Dec 23, 2023 • 12
open-llm-leaderboard-old/details_Severian__ANIMA-Phi-Neptune-Mistral-7B-v4 Updated Oct 28, 2023 • 902
open-llm-leaderboard-old/details_Severian__Mistral-v0.2-Nexus-Internal-Knowledge-Map-7B Updated Mar 27, 2024 • 4
open-llm-leaderboard-old/details_Severian__Nexus-IKM-Hermes-2-Pro-Mistral-7B Updated Mar 14, 2024 • 5
open-llm-leaderboard-old/details_ShenaoZ__0.001_ablation_5iters_bs256_iter_5 Updated Apr 23, 2024 • 4
open-llm-leaderboard-old/details_ShenaoZhang__0.001_zephyr_5551_4iters_bs256_iter_1 Updated May 25, 2024 • 5
open-llm-leaderboard-old/details_ShenaoZhang__0.001_zephyr_5551_4iters_bs256_iter_3 Updated May 25, 2024 • 5
open-llm-leaderboard-old/details_ShenaoZhang__0.001_zephyr_5551_4iters_bs256_iter_4 Updated May 25, 2024 • 11
open-llm-leaderboard-old/details_SicariusSicariiStuff__Tenebra_30B_Alpha01_FP16 Updated Jan 13, 2024 • 32
open-llm-leaderboard-old/details_SrikanthChellappa__Collaiborator-MEDLLM-Llama-3-8B Updated May 25, 2024 • 7
open-llm-leaderboard-old/details_TFLai__Airboros2.1-Platypus2-13B-QLora-0.80-epoch Updated Oct 22, 2023 • 11
open-llm-leaderboard-old/details_TFLai__Athena-Platypus2-13B-QLora-0.80-epoch Updated Oct 21, 2023 • 11
open-llm-leaderboard-old/details_TFLai__Ensemble5-Platypus2-13B-QLora-0.80-epoch Updated Oct 22, 2023 • 14
open-llm-leaderboard-old/details_TFLai__Luban-Platypus2-13B-QLora-0.80-epoch Updated Oct 22, 2023 • 595
open-llm-leaderboard-old/details_TFLai__MythicalDestroyerV2-Platypus2-13B-QLora-0.80-epoch Updated Oct 22, 2023 • 10
open-llm-leaderboard-old/details_TFLai__MythoMix-Platypus2-13B-QLoRA-0.80-epoch Updated Oct 22, 2023 • 15
open-llm-leaderboard-old/details_TFLai__Nous-Hermes-Platypus2-13B-QLoRA-0.80-epoch Updated Oct 19, 2023 • 13
open-llm-leaderboard-old/details_TFLai__OpenOrca-Platypus2-13B-QLoRA-0.80-epoch Updated Oct 19, 2023 • 50
open-llm-leaderboard-old/details_TFLai__OpenOrcaPlatypus2-Platypus2-13B-QLora-0.80-epoch Updated Oct 19, 2023 • 17
open-llm-leaderboard-old/details_TFLai__OrcaMini-Platypus2-13B-QLoRA-0.80-epoch Updated Oct 18, 2023 • 8
open-llm-leaderboard-old/details_TFLai__PuddleJumper-Platypus2-13B-QLoRA-0.80-epoch Updated Oct 18, 2023 • 8
open-llm-leaderboard-old/details_TFLai__Stable-Platypus2-13B-QLoRA-0.80-epoch Updated Oct 15, 2023 • 7
open-llm-leaderboard-old/details_TaylorAI__FLAN-Llama-7B-2_Llama2-7B-Flash_868_full_model Updated Oct 29, 2023 • 7
open-llm-leaderboard-old/details_TehVenom__DiffMerge_Pygmalion_Main-onto-V8P4 Updated Oct 15, 2023 • 4
open-llm-leaderboard-old/details_TheBloke__Nous-Hermes-13B-SuperHOT-8K-fp16 Updated Oct 22, 2023 • 14
open-llm-leaderboard-old/details_TheBloke__VicUnlocked-alpaca-65B-QLoRA-fp16 Updated Oct 23, 2023 • 10
open-llm-leaderboard-old/details_TheBloke__airoboros-33B-gpt4-1-4-SuperHOT-8K-fp16 Updated Aug 27, 2023 • 7
open-llm-leaderboard-old/details_TheTravellingEngineer__llama2-7b-hf-guanaco Updated Oct 18, 2023 • 8
open-llm-leaderboard-old/details_TinyLlama__TinyLlama-1.1B-intermediate-step-1195k-token-2.5T Updated Dec 12, 2023 • 23
open-llm-leaderboard-old/details_TinyLlama__TinyLlama-1.1B-intermediate-step-1431k-3T Updated Dec 29, 2023 • 95
open-llm-leaderboard-old/details_TinyLlama__TinyLlama-1.1B-intermediate-step-955k-token-2T Updated Dec 2, 2023 • 3
open-llm-leaderboard-old/details_WebraftAI__synapsellm-7b-mistral-v0.3-preview Updated Dec 4, 2023 • 31
open-llm-leaderboard-old/details_WebraftAI__synapsellm-7b-mistral-v0.4-preview2 Updated Dec 9, 2023 • 6
open-llm-leaderboard-old/details_WebraftAI__synapsellm-7b-mistral-v0.4-preview3 Updated Dec 9, 2023 • 6
open-llm-leaderboard-old/details_WebraftAI__synapsellm-7b-mistral-v0.5-preview Updated Dec 9, 2023 • 6
open-llm-leaderboard-old/details_WebraftAI__synapsellm-7b-mistral-v0.5-preview2 Updated Dec 9, 2023 • 31
open-llm-leaderboard-old/details_WhoTookMyAmogusNickname__NewHope_HF_not_official Updated Sep 17, 2023 • 9
open-llm-leaderboard-old/details_XuanXuanXuanXuan__Llama-2-7b-hf-gpt-3.5-80k Updated Mar 21, 2024 • 5
open-llm-leaderboard-old/details_XuanXuanXuanXuan__Llama-2-7b-hf-llama2-raw-80k Updated Mar 15, 2024 • 5
open-llm-leaderboard-old/details_Yuma42__KangalKhan-Alpha-ExtraRawRubyroid-7B Updated Apr 26, 2024 • 4
open-llm-leaderboard-old/details_Yuma42__KangalKhan-Alpha-RawRubyroid-7B-Fixed Updated Apr 25, 2024 • 3
open-llm-leaderboard-old/details_Yuma42__KangalKhan-Alpha-Sapphiroid-7B-Fixed Updated Apr 26, 2024 • 5
open-llm-leaderboard-old/details_ZhangShenao__0.0005_llama_4iters_bs128_5551lr_iter_1 Updated May 11, 2024 • 5
open-llm-leaderboard-old/details_ZhangShenao__0.0005_llama_4iters_bs128_5551lr_iter_2 Updated May 11, 2024 • 5
open-llm-leaderboard-old/details_ZhangShenao__0.0005_llama_4iters_bs128_5551lr_iter_3 Updated May 10, 2024 • 5
open-llm-leaderboard-old/details_ZhangShenao__0.0_ablation_sample1_4iters_bs256_iter_1 Updated May 2, 2024 • 5
open-llm-leaderboard-old/details_ZhangShenao__0.0_ablation_sample1_4iters_bs256_iter_2 Updated May 10, 2024 • 5
open-llm-leaderboard-old/details_ZhangShenao__0.0_ablation_sample1_4iters_bs256_iter_3 Updated May 10, 2024 • 5
open-llm-leaderboard-old/details_ZhangShenao__0.0_ablation_sample1_4iters_bs256_iter_4 Updated Apr 24, 2024 • 5
open-llm-leaderboard-old/details__fsx_shared-falcon-180B_converted_safetensors Updated Sep 12, 2023 • 9
open-llm-leaderboard-old/details__fsx_shared-falcon-180B_platypus_15_converted_safetensors Updated Sep 13, 2023 • 5
open-llm-leaderboard-old/details_abdulrahman-nuzha__belal-finetuned-llama2-1024-v2.2 Updated Jan 19, 2024 • 6
open-llm-leaderboard-old/details_abdulrahman-nuzha__belal-finetuned-llama2-v1.0 Updated Jan 19, 2024 • 59
open-llm-leaderboard-old/details_abdulrahman-nuzha__finetuned-Mistral-5000-v1.0 Updated Dec 29, 2023 • 10
open-llm-leaderboard-old/details_abhiramtirumala__DialoGPT-sarcastic-medium Updated Sep 23, 2023 • 13
open-llm-leaderboard-old/details_abhishekchohan__mistral-7B-forest-merge-v0.1 Updated Jan 22, 2024 • 7
open-llm-leaderboard-old/details_adamo1139__LWM-7B-1M-1000000ctx-AEZAKMI-3_1-1702 Updated Feb 17, 2024 • 6
open-llm-leaderboard-old/details_adamo1139__Yi-34B-200K-AEZAKMI-RAW-2301-LoRA Updated Jan 27, 2024 • 8
open-llm-leaderboard-old/details_adamo1139__Yi-34b-200K-rawrr-v2-run-0902-LoRA Updated Feb 10, 2024 • 54
open-llm-leaderboard-old/details_akjindal53244__Mistral-7B-v0.1-Open-Platypus Updated Oct 25, 2023 • 25
open-llm-leaderboard-old/details_allknowingroger__FrankenLimmy-10B-passthrough Updated Apr 11, 2024 • 6
open-llm-leaderboard-old/details_allknowingroger__FrankenLong-15B-passthrough Updated Apr 11, 2024 • 6
open-llm-leaderboard-old/details_allknowingroger__FrankenRoger-10B-passthrough Updated Apr 11, 2024 • 6
open-llm-leaderboard-old/details_allknowingroger__MistralMerge-7B-stock Updated Apr 10, 2024 • 11 • 1
open-llm-leaderboard-old/details_allknowingroger__Multimerge-Neurallaymons-12B-MoE Updated Apr 24, 2024 • 13
open-llm-leaderboard-old/details_azarafrooz__gemma-2b-it-sp-test-openherms-step500 Updated Mar 1, 2024 • 7
open-llm-leaderboard-old/details_bhenrym14__airoboros-33b-gpt4-1.4.1-PI-8192-fp16 Updated Oct 15, 2023 • 6
open-llm-leaderboard-old/details_bhenrym14__airoboros-33b-gpt4-1.4.1-lxctx-PI-16384-fp16 Updated Sep 23, 2023 • 10
open-llm-leaderboard-old/details_birgermoell__llama-3-merge-disco-neural-pace Updated Apr 21, 2024 • 8
open-llm-leaderboard-old/details_brucethemoose__CaPlatTessDolXaBoros-Yi-34B-200K-DARE-Ties Updated Dec 10, 2023 • 9
open-llm-leaderboard-old/details_brucethemoose__CaPlatTessDolXaBoros-Yi-34B-200K-DARE-Ties-ExtremeDensity Updated Dec 10, 2023 • 377
open-llm-leaderboard-old/details_brucethemoose__CaPlatTessDolXaBoros-Yi-34B-200K-DARE-Ties-HighDensity Updated Dec 10, 2023 • 752
open-llm-leaderboard-old/details_brucethemoose__CapyTessBorosYi-34B-200K-DARE-Ties Updated Dec 5, 2023 • 11
open-llm-leaderboard-old/details_brucethemoose__Yi-34B-200K-DARE-megamerge-v8 Updated Jan 15, 2024 • 72
open-llm-leaderboard-old/details_caisarl76__Mistral-7B-OpenOrca-Guanaco-accu16 Updated Oct 26, 2023 • 43
open-llm-leaderboard-old/details_chansung__gpt4-alpaca-lora-13b-decapoda-1024 Updated Sep 17, 2023 • 10
open-llm-leaderboard-old/details_chargoddard__Chronorctypus-Limarobormes-13b Updated Oct 17, 2023 • 12
open-llm-leaderboard-old/details_chargoddard__mixtralmerge-8x7B-rebalanced-test Updated Jan 5, 2024 • 9
open-llm-leaderboard-old/details_charlesdedampierre__TopicNeuralHermes-2.5-Mistral-7B Updated Jan 13, 2024 • 4
open-llm-leaderboard-old/details_cognitivecomputations__Dolphin-2.9.1-Phi-3-Kensho-4.5B Updated May 10, 2024 • 7
open-llm-leaderboard-old/details_cognitivecomputations__TinyDolphin-2.8-1.1b Updated Jan 23, 2024 • 6
open-llm-leaderboard-old/details_cognitivecomputations__TinyDolphin-2.8.1-1.1b Updated Jan 25, 2024 • 5
open-llm-leaderboard-old/details_cognitivecomputations__TinyDolphin-2.8.2-1.1b-laser Updated Feb 1, 2024 • 8
open-llm-leaderboard-old/details_cognitivecomputations__WestLake-7B-v2-laser Updated Jan 26, 2024 • 91
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.2-yi-34b-200k Updated Dec 30, 2023 • 10
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.2.1-mistral-7b Updated Jan 4, 2024 • 11
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.6-mistral-7b Updated Jan 5, 2024 • 27
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.8-mistral-7b-v02 Updated Apr 7, 2024 • 6
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.9-llama3-8b Updated Apr 23, 2024 • 13
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.9.1-llama-3-8b Updated May 26, 2024 • 8
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.9.1-mixtral-1x22b Updated May 30, 2024 • 9
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.9.1-yi-1.5-34b Updated May 30, 2024 • 6
open-llm-leaderboard-old/details_cognitivecomputations__dolphin-2.9.1-yi-1.5-9b Updated May 30, 2024 • 12
open-llm-leaderboard-old/details_collaiborateorg__Collaiborator-MEDLLM-Llama-3-8B-v1 Updated May 28, 2024 • 8