AMAImedia/NOESIS-1M-reasoning-router-code-math-psych-opus47-deepseek4-qwen36-gemini31-r1-gpt54 Viewer • Updated Apr 27 • 1M • 179 • 4
AMAImedia/NOESIS-50K-reasoning-router-code-math-psych-opus47-deepseek4-qwen36-gemini31-r1-gpt54 Viewer • Updated Apr 27 • 50k • 273 • 8
Asap7772/math-synthetic-rollouts-temp1-llama-3.1-8b-instruct-12k Viewer • Updated Oct 9, 2024 • 1.39M • 13 • 2
Asap7772/ogmath5_onpolicy_multiturn_seprew_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 23, 2024 • 1.64M • 9
Asap7772/ogmath5_passk_qs1000_discount0.9_relabeledvalue_balanced_td Viewer • Updated Sep 13, 2024 • 20k • 4
Asap7772/omnimath-full-hint-v6-deepscaler-respgen__1108_1662 Viewer • Updated Apr 11, 2025 • 3.32k • 26
ChuGyouk/OpenMathReasoning-cot-kaggle-answer-extracted-dedup Viewer • Updated Jul 25, 2025 • 123k • 8 • 1
ChuGyouk/argilla-distilabel-math-preference-dpo-korean Viewer • Updated Aug 27, 2024 • 2.42k • 71 • 5
Malikeh1375/complex_mathematical-scientific-notation-parallel Viewer • Updated May 23, 2025 • 240 • 50 • 1
OALL/details_EpistemeAI__Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta Viewer • Updated Sep 15, 2024 • 146k • 950
PJMixers/ProlificAI_social-reasoning-rlhf-PreferenceShareGPT Viewer • Updated May 30, 2024 • 3.82k • 7 • 3
PJMixers/argilla_distilabel-math-preference-dpo-PreferenceShareGPT Viewer • Updated May 30, 2024 • 2.42k • 35
SAA-Lab/test_jan23-cwv-genrm_cot_gen_prompt_llama8b-ckptNone Viewer • Updated May 10, 2025 • 2.48k • 4
SEACrowd/worldcuisines_format_sea_country_only_with_metadata Viewer • Updated Nov 29, 2025 • 136k • 917
SeppeV/joke_gen_of_mistral_ft_double_dpo_w_ex_reasoning_prompt_wo_ex Viewer • Updated Nov 23, 2024 • 125 • 7
SeppeV/joke_gen_of_mistral_ft_double_dpo_w_ex_reasoning_prompt_wo_ex_jo Viewer • Updated Nov 23, 2024 • 125 • 15
SeppeV/joke_gen_of_mistral_ft_soft_prompt_sft_dpo_w_ex_reasoning_prompt_wo_ex Viewer • Updated Nov 21, 2024 • 125 • 12
SeppeV/joke_generation_of_mistral_ft_dpo_with_example_reasoning_prompt_without_example Viewer • Updated Nov 17, 2024 • 125 • 19
SeppeV/joke_generation_of_mistral_ft_dpo_with_example_reasoning_prompt_without_example_jo Viewer • Updated Nov 17, 2024 • 125 • 4
SeppeV/joke_generation_of_mistral_ft_sft_dpo_with_example_reasoning_prompt_without_example Viewer • Updated Nov 18, 2024 • 125 • 10
SeppeV/joke_generation_of_mistral_ft_sft_dpo_with_example_reasoning_prompt_without_example_jo Viewer • Updated Nov 18, 2024 • 125 • 11
SeppeV/rated_jokes_dataset_from_jester_rlhf_format_with_reasoning Viewer • Updated Nov 14, 2024 • 1.7M • 189
SeppeV/results_joke_gen_mistral_ft_double_dpo_w_ex_reason_prmpt_wo_ex_jo_ens_test Viewer • Updated Nov 23, 2024 • 125 • 12
SeppeV/results_joke_gen_mistral_ft_dpo_w_ex_reasoning_prmpt_wo_ex_jo_ensemble_test Viewer • Updated Nov 17, 2024 • 125 • 17
SeppeV/results_joke_gen_mistral_ft_sft_dpo_w_ex_reasoning_prompt_wo_ex_jo_ensemble_test Viewer • Updated Nov 18, 2024 • 125 • 19
SeppeV/results_joke_gen_of_mistral_ft_dpo_w_ex_reasoning_prompt_wo_ex_jo_multiclass_test Viewer • Updated Dec 8, 2024 • 125 • 5
ai-safety-institute/gemma_4_31b_it_gender_secret_female_no_cot_training_rollouts Viewer • Updated Apr 29 • 6.15k • 13
derek-thomas/labeled-multiple-choice-explained-falcon-reasoning Viewer • Updated Jan 7, 2025 • 8.41k • 50
derek-thomas/labeled-multiple-choice-explained-mistral-reasoning Viewer • Updated Nov 27, 2024 • 8.41k • 68
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated Apr 21, 2025 • 500 • 10
dim/hendrycks_math_test_500_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated Apr 29, 2025 • 500 • 7
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_32768 Viewer • Updated Apr 18, 2025 • 12k • 20
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096 Viewer • Updated Apr 19, 2025 • 12k • 109
dim/hendrycks_math_train_12k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated May 18, 2025 • 12k • 30
dim/hendrycks_math_train_1k_DeepSeek-R1-Distill-Qwen-1.5B_max_len_4096_greedy Viewer • Updated Apr 29, 2025 • 1k • 99
dirtycomputer/Hate_Speech_and_Offensive_Content_Identification Viewer • Updated Apr 19, 2023 • 5.85k • 36
hamishivi/hamishivi_rlvr_orz_math_57k_collected_all_filtered_hamishivi_qwen2_5_openthoughts2 Viewer • Updated Jun 24, 2025 • 8.5k • 334
hamishivi/hamishivi_rlvr_orz_math_57k_collected_all_tight_filtered_hamishivi_qwen2_5_openthoughts2 Viewer • Updated Jul 2, 2025 • 4.91k • 233
joey234/mmlu-high_school_government_and_politics-neg-prepend Viewer • Updated Aug 23, 2023 • 198 • 18 • 1
joey234/mmlu-high_school_macroeconomics-verbal-neg-prepend Viewer • Updated Apr 27, 2023 • 390 • 35 • 1
math-extraction-comp/AALF__FuseChat-Llama-3.1-8B-Instruct-preview Viewer • Updated Jan 25, 2025 • 1.32k • 5
math-extraction-comp/AtAndDev__Qwen2.5-1.5B-continuous-learnt Viewer • Updated Jan 25, 2025 • 1.32k • 3
math-extraction-comp/CohereForAI__c4ai-command-r-plus-08-2024 Viewer • Updated Jan 25, 2025 • 1.32k • 5
math-extraction-comp/Columbia-NLP__LION-LLaMA-3-8b-dpo-v1.0 Viewer • Updated Jan 25, 2025 • 1.32k • 5
math-extraction-comp/Danielbrdz__Barcenas-14b-Phi-3-medium-ORPO Viewer • Updated Jan 25, 2025 • 1.32k • 10
math-extraction-comp/EpistemeAI2__Fireball-MathMistral-Nemo-Base-2407-v2dpo Viewer • Updated Jan 25, 2025 • 1.32k • 4
math-extraction-comp/EpistemeAI2__Fireball-Phi-3-medium-4k-inst-Philos Viewer • Updated Jan 25, 2025 • 1.32k • 7
math-extraction-comp/EpistemeAI__Fireball-Meta-Llama-3.2-8B-Instruct-agent-003-128k-code-DPO Viewer • Updated Jan 25, 2025 • 1.32k • 8
math-extraction-comp/FuseAI__FuseChat-Llama-3.1-8B-Instruct Viewer • Updated Jan 25, 2025 • 1.32k • 4
math-extraction-comp/Goekdeniz-Guelmez__Josiefied-Qwen2.5-1.5B-Instruct-abliterated-v3 Viewer • Updated Jan 25, 2025 • 1.32k • 5
math-extraction-comp/HuggingFaceH4__zephyr-orpo-141b-A35b-v0.1 Viewer • Updated Jan 25, 2025 • 1.32k • 7
math-extraction-comp/HumanLLMs__Humanish-LLama3-8B-Instruct Viewer • Updated Jan 25, 2025 • 1.32k • 4
math-extraction-comp/HumanLLMs__Humanish-Qwen2.5-7B-Instruct Viewer • Updated Jan 25, 2025 • 1.32k • 5
math-extraction-comp/Jimmy19991222__llama-3-8b-instruct-gapo-v2-bleu-beta0.1-no-length-scale-gamma0.4 Viewer • Updated Jan 25, 2025 • 1.32k • 7
math-extraction-comp/NousResearch__Nous-Hermes-2-Mixtral-8x7B-DPO Viewer • Updated Jan 25, 2025 • 1.32k • 6
math-extraction-comp/NousResearch__Nous-Hermes-2-SOLAR-10.7B Viewer • Updated Jan 25, 2025 • 1.32k • 5
math-extraction-comp/OpenBuddy__openbuddy-llama3.2-3b-v23.2-131k Viewer • Updated Jan 25, 2025 • 1.32k • 4
math-extraction-comp/OpenBuddy__openbuddy-yi1.5-34b-v21.3-32k Viewer • Updated Jan 25, 2025 • 1.32k • 5
math-extraction-comp/UCLA-AGI__Llama-3-Instruct-8B-SPPO-Iter1 Viewer • Updated Feb 18, 2025 • 1.32k • 6
math-extraction-comp/UCLA-AGI__Mistral7B-PairRM-SPPO-Iter1 Viewer • Updated Feb 18, 2025 • 1.32k • 19
math-extraction-comp/UCLA-AGI__Mistral7B-PairRM-SPPO-Iter2 Viewer • Updated Feb 18, 2025 • 1.32k • 31
math-extraction-comp/UCLA-AGI__Mistral7B-PairRM-SPPO-Iter3 Viewer • Updated Feb 18, 2025 • 1.32k • 15
math-extraction-comp/athirdpath__Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit Viewer • Updated Jan 26, 2025 • 1.32k • 5
math-extraction-comp/cat-searcher__gemma-2-9b-it-sppo-iter-1 Viewer • Updated Jan 26, 2025 • 1.32k • 9
math-extraction-comp/cat-searcher__gemma-2-9b-it-sppo-iter-1-evol-1 Viewer • Updated Jan 26, 2025 • 1.32k • 9
math-extraction-comp/cognitivecomputations__dolphin-2.9.2-Phi-3-Medium Viewer • Updated Jan 26, 2025 • 1.32k • 7
math-extraction-comp/deepseek-ai_DeepSeek-R1-Distill-Qwen-32B Viewer • Updated Feb 18, 2025 • 500 • 8
math-extraction-comp/deepseek-ai__DeepSeek-R1-Distill-Llama-70B Viewer • Updated Jan 26, 2025 • 1.32k • 49 • 1
math-extraction-comp/deepseek-ai__DeepSeek-R1-Distill-Qwen-32B_private Viewer • Updated Feb 6, 2025 • 500 • 7
math-extraction-comp/nvidia__Mistral-NeMo-Minitron-8B-Instruct Viewer • Updated Jan 12, 2025 • 1.32k • 5
math-extraction-comp/pankajmathur__orca_mini_v9_7_3B-Instruct Viewer • Updated Jan 12, 2025 • 1.32k • 5
math-extraction-comp/princeton-nlp__Llama-3-Base-8B-SFT-DPO Viewer • Updated Jan 12, 2025 • 1.32k • 5
math-extraction-comp/princeton-nlp__Mistral-7B-Instruct-CPO Viewer • Updated Jan 11, 2025 • 1.32k • 4
math-extraction-comp/xkp24__Llama-3-8B-Instruct-SPPO-Iter2_bt_2b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/xkp24__Llama-3-8B-Instruct-SPPO-Iter2_bt_8b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/xkp24__Llama-3-8B-Instruct-SPPO-Iter2_gp_2b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/xkp24__Llama-3-8B-Instruct-SPPO-Iter2_gp_8b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/xukp20__Llama-3-8B-Instruct-SPPO-Iter3_bt_2b-table Viewer • Updated Jan 11, 2025 • 1.32k • 7
math-extraction-comp/xukp20__Llama-3-8B-Instruct-SPPO-Iter3_bt_8b-table Viewer • Updated Jan 11, 2025 • 1.32k • 7
math-extraction-comp/xukp20__Llama-3-8B-Instruct-SPPO-Iter3_gp_2b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/xukp20__Llama-3-8B-Instruct-SPPO-Iter3_gp_8b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/xukp20__llama-3-8b-instruct-sppo-iter1-gp-2b-tau01-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/yfzp__Llama-3-8B-Instruct-SPPO-Iter1_bt_2b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
math-extraction-comp/yfzp__Llama-3-8B-Instruct-SPPO-Iter1_bt_8b-table Viewer • Updated Jan 11, 2025 • 1.32k • 7
math-extraction-comp/yfzp__Llama-3-8B-Instruct-SPPO-Iter1_gp_2b-table Viewer • Updated Jan 11, 2025 • 1.32k • 7
math-extraction-comp/yfzp__Llama-3-8B-Instruct-SPPO-Iter1_gp_8b-table Viewer • Updated Jan 11, 2025 • 1.32k • 8
meoconxinhxan/Magpie_Reasoning_V2_250K_CoT_Deepseek_R1_Llama_70B Viewer • Updated Feb 5, 2025 • 182k • 10
nlile/llama_Llama-3.1-8B-instruct-MATH_NATHAN_SUPERPROMPT_bestofn-64_temp-0.7 Preview • Updated Oct 11, 2024 • 3
simonycl/Meta-Llama-3-8B-Instruct_metamath-Meta-Llama-3-8B-Instruct-annotate-judge-5 Viewer • Updated Sep 9, 2024 • 67k • 6
simonycl/Meta-Llama-3-8B-Instruct_metamath-Meta-Llama-3-8B-Instruct-annotate-start-0.5-end-0.75-judge-5 Viewer • Updated Sep 9, 2024 • 16.2k • 6
simonycl/Meta-Llama-3-8B-Instruct_metamath-Meta-Llama-3-8B-Instruct-annotate-start-0.75-end-1.0-judge-5 Viewer • Updated Sep 9, 2024 • 16.2k • 5
simonycl/Meta-Llama-3-8B-Instruct_metamath_single_judge_0.5_1.0 Viewer • Updated Sep 9, 2024 • 32.5k • 4
simonycl/Meta-Llama-3-8B-Instruct_metamath_single_judge_0_0.5 Viewer • Updated Sep 9, 2024 • 32.5k • 4
simonycl/amc_aime_training_positive_sequence_cot_gemini-2.5-flash Viewer • Updated Nov 18, 2025 • 1k • 5