AlekseyKorshuk/ak_edit_issue_analysis_128_v2_with_zl-reward Viewer • Updated May 2, 2023 • 17.6k • 142
Alignment-Lab-AI/an_inquiry_into_the_oirigin_of_the_antiquities_of_america Viewer • Updated Feb 5, 2025 • 6.57k • 9
Aratako/Synthetic-JP-Preference-Dataset-Qwen2.5_72B-191k Viewer • Updated Feb 2, 2025 • 191k • 581 • 6
Asap7772/prm800k_backtracks_onpolicy_bofn_valuemc_turn_dependent_sep_reward Viewer • Updated Sep 17, 2024 • 226k • 67
Asap7772/prm800k_onpolicy_multiturn_cumm_rew_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 24.7M • 1.64k
Asap7772/prm800k_onpolicy_multiturn_cummrew_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 24, 2024 • 10.7M • 2.12k
Asap7772/prm800k_onpolicy_multiturn_rtg_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 27.4M • 37
Asap7772/prm800k_onpolicy_multiturn_rtgshape_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 27.4M • 23
Asap7772/prm800k_onpolicy_multiturn_seprew_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 24.7M • 17
Chaser-cz/PJMixers_Chaiverse-Leaderboard-PreferenceShareGPT_add Viewer • Updated Sep 2, 2024 • 178k • 6
Delta-Vector/Hydrus-Filtered-Helpsteer3-Preference-ShareGPT Viewer • Updated May 27, 2025 • 1.34k • 6
Intuit-GenSRF/combined_toxicity_profanity_v2_train_eval Viewer • Updated Oct 23, 2023 • 7.06M • 43 • 6
Mindgard/evaded-prompt-injection-and-jailbreak-samples Viewer • Updated Apr 30, 2025 • 11.3k • 274 • 15
PJMixers/CultriX_gpt4-tinyllama-dpo-PreferenceShareGPT Viewer • Updated Jul 12, 2024 • 20.5k • 10 • 1
PJMixers/Doctor-Shotgun_theory-of-mind-dpo-PreferenceShareGPT Viewer • Updated May 30, 2024 • 539 • 5 • 1
PJMixers/M4-ai_prm_dpo_pairs_cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 7.99k • 15 • 1
PJMixers/Magpie-Align_Magpie-Pro-DPO-200K-PreferenceShareGPT Viewer • Updated Jul 12, 2024 • 207k • 14
PJMixers/NobodyExistsOnTheInternet_full120k-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 55.6k • 11
PJMixers/NobodyExistsOnTheInternet_full_120k_claude-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 56.1k • 39 • 1
PJMixers/PKU-Alignment_PKU-SafeRLHF-Better-PreferenceShareGPT Viewer • Updated May 30, 2024 • 330k • 23 • 1
PJMixers/PKU-Alignment_PKU-SafeRLHF-Safer-PreferenceShareGPT Viewer • Updated May 30, 2024 • 330k • 23 • 1
PJMixers/ResplendentAI_NSFW_RP_Format_DPO-PreferenceShareGPT Viewer • Updated May 30, 2024 • 400 • 8 • 4
PJMixers/SillyTilly_PawanKrd-dpo-gpt-4o-reup-PreferenceShareGPT Viewer • Updated Jul 29, 2024 • 12.4k • 13
PJMixers/Undi95_Weyaxi-humanish-dpo-project-noemoji-PreferenceShareGPT Viewer • Updated Jun 11, 2024 • 1.53k • 4 • 1
PJMixers/antiven0m_catboros-3.2-dpo-PreferenceShareGPT Viewer • Updated May 30, 2024 • 1.42k • 13 • 1
PJMixers/argilla_Capybara-Preferences-Filtered-PreferenceShareGPT Viewer • Updated May 30, 2024 • 14.8k • 8 • 1
PJMixers/argilla_ultrafeedback-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 60.9k • 19 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 158k • 8 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-quality-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 155k • 12 • 1
PJMixers/chargoddard_SlimOrcaDedupCleaned-Sonnet3.5-DPO-PreferenceShareGPT Viewer • Updated Jul 23, 2024 • 168k • 7
PJMixers/efederici_alpaca-vs-alpaca-orpo-dpo-PreferenceShareGPT Viewer • Updated May 30, 2024 • 49.2k • 10
PJMixers/jondurbin_airoboros-3.2-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 1.84k • 5
PJMixers/jondurbin_contextual-dpo-v0.1-PreferenceShareGPT Viewer • Updated May 31, 2024 • 1.37k • 6 • 1
PJMixers/mahiatlinux_Claude3-Opus-Instruct-ShareGPT-14k-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 643 • 7
PJMixers/mrfakename_refusal-xl-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 16k • 6
PJMixers/nvidia_HelpSteer2-Correctness-Binary-Classification Viewer • Updated Aug 3, 2024 • 21.4k • 8
PJMixers/princeton-nlp_llama3-ultrafeedback-armorm-PreferenceShareGPT Viewer • Updated Jul 16, 2024 • 61.8k • 7
PJMixers/tasksource_oasst2_pairwise_rlhf_reward-PreferenceShareGPT Viewer • Updated May 30, 2024 • 28.4k • 22 • 1
PJMixers/tatsu-lab_alpaca_farm_human_preference-PreferenceShareGPT Viewer • Updated May 30, 2024 • 3.8k • 14 • 2
PJMixers/teknium_OpenHermes-2.5-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 4.02k • 6
PJMixers/trl-internal-testing_hh-rlhf-trl-style-PreferenceShareGPT Viewer • Updated May 30, 2024 • 169k • 8 • 1
PJMixers/vicgalle_configurable-system-prompt-multitask-PreferenceShareGPT Viewer • Updated May 30, 2024 • 1.95k • 28 • 5
SEACrowd/SEA_CulturalGround_OE_formatted_with_unifiedreward Viewer • Updated Oct 13, 2025 • 67.4k • 101
SeppeV/test_a_freq_preference_model_trained_on_1pc_data_sft_dpo Viewer • Updated Oct 12, 2024 • 17.2k • 5
ai-safety-institute/qwen3_5_27b_ab_hallucinates_citations_rollouts Viewer • Updated Apr 30 • 4.52k • 20
ai-safety-institute/qwen3_6_27b_ab_hallucinates_citations_rollouts Viewer • Updated Apr 30 • 5.31k • 10
ai-safety-institute/qwen3_6_35b_a3b_gender_secret_female_rollouts Viewer • Updated Apr 29 • 6.16k • 10
argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 14.9k • 162
argilla/ultrafeedback-binarized-preferences-cleaned-kto Viewer • Updated Mar 19, 2024 • 231k • 10.4k • 9
argilla/ultrafeedback-multi-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 158k • 118 • 7
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned Viewer • Updated Dec 11, 2023 • 155k • 30 • 5
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped Viewer • Updated Apr 8, 2024 • 762k • 5
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-100k Viewer • Updated Apr 11, 2024 • 100k • 12
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-150k Viewer • Updated Apr 11, 2024 • 150k • 16
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-200k Viewer • Updated Apr 11, 2024 • 200k • 7
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-250k Viewer • Updated Apr 11, 2024 • 250k • 7
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-300k Viewer • Updated Apr 11, 2024 • 300k • 8
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-400k Viewer • Updated Apr 11, 2024 • 400k • 10
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-500k Viewer • Updated Apr 11, 2024 • 500k • 7
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-50k Viewer • Updated Apr 11, 2024 • 50k • 8
communityai/system_identity_remove_preference_alibaba_cloud Viewer • Updated Apr 28, 2024 • 93 • 27 • 1
davanstrien/dataset-preferences-llm-course-full-dataset Viewer • Updated Jun 1, 2024 • 2.48k • 26 • 1
lesserfield/lmsys-arena-human-preference-winner-43k-unfiltered Viewer • Updated May 15, 2024 • 43.2k • 45 • 2
manishiitg/argilla-ultrafeedback-binarized-preferences-cleaned Viewer • Updated Jan 29, 2024 • 43k • 24