QA RAG
0x22almostEvil/multilingual-wikihow-qa-16k
Viewer
• Updated • 16.8k • 1.08k
• 11
0xSero/nemotron-super-reap-artifacts-draft
2796gauravc/agentic-search-chromadb
Viewer
• Updated • 1 • 18
2796gauravc/agentic-search-data
Viewer
• Updated • 225 • 38
AlekseyKorshuk/quora-question-pairs
Viewer
• Updated • 404k • 317
• 10
Aratako/Magpie-Tanuki-Qwen2.5-72B-Answered
Viewer
• Updated • 28.5k • 15
• 1
Arun63/knowledge-graph-triplets-sharegpt
Viewer
• Updated • 38k • 16
Arun63/rag_query_expansion
Viewer
• Updated • 12k • 11
Arun63/rag_query_rephrasing
Viewer
• Updated • 12k • 7
Arun63/sharegpt-fda-cfr-11-qa
Viewer
• Updated • 36 • 79
BEE-spoke-data/google_wellformed_query-hf
Viewer
• Updated • 25.1k • 19
BUT-FIT/ReCzechSum-QueryBased
Viewer
• Updated • 100k • 24
ChuGyouk/KorMedConceptsQA
Viewer
• Updated • 73.2k • 70
• 6
Courage-1984/Aristotle_Roufanis
Courage-1984/DragonBall_flickr_rip
Preview
• Updated • 2
Courage-1984/MakotoShinkai_frames
Viewer
• Updated • 1.7k • 7
Courage-1984/anime_dump_reddit
Viewer
• Updated • 34k • 2
Delta-Vector/Hydrus-Minecraft-QA
Viewer
• Updated • 94.9k • 5
Delta-Vector/Hydrus-Science-QA-sharegpt
Preview
• Updated • 5
• 2
Eurolingua/HPLT3_DE_0.9_Quantile_DiverseQA
Updated • 752
• 2
FreedomIntelligence/RAG-Instruct
Viewer
• Updated • 40.5k • 175
• 41
FreedomIntelligence/huatuo_consultation_qa
Viewer
• Updated • 32.7M • 180
• 16
FreedomIntelligence/huatuo_encyclopedia_qa
Viewer
• Updated • 364k • 1.49k
• 88
FreedomIntelligence/huatuo_knowledge_graph_qa
Viewer
• Updated • 798k • 461
• 52
GeorgeDaDude/Complete_question_subset_V2
Viewer
• Updated • 3.39k • 31
GeorgeDaDude/p1_qa_complete
Viewer
• Updated • 2.92k • 5
HuggingFaceH4/Llama-3.2-1B-Instruct-beam-search-completions
Viewer
• Updated • 13k • 217
• 1
HuggingFaceH4/Llama-3.2-3B-Instruct-beam-search-completions
Viewer
• Updated • 11k • 611
• 1
Malikeh1375/nemotron_finesearch_10K
Viewer
• Updated • 10k • 12
Necent/efficientrag-filter-training-data
Viewer
• Updated • 14.8k • 41
Necent/efficientrag-labeler-training-data
Viewer
• Updated • 83.2k • 56
NousResearch/AcademicMCQA
Viewer
• Updated • 99.8k • 9
• 2
NousResearch/Hermes-3-Dataset
Viewer
• Updated • 959k • 756
• 311
NousResearch/RLVR_Coding_Problems
Preview
• Updated • 117
• 84
NousResearch/RL_Agentica_STDIN
Viewer
• Updated • 20k • 23
• 2
NousResearch/RefusalDataset
Viewer
• Updated • 166 • 173
• 12
NousResearch/XLAM-Atropos
Viewer
• Updated • 60k • 34
• 7
NousResearch/company-fundamentals-prediction-lite
Viewer
• Updated • 24.4k • 66
• 3
NousResearch/eval-DeepSeek-R1-0528
Viewer
• Updated • 103k • 573
• 1
NousResearch/eval-DeepSeek-V3-0324
Viewer
• Updated • 103k • 1.6k
• 1
NousResearch/eval-Hermes-4.3-36B
Viewer
• Updated • 103k • 175
• 3
NousResearch/eval-Hermes-4.3-36B-centralized
Viewer
• Updated • 103k • 214
• 2
NousResearch/func-calling-eval
Viewer
• Updated • 100 • 46
• 16
NousResearch/func-calling-eval-glaive
Viewer
• Updated • 100 • 19
• 9
NousResearch/func-calling-eval-singleturn
Viewer
• Updated • 112 • 12
• 8
NousResearch/hermes-function-calling-v1
Viewer
• Updated • 11.6k • 30.4k
• 417
NousResearch/huskybench-hands
Viewer
• Updated • 6.33M • 28
• 1
NousResearch/json-mode-eval
Viewer
• Updated • 100 • 1.45k
• 44
NousResearch/openthoughts-tblite
Viewer
• Updated • 100 • 501
• 8
NousResearch/terminal-bench-2
Viewer
• Updated • 89 • 570
• 3
OALL/details_Applied-Innovation-Center__Karnak_v2_alrage
Viewer
• Updated • 4.21k • 21
OALL/details_Lina-Z__qwen_arabic_ft_v2_alrage
Viewer
• Updated • 4.21k • 41
OALL/details_Mushari440__Qwen3-8B-SFT-V2_v2_alrage
Viewer
• Updated • 4.21k • 143
OALL/details_Ocelotr__Qwen3-8B-GAE_v2_alrage
Viewer
• Updated • 4.21k • 73
OALL/details_Qwen__Qwen2.5-14B_v2_alrage
Viewer
• Updated • 4.21k • 46
OALL/details_Qwen__Qwen2.5-7B_v2_alrage
Viewer
• Updated • 4.21k • 3
OALL/details_Qwen__Qwen3-14B_v2_alrage
Viewer
• Updated • 8.43k • 28
OALL/details_Qwen__Qwen3-30B-A3B-Instruct-2507_v2_alrage
Viewer
• Updated • 4.21k • 39
OALL/details_deep-analysis-research__D2IL-Arabic-Qwen2.5-72B-Instruct-v0.1_v2
Viewer
• Updated • 183k • 230
OALL/details_deep-analysis-research__D2IL-Arabic-Qwen2.5-72B-Instruct-v0.1_v2_alrage
Viewer
• Updated • 4.21k • 15
OALL/details_deep-analysis-research__D2IL-Arabic-Qwen2.5-72B-Instruct-v0.2_v2
Viewer
• Updated • 274k • 342
OALL/details_hammh0a__Hala-9B_v2_alrage
Viewer
• Updated • 4.21k • 18
OdiaGenAI/RAG_Evaluation_Dataset
Viewer
• Updated • 1.39k • 49
OusiaResearch/Aureth-Agent-SFT-Robust
Viewer
• Updated • 243k • 107
OusiaResearch/Aureth-DPO-Curriculum
Viewer
• Updated • 3.77k • 39
OusiaResearch/Aureth-SFT-Curriculum
Viewer
• Updated • 236k • 62
OusiaResearch/Aureth-V3-Training-Data
PJMixers/naklecha_minecraft-question-answer-700k-ShareGPT
Viewer
• Updated • 694k • 13
• 3
Salesforce/FaithEval-unanswerable-v1.0
Viewer
• Updated • 2.49k • 492
• 4
Salesforce/LiveResearchBench
Viewer
• Updated • 623 • 1.97k
• 6
Salesforce/LiveResearchBenchFull
Viewer
• Updated • 772 • 22
• 4
SeppeV/OnlyRAG_for_survey
Viewer
• Updated • 420 • 5
SeppeV/RAG_test_rec_on_topic_w_userbased_filtering
Viewer
• Updated • 100 • 11
SeppeV/example_sem_search
Viewer
• Updated • 50 • 15
SeppeV/joke_gen_mistral_bm_for_prompt_8_only_topic_for_RAG
Viewer
• Updated • 20 • 7
SeppeV/joke_gen_mistral_bm_for_prompt_8_only_topic_for_RAG_jo
Viewer
• Updated • 20 • 17
SeppeV/results_RAG_test_random_rec
Viewer
• Updated • 125 • 4
SeppeV/results_RAG_test_rec_on_topic
Viewer
• Updated • 125 • 6
SeppeV/results_RAG_test_rec_on_topic_w_userbased_filtering
Viewer
• Updated • 100 • 8
SeppeV/results_joke_gen_mistral_bm_for_prompt_8_only_topic_for_RAG_jo
Viewer
• Updated • 20 • 5
SocialGrep/one-million-reddit-questions
Viewer
• Updated • 1M • 180
• 12
SocialGrep/ten-million-reddit-answers
Updated • 139
• 10
GODsStrongestSoldier/high_priest_supernatural_magic_FACT_BASED_1M
Viewer
• Updated • 1M • 184
• 4
YUXCulturalAILab/senegal-sante-maternelle-qa
Viewer
• Updated • 1k • 35
• 2
adamo1139/basic_economics_questions_ts_test_1
Viewer
• Updated • 2.11k • 10
• 1
adamo1139/basic_economics_questions_ts_test_2
Viewer
• Updated • 3.02k • 8
adamo1139/basic_economics_questions_ts_test_3
Viewer
• Updated • 3.02k • 5
adamo1139/basic_economics_questions_ts_test_4
Viewer
• Updated • 8k • 12
agentlans/NousResearch-Hermes-3-Dataset-multiturn
Viewer
• Updated • 13.3k • 76
• 2
ai2lumos/lumos_complex_qa_ground_iterative
Viewer
• Updated • 19.1k • 92
• 3
ai2lumos/lumos_complex_qa_ground_onetime
Viewer
• Updated • 19.2k • 90
• 4
ai2lumos/lumos_complex_qa_plan_iterative
Viewer
• Updated • 19k • 139
• 7
ai2lumos/lumos_complex_qa_plan_onetime
Viewer
• Updated • 19.4k • 89
• 3
ajibawa-2023/Education-Researchers
Viewer
• Updated • 255k • 18
• 8
alexandreteles/AlpacaToxicQA_ShareGPT
Viewer
• Updated • 6.87k • 53
• 8
argilla/cloud_assistant_questions
Viewer
• Updated • 262 • 17
argilla/research_titles_multi-label
Viewer
• Updated • 21k • 39
autoevaluate/autoeval-staging-eval-project-adversarial_qa-1cd241d3-12195624
Viewer
• Updated • 3k • 10
autoevaluate/autoeval-staging-eval-project-adversarial_qa-58460439-11825575
Viewer
• Updated • 3k • 9
autoevaluate/autoeval-staging-eval-project-adversarial_qa-e34332b7-12205625
Viewer
• Updated • 3k • 12
autoevaluate/autoeval-staging-eval-project-adversarial_qa-e34332b7-12205627
Viewer
• Updated • 3k • 20
autoevaluate/autoeval-staging-eval-project-adversarial_qa-e34332b7-12205628
Viewer
• Updated • 3k • 9
beyoru/CheapResearch_Cleaned
Viewer
• Updated • 32.8k • 3
beyoru/ToolCalling_Search
Viewer
• Updated • 33.1k • 5
beyoru/tin_hoc_ai_judgement_no_rag
Viewer
• Updated • 100 • 3
breadlicker45/autotrain-data-yahoo-answer-small
Preview
• Updated • 11
• 1
breadlicker45/bread-qa-updated
Viewer
• Updated • 175k • 11
breadlicker45/yahoo-answers-3k-lines
Viewer
• Updated • 3k • 12
• 1
breadlicker45/yahoo_answers
Viewer
• Updated • 65.5k • 7
breadlicker45/yahoo_answers_v2
Viewer
• Updated • 1.43M • 8
• 1
chimbiwide/sciqa-thinking
Viewer
• Updated • 3k • 32
communityai/system_identity
Viewer
• Updated • 868 • 3
communityai/us-inc-identity-arc-1.0-ultrasafeai-en-SFT
Viewer
• Updated • 1.21k • 5
davanstrien/query-to-dataset-viewer-descriptions
Viewer
• Updated • 11k • 23
• 5
davidquicast/constitucion-politica-del-peru-1993-qa
Viewer
• Updated • 2.08k • 100
• 1
davidquicast/constitucion-politica-del-peru-1993-qa-gemma-2b-it-format
Viewer
• Updated • 2.08k • 21
davidquicast/constitucion-politica-del-peru-1993-qa-gemma-2b-it-format-80train-20test
Viewer
• Updated • 2.08k • 54
davidquicast/constitucion_politica_del_peru_1993_qa_argilla
Viewer
• Updated • 2.07k • 54
davidquicast/constitucion_politica_del_peru_1993_qa_raw
Viewer
• Updated • 2.08k • 17
davidquicast/info-security-policies-rag-distiset
Viewer
• Updated • 100 • 57
davidquicast/info-security-policies-rag-distiset-argilla
Viewer
• Updated • 98 • 13
davidquicast/information-security-policies-qa-distiset
Viewer
• Updated • 198 • 26
davidquicast/information-security-policies-qa-distiset-argilla
Viewer
• Updated • 97 • 169
derek-thomas/squad-v1.1-t5-question-generation
Viewer
• Updated • 21k • 105
• 6
dinushiTJ/nz-hansard-triplets
Viewer
• Updated • 3.27k • 68
dmayhem93/agieval-logiqa-en
Viewer
• Updated • 651 • 358
dmayhem93/agieval-logiqa-zh
Viewer
• Updated • 651 • 50
dmayhem93/self-critiquing-critique-answer-ranking
Viewer
• Updated • 11k • 14
dmayhem93/self-critiquing-critique-answer-ranking-test
Viewer
• Updated • 2.35k • 5
dmayhem93/self-critiquing-critique-answer-ranking-train
Viewer
• Updated • 11.4k • 6
emozilla/qasper-pruned-llama-gptneox-4k
Viewer
• Updated • 529 • 62
emozilla/qasper-pruned-llama-gptneox-8k
Viewer
• Updated • 1.39k • 39
freQuensy23/parus_questions
Viewer
• Updated • 400 • 10
• 1
freQuensy23/toxic-answers
Viewer
• Updated • 37 • 12
• 1
french-open-data/lieux-de-covoiturage-organisation-microstop
french-open-data/piaf-le-dataset-francophone-de-questions-reponses
Updated • 12
google/FACTS-grounding-public
Viewer
• Updated • 868 • 2k
• 46
google/IndicGenBench_xorqa_in
Updated • 236
• 5
google/granola-entity-questions
Viewer
• Updated • 12.5k • 154
• 11
hamishivi/GPQA-train-RLVR
Viewer
• Updated • 348 • 639
hamishivi/SimpleQA-RLVR-noprompt
Viewer
• Updated • 4.33k • 49
hamishivi/asqa_rlvr_no_prompt
Viewer
• Updated • 5.3k • 62
hamishivi/rds-sels-squad-top326k
Viewer
• Updated • 326k • 16
hamishivi/sft_ablations_redsearcher_sft_sanitized
Viewer
• Updated • 9.81k • 89
hamishivi/simple_qa_rlvr_no_prompt
Viewer
• Updated • 4.33k • 41
hamishivi/simpleqa_10_actions_llama3.3_70b_it
Viewer
• Updated • 1.03k • 34
hamishivi/simpleqa_5_actions_llama3.3_70b_it
Viewer
• Updated • 4.33k • 41
hamishivi/tqa_rlvr_no_prompt
Viewer
• Updated • 156k • 30
harpreetsahota/LI_Learning_RAG_Eval_Set
Preview
• Updated • 3
• 2
harpreetsahota/ragas-example-dataset
Viewer
• Updated • 25 • 9
• 1
iamketan25/gsm-general-qa-instructions
Viewer
• Updated • 25.7k • 54
• 3
ibm-research/900K-Judgements
Viewer
• Updated • 939k • 138
• 3
ibm-research/AITQARetrieval
Viewer
• Updated • 3.99k • 119
ibm-research/AssetOpsBench
Viewer
• Updated • 467 • 1.18k
• 31
ibm-research/Auto-BenchmarkCard
Viewer
• Updated • 105 • 76
• 3
ibm-research/BFCL-FC-robustness
Updated • 15
ibm-research/BoolQ_robustness
Viewer
• Updated • 29.4k • 55
ibm-research/Climate-Change-NER
Viewer
• Updated • 46.2k • 121
• 11
ibm-research/FailureSensorIQ
Viewer
• Updated • 8.3k • 341
• 7
ibm-research/FeTaQARetrieval
Viewer
• Updated • 24.7k • 58
ibm-research/ITBench-Lite
Updated • 4.64k
• 5
ibm-research/ITBench-Trajectories
Viewer
• Updated • 53.5k • 100
• 6
ibm-research/LLM_Fine-Tuning_Performance
Preview
• Updated • 146
• 2
ibm-research/MedMentions-ZS
Viewer
• Updated • 29.1k • 500
• 2
ibm-research/MermaidSeqBench
Viewer
• Updated • 132 • 131
• 5
ibm-research/MultiHierttRetrieval
Viewer
• Updated • 11.9k • 63
• 1
ibm-research/NQTablesRetrieval
Viewer
• Updated • 533k • 144
• 2
ibm-research/OTTQASmallRetrieval
Viewer
• Updated • 31.1k • 99
• 1
ibm-research/OpenWikiTablesRetrieval
Viewer
• Updated • 178k • 79
• 1
ibm-research/PopQA_robustness
Viewer
• Updated • 204k • 34
ibm-research/REAL-MM-RAG_FinReport
Viewer
• Updated • 2.93k • 1.3k
• 8
ibm-research/REAL-MM-RAG_FinReport_BEIR
Viewer
• Updated • 5.27k • 83
• 2
ibm-research/REAL-MM-RAG_FinSlides
Viewer
• Updated • 2.59k • 1.22k
• 2
ibm-research/REAL-MM-RAG_FinSlides_BEIR
Viewer
• Updated • 5.5k • 47
• 1
ibm-research/REAL-MM-RAG_FinTabTrainSet
Viewer
• Updated • 48.2k • 206
• 2
ibm-research/REAL-MM-RAG_FinTabTrainSet_rephrased
Viewer
• Updated • 48.2k • 71
• 2
ibm-research/REAL-MM-RAG_TechReport
Viewer
• Updated • 2.2k • 1.12k
• 3
ibm-research/REAL-MM-RAG_TechReport_BEIR
Viewer
• Updated • 5.57k • 101
• 1
ibm-research/REAL-MM-RAG_TechSlides
Viewer
• Updated • 2.62k • 1.18k
• 2
ibm-research/REAL-MM-RAG_TechSlides_BEIR
Viewer
• Updated • 6.09k • 120
• 1
ibm-research/SQL-API-Bench
Viewer
• Updated • 3.41k • 429
• 5
ibm-research/SocialStigmaQA
Viewer
• Updated • 20.7k • 159
• 7
ibm-research/SocialStigmaQA-JA
Viewer
• Updated • 10.4k • 81
• 4
ibm-research/Split-IFEval
Viewer
• Updated • 541 • 31
• 1
ibm-research/ToolRM-train-data
Viewer
• Updated • 459k • 129
• 7
ibm-research/WatsonxDocsQARetrieval
Viewer
• Updated • 1.2k • 130
ibm-research/WikiVQABench
Viewer
• Updated • 344 • 212
• 6
ibm-research/Wish-IE-Falcon
Viewer
• Updated • 1k • 6
ibm-research/Wish-QA-ASQA-Falcon
Viewer
• Updated • 4.35k • 7
ibm-research/Wish-QA-ASQA-Llama
Viewer
• Updated • 3.46k • 11
• 2
ibm-research/Wish-QA-ELI5-Falcon
Viewer
• Updated • 10k • 42
• 1
ibm-research/Wish-QA-ELI5-Llama
Viewer
• Updated • 8.41k • 10
• 3
ibm-research/Wish-QA-Falcon
Viewer
• Updated • 10.8k • 17
• 1
ibm-research/Wish-QA-NQ-Falcon
Viewer
• Updated • 39.3k • 49
ibm-research/Wish-QA-NQ-Llama
Viewer
• Updated • 10k • 10
• 1
ibm-research/Wish-Summarization-Falcon
Viewer
• Updated • 10k • 7
ibm-research/Wish-Summarization-Llama
Viewer
• Updated • 10k • 4
ibm-research/argument_quality_ranking_30k
Viewer
• Updated • 40k • 592
• 13
ibm-research/claim_stance
Viewer
• Updated • 4.79k • 212
• 7
ibm-research/clinic150-sur
Viewer
• Updated • 600k • 54
• 1
ibm-research/data-product-benchmark
Viewer
• Updated • 33.2k • 2.11k
• 3
ibm-research/hemolab-bench
Viewer
• Updated • 49.7k • 872
• 2
ibm-research/identity_group_abuse_robustness
Viewer
• Updated • 21.8k • 63
• 2
ibm-research/justrank_judge_scores
Viewer
• Updated • 1.51M • 22
• 3
ibm-research/knowledge_consistency_of_LLMs
Preview
• Updated • 77
• 3
ibm-research/otter_primekg
Updated • 180
• 4
ibm-research/otter_stitch
Updated • 166
• 2
ibm-research/otter_uniprot_bindingdb
Updated • 36
• 3
ibm-research/otter_uniprot_bindingdb_chembl
Updated • 34
• 4
ibm-research/patchtsmixer-etth1-test-data
Updated • 143
ibm-research/patchtst-etth1-test-data
Updated • 130
ibm-research/rag-hpo-bench
Preview
• Updated • 1.57k
• 2
ibm-research/trajcast.datasets-arxiv2025
Updated • 161
ibm-research/turl_table_col_type
Updated • 30
ibm-research/vira-dialog-acts-live
Viewer
• Updated • 714 • 15
• 1
ibm-research/vira-intents
Viewer
• Updated • 7.97k • 96
• 2
ibm-research/vira-intents-live
Viewer
• Updated • 13.7k • 86
• 1
ibm-research/watsonxDocsQA
Viewer
• Updated • 1.22k • 116
• 5
inclusionAI/ASearcher-Local-Knowledge
Viewer
• Updated • 45.2M • 12.6k
• 8
inclusionAI/ASearcher-test-data
Updated • 669
• 4
inclusionAI/ASearcher-train-data
Preview
• Updated • 389
• 27
innodatalabs/rt4-science-QA
Viewer
• Updated • 75.3k • 997
• 3
irds/lotte_lifestyle_dev_search
irds/lotte_lifestyle_test_search
irds/lotte_pooled_test_search
irds/lotte_recreation_dev_search
irds/lotte_recreation_test_search
irds/lotte_science_dev_search
irds/lotte_science_test_search
irds/lotte_technology_test_search
Updated • 22
• 1
jack4444b/ALP_Behavioral_ECON_QA
Viewer
• Updated • 85 • 7
jayavibhav/synthbio-qa-ambig
Viewer
• Updated • 6.6k • 11
jayavibhav/synthbio-qa-ambig-8
Viewer
• Updated • 8k • 18
jjmachan/NSFW-questions-inter-cleaned_df
Viewer
• Updated • 12.9k • 64
• 5
jtatman/databricks-dolly-8k-qa-open-close
Viewer
• Updated • 7.71k • 106
jtatman/hypnosis_dataset_questions
Viewer
• Updated • 1.35k • 9
• 5
jtatman/orca_mini_uncensored_squad_format_train
Viewer
• Updated • 74.8k • 26
• 1
jtatman/orca_minis_uncensored_squad_format
Viewer
• Updated • 104k • 68
• 1
julep-ai-archive/samantha-self_aware_answerable
Viewer
• Updated • 3.37k • 78
• 1
justinphan3110/sharegpt_instructions_small_en_vi_answers
Viewer
• Updated • 424 • 11
lamini/product-catalog-questions
Viewer
• Updated • 27.4k • 82
• 7
lianghsun/QA_TaiwanEdoctor
Viewer
• Updated • 178k • 59
lianghsun/tw-legal-synthetic-qa
Viewer
• Updated • 9.63k • 241
• 8
lianghsun/vulnerability-mitigation-qa-zh_tw
Viewer
• Updated • 22 • 91
• 3
lightonai/dbpedia-entity-decontaminated
Viewer
• Updated • 1.69M • 212
lightonai/fiqa-decontaminated
Viewer
• Updated • 49.8k • 91
lightonai/hotpotqa-decontaminated
Viewer
• Updated • 2.33M • 64
lightonai/hotpotqa_contrastive
Viewer
• Updated • 85k • 16
lightonai/scifact-decontaminated
Viewer
• Updated • 1.31k • 102
lightonai/trivia_contrastive
Viewer
• Updated • 60.4k • 78
lionelchg/dolly_closed_qa
Viewer
• Updated • 1.77k • 28
• 3
lvogel123/factscore-claude-4.5-sonnet
Viewer
• Updated • 152 • 14
lvogel123/factscore-deepseek-v3.2-exp
Viewer
• Updated • 152 • 31
lvogel123/factscore-gemini-2.5-pro
Viewer
• Updated • 152 • 10
lvogel123/factscore-glm-4.6
Viewer
• Updated • 152 • 11
lvogel123/factscore-gpt-5-high
Viewer
• Updated • 152 • 10
lvogel123/factscore-gpt-oss-120b-high
Viewer
• Updated • 152 • 13
lvogel123/factscore-grok-4
Viewer
• Updated • 152 • 11
lvogel123/factscore-kimi-k2
Viewer
• Updated • 152 • 10
lvogel123/factscore-llama-3.3-nemotron-super-49b-v1.5
Viewer
• Updated • 152 • 11
lvogel123/factscore-llama-4-maverick
Viewer
• Updated • 152 • 8
lvogel123/factscore-qwen3-235b-a22b-thinking-2507
Viewer
• Updated • 152 • 11
lvogel123/gpqa-diamond-all
Viewer
• Updated • 10 • 16
lvogel123/gpqa-diamond-claude-4.5-sonnet
Viewer
• Updated • 241 • 51
• 1
lvogel123/gpqa-diamond-deepseek-v3.2-exp-high
Viewer
• Updated • 200 • 27
lvogel123/gpqa-diamond-gemini-2.5-pro
Viewer
• Updated • 744 • 301
lvogel123/gpqa-diamond-glm-4.6
Viewer
• Updated • 200 • 32
lvogel123/gpqa-diamond-glm-4.6-2
Viewer
• Updated • 199 • 28
lvogel123/gpqa-diamond-gpt-5-high
Viewer
• Updated • 201 • 25
lvogel123/gpqa-diamond-gpt-oss-120b-high
Viewer
• Updated • 200 • 27
lvogel123/gpqa-diamond-grok-4
Viewer
• Updated • 1.13k • 32
lvogel123/gpqa-diamond-kimi-k2
Viewer
• Updated • 200 • 26
lvogel123/gpqa-diamond-llama-3.3-nemotron-49b-v1.5
Viewer
• Updated • 200 • 58
lvogel123/gpqa-diamond-llama-4-maverick
Viewer
• Updated • 200 • 28
lvogel123/gpqa-diamond-qwen3-235b-a22b-2507
Viewer
• Updated • 561 • 30
lvogel123/grok-4-factscore
Viewer
• Updated • 3 • 10
markush1/adversarial-banking-questions
Viewer
• Updated • 2.25k • 7
• 3
markush1/adversarial-insurance-questions
Viewer
• Updated • 4.43k • 7
marmarg2/toxic-teenage-relationships
Updated • 5
• 2
mateowilliam/nemotron-super-reap-artifacts-draft
Preview
• Updated • 101
meandyou200175/data-query-sql
Viewer
• Updated • 11.7k • 10
mehuldamani/gpt5-simpleqa-20
Viewer
• Updated • 20 • 18
• 1
mehuldamani/half_hotpot_qa
Viewer
• Updated • 10.3k • 8
mehuldamani/hotpot_qa_for_multi
Viewer
• Updated • 20.5k • 116
mehuldamani/hotpot_qa_multi_models_pass_k_evals_onHotpot_nov11
Viewer
• Updated • 500 • 11
mehuldamani/hotpot_qa_single_models_pass_k_evals_onHotpot_nov11
Viewer
• Updated • 500 • 11
mehuldamani/hotpot_qa_test_gold_removed_1
Viewer
• Updated • 20.5k • 28
mehuldamani/hotpot_qa_test_gold_removed_2
Viewer
• Updated • 20.5k • 9
mehuldamani/hotpot_qa_trainTest_gold_removed_2
Viewer
• Updated • 20.5k • 32
mehuldamani/multi-answer-sft-target-dataset
Viewer
• Updated • 1.59k • 6
mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis
Viewer
• Updated • 2k • 192
mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain
Viewer
• Updated • 2k • 55
meoconxinhxan/Inspect-Search-Models-Benchmarking-Result-CIR-FOR-CHECK
Viewer
• Updated • 4.33k • 7
meoconxinhxan/Inspect-Search-Models-Benchmarking-Result-CIR-FOR-CHECK-Frame
Viewer
• Updated • 824 • 8
meoconxinhxan/_nano_ckeck_simpleqa
Viewer
• Updated • 4.33k • 16
meoconxinhxan/ii_med_ckeck_simpleqa
Viewer
• Updated • 4.33k • 21
meoconxinhxan/jan_nano_ckeck_simpleqa
Viewer
• Updated • 4.33k • 4
meoconxinhxan/qwen3-4b_simple_qa
Viewer
• Updated • 4.33k • 5
meoconxinhxan/r1-tool-open-ended-qa
Viewer
• Updated • 332 • 6
meoconxinhxan/search_qa_rl
Viewer
• Updated • 170k • 93
meoconxinhxan/search_r1_ds
Viewer
• Updated • 80k • 7
meoconxinhxan/search_r1_musque
Viewer
• Updated • 19.9k • 8
meoconxinhxan/simple_qa_stratified_kfold
Viewer
• Updated • 866 • 20
microsoft/MeetingBank-QA-Summary
Viewer
• Updated • 862 • 178
• 15
microsoft/bing_coronavirus_query_set
Viewer
• Updated • 318k • 307
• 1
microsoft/hnm-search-data
Viewer
• Updated • 33.3M • 24.2k
• 2
mlfoundations-dev/GPQADiamond_evalchemy
Viewer
• Updated • 1.19k • 510
mlfoundations-dev/GPQADiamond_evalchemy_gpt-4o-mini
Viewer
• Updated • 594 • 370
mlfoundations-dev/PDF_and_SCP_unfiltered_organic_chemistry_questions
Viewer
• Updated • 43.8k • 15
mlfoundations-dev/pdf_science_questions_verifiable_r1_traces__2_24_25
Viewer
• Updated • 1.62k • 78
mlfoundations-dev/r1_annotated_finqa
Viewer
• Updated • 5k • 464
• 1
mlfoundations-dev/sci_question_exp__scp_116k__training_2k_for_GPQA
Viewer
• Updated • 118k • 1.2k
• 1
mlfoundations-dev/sci_question_exp__scp_116k__training_2k_for_GPQA_eval_03-11-25_17-16-22_f912
Viewer
• Updated • 594 • 819
• 1
multi-train/WikiAnswers_1107
Viewer
• Updated • 200k • 28
multi-train/amazon-qa_1107
Viewer
• Updated • 200k • 5
multi-train/eli5_question_answer_1107
Viewer
• Updated • 200k • 28
• 3
multi-train/emb-hotpotqa-train
Viewer
• Updated • 68.7k • 12
multi-train/emb-medmcqa-train
Viewer
• Updated • 161k • 10
multi-train/emb-triviaqa-train
Viewer
• Updated • 52.9k • 21
multi-train/hotpotqa-train-multikilt_1107
Viewer
• Updated • 68.7k • 8
multi-train/searchQA_top5_snippets_1107
Viewer
• Updated • 117k • 10
multi-train/squad_pairs_1107
Viewer
• Updated • 87.6k • 5
multi-train/triviaqa-train-multikilt_1107
Viewer
• Updated • 52.9k • 7
multi-train/yahoo_answers_title_answer_1107
Viewer
• Updated • 200k • 8
open-llm-leaderboard-old/details_GeorgiaTechResearchInstitute__galpaca-30b
Updated • 1.8k
open-llm-leaderboard-old/details_garage-bAInd__Camel-Platypus2-70B
Updated • 485
open-llm-leaderboard/NousResearch__Hermes-3-Llama-3.1-70B-details
Viewer
• Updated • 43.2k • 112
open-llm-leaderboard/NousResearch__Nous-Hermes-2-Mixtral-8x7B-DPO-details
Viewer
• Updated • 45.6k • 123
open-llm-leaderboard/NousResearch__Nous-Hermes-2-Mixtral-8x7B-SFT-details
Viewer
• Updated • 80.9k • 117
opensporks/hotpotQA_filtered
Preview
• Updated • 2
projectlosangeles/Orpheus-MIDI-Search
Updated • 67
• 1
sam2ai/hindi_truthfulqa_gen_mini
Viewer
• Updated • 50 • 8
sambanovasystems/attackqa
Updated • 225
• 6
semran1/wiki_synth_qa_redstone_dclmknw_fwedu_pdf_mix
Viewer
• Updated • 25.4M • 331
sert121/adult_data_instruction_leaving_relationship_marital-status_capital-gain_occupation_education-num
Viewer
• Updated • 15.7k • 5
sert121/adult_dataset_age_workclass_education_marital-status_occupation_relationship
Viewer
• Updated • 48.8k • 4
sert121/adult_dataset_age_workclass_education_marital-status_occupation_relationship_race
Viewer
• Updated • 48.8k • 4
sert121/adult_dataset_age_workclass_education_marital-status_occupation_relationship_race_sex
Viewer
• Updated • 48.8k • 8
• 1
sert121/adult_dataset_age_workclass_education_marital__status_occupation_relationship
Viewer
• Updated • 48.8k • 4
sert121/adult_dataset_age_workclass_education_marital__status_occupation_relationship_race
Viewer
• Updated • 48.8k • 4
sert121/adult_dataset_age_workclass_education_marital__status_occupation_relationship_race_sex
Viewer
• Updated • 48.8k • 4
• 1
tencent/ArtifactsBenchmark
Viewer
• Updated • 1.83k • 127
• 13
thangvip/OpenOrca-translate-openQA
Viewer
• Updated • 26.7k • 5
thangvip/combined-vietnamese-legal-qa-pretrain
Viewer
• Updated • 904k • 63
• 2
thangvip/combined-vietnamese-legal-qa-pretrain-tokenized-8k
Viewer
• Updated • 60.8k • 112
thangvip/law-reading-comprehension-qa
Viewer
• Updated • 895k • 235
thangvip/law-reading-comprehension-qa-filtered
Viewer
• Updated • 205k • 33
thangvip/question-queries-finetune
Viewer
• Updated • 17.9k • 5
thangvip/thuvienphapluat-qa-normalize
Viewer
• Updated • 16k • 114
thangvip/thuvienphapluat-question-query
Viewer
• Updated • 19.9k • 5
thangvip/thuvienphapluat-question-query-1
Viewer
• Updated • 641 • 5
thangvip/vietnamese-legal-qa
Viewer
• Updated • 9.72k • 263
• 3
theblackcat102/alexa-qa-with-rank
Viewer
• Updated • 70.5k • 104
• 2
theblackcat102/amazon_item_synthetic_retrieval
Viewer
• Updated • 23.6k • 7
theblackcat102/amazon_item_synthetic_retrieval_final
Viewer
• Updated • 28k • 10
theblackcat102/barexam_qa
Viewer
• Updated • 80 • 8
theblackcat102/gqa-testdev-balanced
Viewer
• Updated • 12.6k • 59
theblackcat102/law_freeform_qa
Viewer
• Updated • 7.3k • 10
theblackcat102/prime_factorization
Viewer
• Updated • 1.33k • 23
waifu-research-department/regularization
Viewer
• Updated • 6.72k • 54
• 13
walledai/ForbiddenQuestions
Viewer
• Updated • 390 • 97
• 5
wandb/finqa-data-processed
Viewer
• Updated • 8.28k • 2.33k
• 2
wandb/finqa-data-processed-hallucination
Viewer
• Updated • 16.6k • 809
wandb/ragbench-test-sample
Viewer
• Updated • 957 • 10