Context Engineering A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17, 2025 • 264
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17, 2025 • 264
Benchmark/ Leaderboards Running on CPU Upgrade 7.44k MTEB Leaderboard 🥇 7.44k Embedding Leaderboard
Text to SQL gretelai/synthetic_text_to_sql Viewer • Updated Dec 16, 2025 • 106k • 3.19k • 659 b-mc2/sql-create-context Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499
PDF Datasets pixparse/idl-wds Viewer • Updated Mar 29, 2024 • 3.41M • 4.05k • 193 pixparse/pdfa-eng-wds Viewer • Updated Mar 29, 2024 • 7.1k • 6.8k • 159
Tentative microsoft/orca-math-word-problems-200k Viewer • Updated Mar 4, 2024 • 200k • 6.87k • 483 Salesforce/xlam-function-calling-60k Viewer • Updated Jan 24, 2025 • 60k • 32.8k • 626 cais/mmlu Viewer • Updated Mar 8, 2024 • 231k • 549k • 755 TIGER-Lab/MMLU-Pro Benchmark • Updated about 1 month ago • 12.1k • 158k • 478
Datasets - RAG Evaluation Bench Mark galileo-ai/ragbench Viewer • Updated Jun 11, 2024 • 95.4k • 4.79k • 114 rajpurkar/squad Viewer • Updated Mar 4, 2024 • 98.2k • 159k • 366 google/FACTS-grounding-public Viewer • Updated Dec 19, 2024 • 868 • 2.02k • 46 b-mc2/sql-create-context Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499
Context Engineering A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17, 2025 • 264
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17, 2025 • 264
PDF Datasets pixparse/idl-wds Viewer • Updated Mar 29, 2024 • 3.41M • 4.05k • 193 pixparse/pdfa-eng-wds Viewer • Updated Mar 29, 2024 • 7.1k • 6.8k • 159
Benchmark/ Leaderboards Running on CPU Upgrade 7.44k MTEB Leaderboard 🥇 7.44k Embedding Leaderboard
Tentative microsoft/orca-math-word-problems-200k Viewer • Updated Mar 4, 2024 • 200k • 6.87k • 483 Salesforce/xlam-function-calling-60k Viewer • Updated Jan 24, 2025 • 60k • 32.8k • 626 cais/mmlu Viewer • Updated Mar 8, 2024 • 231k • 549k • 755 TIGER-Lab/MMLU-Pro Benchmark • Updated about 1 month ago • 12.1k • 158k • 478
Text to SQL gretelai/synthetic_text_to_sql Viewer • Updated Dec 16, 2025 • 106k • 3.19k • 659 b-mc2/sql-create-context Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499
Datasets - RAG Evaluation Bench Mark galileo-ai/ragbench Viewer • Updated Jun 11, 2024 • 95.4k • 4.79k • 114 rajpurkar/squad Viewer • Updated Mar 4, 2024 • 98.2k • 159k • 366 google/FACTS-grounding-public Viewer • Updated Dec 19, 2024 • 868 • 2.02k • 46 b-mc2/sql-create-context Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499