Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Jai's picture

Jai

jai23

Mi6paulino's profile picture

·

AI & ML interests

None yet

Organizations

None yet

jai23 's collections 6

Context Engineering

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 264

Benchmark/ Leaderboards

Running on CPU Upgrade

7.44k

MTEB Leaderboard

🥇

7.44k

Embedding Leaderboard

gretelai/synthetic_text_to_sql

Viewer • Updated Dec 16, 2025 • 106k • 3.19k • 659
b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499

pixparse/idl-wds

Viewer • Updated Mar 29, 2024 • 3.41M • 4.05k • 193
pixparse/pdfa-eng-wds

Viewer • Updated Mar 29, 2024 • 7.1k • 6.8k • 159

microsoft/orca-math-word-problems-200k

Viewer • Updated Mar 4, 2024 • 200k • 6.87k • 483
Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24, 2025 • 60k • 32.8k • 626
cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 549k • 755
TIGER-Lab/MMLU-Pro

Benchmark • Updated about 1 month ago • 12.1k • 158k • 478

Datasets - RAG Evaluation Bench Mark

galileo-ai/ragbench

Viewer • Updated Jun 11, 2024 • 95.4k • 4.79k • 114
rajpurkar/squad

Viewer • Updated Mar 4, 2024 • 98.2k • 159k • 366
google/FACTS-grounding-public

Viewer • Updated Dec 19, 2024 • 868 • 2.02k • 46
b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499

Context Engineering

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 264

pixparse/idl-wds

Viewer • Updated Mar 29, 2024 • 3.41M • 4.05k • 193
pixparse/pdfa-eng-wds

Viewer • Updated Mar 29, 2024 • 7.1k • 6.8k • 159

Benchmark/ Leaderboards

Running on CPU Upgrade

7.44k

MTEB Leaderboard

🥇

7.44k

Embedding Leaderboard

microsoft/orca-math-word-problems-200k

Viewer • Updated Mar 4, 2024 • 200k • 6.87k • 483
Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24, 2025 • 60k • 32.8k • 626
cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 549k • 755
TIGER-Lab/MMLU-Pro

Benchmark • Updated about 1 month ago • 12.1k • 158k • 478

gretelai/synthetic_text_to_sql

Viewer • Updated Dec 16, 2025 • 106k • 3.19k • 659
b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499

Datasets - RAG Evaluation Bench Mark

galileo-ai/ragbench

Viewer • Updated Jun 11, 2024 • 95.4k • 4.79k • 114
rajpurkar/squad

Viewer • Updated Mar 4, 2024 • 98.2k • 159k • 366
google/FACTS-grounding-public

Viewer • Updated Dec 19, 2024 • 868 • 2.02k • 46
b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 3.86k • 499

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs