pietrolesci's Collections
Text Classification Datasets
Updated Nov 12, 2024
A curated collection of common datasets for text classification
pietrolesci/amazoncat-13k • Viewer • Updated Apr 9, 2025 • 5.99M • 1.76k • 2
pietrolesci/civilcomments-wilds • Viewer • Updated Jul 2, 2024 • 893k • 318 • 2
pietrolesci/dbpedia_14_indexed • Viewer • Updated May 11, 2023 • 630k • 341
pietrolesci/DBPedia_Classes_indexed • Viewer • Updated May 11, 2023 • 338k • 97
pietrolesci/pubmed-20k-rct • Viewer • Updated May 12, 2023 • 236k • 289
pietrolesci/eurlex-57k • Viewer • Updated Sep 11, 2023 • 235k • 665
pietrolesci/pubmed-200k-rct • Viewer • Updated Sep 11, 2023 • 9.08M • 336
pietrolesci/imdb • Viewer • Updated Sep 11, 2023 • 200k • 55 • 2
pietrolesci/agnews • Viewer • Updated Apr 9, 2025 • 510k • 90
pietrolesci/wikitoxic • Viewer • Updated Apr 9, 2025 • 894k • 199 • 1
pietrolesci/hyperpartisan_news_detection • Viewer • Updated Sep 25, 2023 • 1.5M • 201 • 2
pietrolesci/yahoo_answers_topics • Viewer • Updated Sep 25, 2023 • 2.92M • 64
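The datasets listed in this collection can, in principle, be pulled with the Hugging Face `datasets` library. The following is a minimal sketch, not the collection author's own workflow: the repository IDs are copied from the listing, while the `COLLECTION` list and the `load_from_collection` helper are hypothetical names introduced only for illustration. It assumes each repository loads with its default configuration, which this page does not confirm; some repositories may require a configuration name or a specific split.

```python
# Repository IDs copied verbatim from the collection listing.
COLLECTION = [
    "pietrolesci/amazoncat-13k",
    "pietrolesci/civilcomments-wilds",
    "pietrolesci/dbpedia_14_indexed",
    "pietrolesci/DBPedia_Classes_indexed",
    "pietrolesci/pubmed-20k-rct",
    "pietrolesci/eurlex-57k",
    "pietrolesci/pubmed-200k-rct",
    "pietrolesci/imdb",
    "pietrolesci/agnews",
    "pietrolesci/wikitoxic",
    "pietrolesci/hyperpartisan_news_detection",
    "pietrolesci/yahoo_answers_topics",
]


def load_from_collection(repo_id, **kwargs):
    """Hypothetical helper: load one dataset that belongs to the collection.

    Extra keyword arguments are forwarded to `datasets.load_dataset`
    (for example `split=...` or `streaming=True`). Which splits and
    configurations exist is an assumption to check on each dataset page.
    """
    if repo_id not in COLLECTION:
        raise ValueError(f"{repo_id!r} is not part of this collection")
    # Imported lazily so the list above can be used without the
    # third-party `datasets` package installed.
    from datasets import load_dataset

    return load_dataset(repo_id, **kwargs)
```

Calling `load_from_collection("pietrolesci/agnews")` would download the data from the Hub, so it needs network access and the `datasets` package; passing `streaming=True` avoids a full download for the larger entries.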