Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

datablations

https://github.com/huggingface/datablations
Activity Feed Request to join this org

AI & ML interests

Scaling Data-Constrained Language Models

Niklas Muennighoff's profile pictureTeven Le Scao's profile pictureNouamane Tazi's profile pictureRisto Luukkonen's profile pictureAleksandra Piktus's profile pictureSampo Pyysalo's profile pictureColin Raffel's profile pictureThomas Wolf's profile pictureSasha Rush's profile picture

models 38

datablations/lm1-2b8-55b-oscartasky

Updated Jun 24, 2023

datablations/lm1-2b8-55b-tasky

Updated Jun 13, 2023

datablations/lm1-8b7-178b-c4-repetitions

Updated May 30, 2023

datablations/lm1-8b7-178b-oscar-repetitions

Updated May 30, 2023 • 1

datablations/lm1-misc

Updated May 30, 2023

datablations/lm1-4b2-84b-c4-repetitions

Updated May 30, 2023

datablations/lm1-2b8-55b-c4-perplexity

Updated May 26, 2023

datablations/lm1-misc-pile

Updated May 25, 2023

datablations/lm1-2b8-55b-c4-repetitions

Updated May 20, 2023

datablations/lm1-misc-oscar

Updated May 20, 2023
View 38 models

datasets 13

datablations/scripts

Viewer • Updated Jun 15, 2023 • 3.48M • 820

datablations/oscar-subsets

Viewer • Updated Jun 14, 2023 • 365k • 1.36k

datablations/c4-subsets

Viewer • Updated Jun 14, 2023 • 729k • 1.11k • 6

datablations/c4-filter-megatron

Updated May 28, 2023 • 1.19k

datablations/oscar-filter-megatron

Updated May 27, 2023 • 771

datablations/python-megatron

Updated May 22, 2023 • 4.16k • 1

datablations/subsets

Viewer • Updated May 10, 2023 • 365k • 72

datablations/oscar-filter

Viewer • Updated May 10, 2023 • 432M • 385

datablations/oscar-dedup-expanded

Viewer • Updated May 10, 2023 • 432M • 390 • 1

datablations/mup

Updated Apr 24, 2023 • 151
View 13 datasets
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs