Buckets:

Neon-coding/Tok / meta.json
Neon-tech's picture
download
raw
326 Bytes
{
"vocab_size": 65536,
"context_len": 1024,
"total_tokens": 15539648641,
"val_tokens": 2000000,
"phi_repeats": 10,
"textbook_repeats": 20,
"sources": [
"fineweb_merged.bin",
"wiki_merged.bin",
"openwebmath_merged.bin",
"code_merged.bin",
"phi__programming_books.bin",
"textbook.bin"
]
}

Xet Storage Details

Size:
326 Bytes
·
Xet hash:
7cb2baf49e21245d94919b0e608f2b31e6fc07001c9c1c1e585225d13de1e97e

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.