Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ByteSpanTokenisers
/
fw57M-tied_finewebedu-20B_ByteSpanSurprisalCombinedSeeding_64000
like
0
Follow
ByteSpan Tokenisers
4
TensorBoard
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
fw57M-tied_finewebedu-20B_ByteSpanSurprisalCombinedSeeding_64000
37.4 MB
1 contributor
History:
3 commits
codebyzeb
Upload tokenizer
4d9439c
verified
5 months ago
version_0
Upload folder using huggingface_hub
5 months ago
.gitattributes
Safe
1.52 kB
initial commit
5 months ago
README.md
Safe
2.29 kB
Upload tokenizer
5 months ago
blimp_results.json
Safe
89.9 kB
Upload folder using huggingface_hub
5 months ago
hparams.yaml
Safe
2.24 kB
Upload folder using huggingface_hub
5 months ago
special_tokens_map.json
Safe
579 Bytes
Upload tokenizer
5 months ago
tb_logs.parquet
2.14 MB
xet
Upload folder using huggingface_hub
5 months ago
tokenizer.json
Safe
1.56 MB
Upload tokenizer
5 months ago
tokenizer_config.json
Safe
872 Bytes
Upload tokenizer
5 months ago