versae/scandeng-tokenizer

Text Generation
Transformers
mistral
scandeng-tokenizer
4.37 MB
  • 1 contributor
History: 4 commits
Latest commit by versae: Add HF tokenizer converted from SentencePiece (04b8476, over 1 year ago)
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    28 Bytes
    initial commit over 1 year ago
  • config.json
    595 Bytes
    Create config.json over 1 year ago
  • convert.sh
    1.1 kB
    Add HF tokenizer converted from SentencePiece over 1 year ago
  • merges.txt
    568 kB
    Add HF tokenizer converted from SentencePiece over 1 year ago
  • sentencepiece.model
    770 kB
    SentencePiece byte fallback 32k over 1 year ago
  • sentencepiece.vocab.bak
    486 kB
    Add HF tokenizer converted from SentencePiece over 1 year ago
  • special_tokens_map.json
    72 Bytes
    Add HF tokenizer converted from SentencePiece over 1 year ago
  • tokenizer.json
    1.85 MB
    Add HF tokenizer converted from SentencePiece over 1 year ago
  • tokenizer_config.json
    826 Bytes
    Add HF tokenizer converted from SentencePiece over 1 year ago
  • vocab.json
    687 kB
    Add HF tokenizer converted from SentencePiece over 1 year ago