mdeberta-id-20k

This model is a vocabulary-pruned version of microsoft/mdeberta-v3-base, reduced to a 20k-token vocabulary for the Indonesian language.
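
The exact pruning procedure used for this model is not documented here. As a rough illustration of what vocabulary pruning generally involves, the sketch below keeps only the embedding rows for a chosen set of tokens and renumbers their ids. The function name `prune_vocabulary`, the toy vocabulary, and the toy embeddings are all hypothetical and are not taken from this project.

```python
# Illustrative sketch only: NOT the procedure used to build this model.
# Vocabulary pruning keeps the embedding rows of selected tokens and
# assigns them new, contiguous ids.

def prune_vocabulary(embeddings, vocab, keep_tokens):
    """Return (new_embeddings, new_vocab) restricted to keep_tokens.

    embeddings:  list of rows, one per original token id
    vocab:       dict mapping token -> original id
    keep_tokens: tokens to retain, in the desired new order
    """
    new_vocab = {}
    new_embeddings = []
    for token in keep_tokens:
        old_id = vocab[token]
        new_vocab[token] = len(new_embeddings)  # next free id
        new_embeddings.append(list(embeddings[old_id]))  # copy the row
    return new_embeddings, new_vocab


# Toy multilingual vocabulary with 2-dimensional embeddings (hypothetical).
vocab = {"[PAD]": 0, "[UNK]": 1, "saya": 2, "hello": 3, "makan": 4}
embeddings = [[0.0, 0.0], [0.1, 0.1], [0.2, 0.3], [0.4, 0.5], [0.6, 0.7]]

# Keep the special tokens plus the Indonesian tokens.
keep = ["[PAD]", "[UNK]", "saya", "makan"]
pruned_embeddings, pruned_vocab = prune_vocabulary(embeddings, vocab, keep)
```

In this toy case the embedding table shrinks from five rows to four, and "makan" moves from id 4 to id 3. The remaining Transformer weights are untouched, which is why pruning reduces model size without changing the encoder itself.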

Vocabulary: 20k tokens (Indonesian)
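
A minimal loading sketch, assuming the repository id muchad/mdeberta-id-20k ships tokenizer files alongside the weights and loads through the standard `transformers` Auto classes. It also assumes `transformers`, `torch`, and `sentencepiece` are installed. The helper names `load_pruned_model` and `embed` are hypothetical and used only for illustration.

```python
# Sketch under the assumptions stated above; not an official usage guide.

MODEL_ID = "muchad/mdeberta-id-20k"


def load_pruned_model(model_id=MODEL_ID):
    """Download and return (tokenizer, model) for the given repository id."""
    # Imported lazily so this file can be imported without transformers.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


def embed(text, tokenizer, model):
    """Return the last hidden states for a single input string."""
    import torch

    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():  # inference only, no gradients needed
        outputs = model(**inputs)
    return outputs.last_hidden_state
```

Calling `tokenizer, model = load_pruned_model()` followed by `embed("Saya suka membaca buku.", tokenizer, model)` should return a tensor of shape (1, sequence length, hidden size), provided the assumptions above hold.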

Note: This model is part of an ongoing research project on efficient Transformer deployment. Full paper and benchmarks will be linked upon publication.

Model size: 0.1B params
Tensor type: F32 (Safetensors)