Model Card for wmt25-cs-de-20layers-2e-05-100k-news-sentences
This model is a fine-tuned version of ymoslem/Aya-Expanse-8B-De-20Layers.
Quick start
Training procedure
Framework versions
- TRL: 0.19.1
- Transformers: 4.53.2
- Pytorch: 2.7.1
- Datasets: 4.0.0
- Tokenizers: 0.21.2
Citations
- Downloads last month
- 1
Model tree for ymoslem/wmt25-cs-de-20layers-2e-05-100k-news-sentences-doc-eval
Base model
CohereLabs/aya-expanse-8b