Update README.md
Browse files
README.md
CHANGED
|
@@ -54,7 +54,7 @@ To unlock generative instruction-following capabilities, we utilized a two-stage
|
|
| 54 |
|
| 55 |
The model was evaluated on a representative subset of the **CETVEL Benchmark Suite**. DiffutronLM-0.3B (2nd Stage) demonstrates remarkable parameter efficiency, outperforming models up to 7x its size (e.g., Kumru-2B and TURNA-1.1B) on average scores.
|
| 56 |
|
| 57 |
-
| Benchmark | Diffutron-1st (0.3B) | Diffutron-2nd (0.3B) | TURNA (1.1B) | Kumru (2B) | Kanarya (2B) | Llama-3.2 (3B) | Trendyol (7B) | Aya-101 (13B) |
|
| 58 |
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|
| 59 |
| **Belebele_TR** | 22.22 | 27.00 | 22.56 | 29.00 | 28.11 | **55.78** | 36.22 | 22.89 |
|
| 60 |
| **EXAMS_TR** | 25.95 | 27.74 | 23.66 | **30.03** | **30.03** | 26.21 | 28.50 | 22.90 |
|
|
|
|
| 54 |
|
| 55 |
The model was evaluated on a representative subset of the **CETVEL Benchmark Suite**. DiffutronLM-0.3B (2nd Stage) demonstrates remarkable parameter efficiency, outperforming models up to 7x its size (e.g., Kumru-2B and TURNA-1.1B) on average scores.
|
| 56 |
|
| 57 |
+
| Benchmark | Diffutron-1st-Stage (0.3B) | Diffutron-2nd-Stage (0.3B) | TURNA (1.1B) | Kumru (2B) | Kanarya (2B) | Llama-3.2 (3B) | Trendyol (7B) | Aya-101 (13B) |
|
| 58 |
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|
| 59 |
| **Belebele_TR** | 22.22 | 27.00 | 22.56 | 29.00 | 28.11 | **55.78** | 36.22 | 22.89 |
|
| 60 |
| **EXAMS_TR** | 25.95 | 27.74 | 23.66 | **30.03** | **30.03** | 26.21 | 28.50 | 22.90 |
|