Quality & Performance updated
Browse files
Qwen3-4B-Q5_K_S/README.md
CHANGED
|
@@ -14,7 +14,7 @@ base_model: Qwen/Qwen3-4B
|
|
| 14 |
author: geoffmunn
|
| 15 |
---
|
| 16 |
|
| 17 |
-
# Qwen3-4B
|
| 18 |
|
| 19 |
Quantized version of [Qwen/Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B) at **Q5_K_S** level, derived from **f16** base weights.
|
| 20 |
|
|
@@ -30,10 +30,9 @@ Quantized version of [Qwen/Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B) at **
|
|
| 30 |
|
| 31 |
| Metric | Value |
|
| 32 |
|-------|-------|
|
| 33 |
-
| **Quality** | High |
|
| 34 |
| **Speed** | 🐢 Medium |
|
| 35 |
| **RAM Required** | ~3.5 GB |
|
| 36 |
-
| **Recommendation** |
|
| 37 |
|
| 38 |
## Prompt Template (ChatML)
|
| 39 |
|
|
|
|
| 14 |
author: geoffmunn
|
| 15 |
---
|
| 16 |
|
| 17 |
+
# Qwen3-4B:Q5_K_S
|
| 18 |
|
| 19 |
Quantized version of [Qwen/Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B) at **Q5_K_S** level, derived from **f16** base weights.
|
| 20 |
|
|
|
|
| 30 |
|
| 31 |
| Metric | Value |
|
| 32 |
|-------|-------|
|
|
|
|
| 33 |
| **Speed** | 🐢 Medium |
|
| 34 |
| **RAM Required** | ~3.5 GB |
|
| 35 |
+
| **Recommendation** | Did not appear in the top 3 for any question. Not recommended. |
|
| 36 |
|
| 37 |
## Prompt Template (ChatML)
|
| 38 |
|