Table formatting updated
Browse files- Qwen3-4B-Q3_K_S/README.md +18 -18
Qwen3-4B-Q3_K_S/README.md
CHANGED
|
@@ -32,10 +32,10 @@ Quantized version of [Qwen/Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B) at **
|
|
| 32 |
|
| 33 |
## Quality & Performance
|
| 34 |
|
| 35 |
-
| Metric
|
| 36 |
-
|
| 37 |
-
| **Speed**
|
| 38 |
-
| **RAM Required**
|
| 39 |
| **Recommendation** | 🥈 Runner up. A very good model for a wide range of queries. |
|
| 40 |
|
| 41 |
## Prompt Template (ChatML)
|
|
@@ -57,13 +57,13 @@ Set this in your app (LM Studio, OpenWebUI, etc.) for best results.
|
|
| 57 |
### Thinking Mode (Recommended for Logic)
|
| 58 |
Use when solving math, coding, or logical problems.
|
| 59 |
|
| 60 |
-
| Parameter
|
| 61 |
-
|
| 62 |
-
| Temperature
|
| 63 |
-
| Top-P
|
| 64 |
-
| Top-K
|
| 65 |
-
| Min-P
|
| 66 |
-
| Repeat Penalty | 1.1
|
| 67 |
|
| 68 |
> ❗ DO NOT use greedy decoding — it causes infinite loops.
|
| 69 |
|
|
@@ -74,13 +74,13 @@ Enable via:
|
|
| 74 |
### Non-Thinking Mode (Fast Dialogue)
|
| 75 |
For casual chat and quick replies.
|
| 76 |
|
| 77 |
-
| Parameter
|
| 78 |
-
|
| 79 |
-
| Temperature
|
| 80 |
-
| Top-P
|
| 81 |
-
| Top-K
|
| 82 |
-
| Min-P
|
| 83 |
-
| Repeat Penalty | 1.1
|
| 84 |
|
| 85 |
Enable via:
|
| 86 |
- `enable_thinking=False`
|
|
|
|
| 32 |
|
| 33 |
## Quality & Performance
|
| 34 |
|
| 35 |
+
| Metric | Value |
|
| 36 |
+
|--------------------|--------------------------------------------------------------|
|
| 37 |
+
| **Speed** | ⚡ Fast |
|
| 38 |
+
| **RAM Required** | ~2.4 GB |
|
| 39 |
| **Recommendation** | 🥈 Runner up. A very good model for a wide range of queries. |
|
| 40 |
|
| 41 |
## Prompt Template (ChatML)
|
|
|
|
| 57 |
### Thinking Mode (Recommended for Logic)
|
| 58 |
Use when solving math, coding, or logical problems.
|
| 59 |
|
| 60 |
+
| Parameter | Value |
|
| 61 |
+
|----------------|-------|
|
| 62 |
+
| Temperature | 0.6 |
|
| 63 |
+
| Top-P | 0.95 |
|
| 64 |
+
| Top-K | 20 |
|
| 65 |
+
| Min-P | 0.0 |
|
| 66 |
+
| Repeat Penalty | 1.1 |
|
| 67 |
|
| 68 |
> ❗ DO NOT use greedy decoding — it causes infinite loops.
|
| 69 |
|
|
|
|
| 74 |
### Non-Thinking Mode (Fast Dialogue)
|
| 75 |
For casual chat and quick replies.
|
| 76 |
|
| 77 |
+
| Parameter | Value |
|
| 78 |
+
|----------------|-------|
|
| 79 |
+
| Temperature | 0.7 |
|
| 80 |
+
| Top-P | 0.8 |
|
| 81 |
+
| Top-K | 20 |
|
| 82 |
+
| Min-P | 0.0 |
|
| 83 |
+
| Repeat Penalty | 1.1 |
|
| 84 |
|
| 85 |
Enable via:
|
| 86 |
- `enable_thinking=False`
|