Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,23 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
language:
|
| 4 |
+
- nl
|
| 5 |
+
tags:
|
| 6 |
+
- gguf
|
| 7 |
+
---
|
| 8 |
+
|
| 9 |
+
This repository contains quantized versions of [BramVanroy/fietje-2b-chat](https://huggingface.co/BramVanroy/fietje-2b-chat):
|
| 10 |
+
|
| 11 |
+
- `-f16` (5.6GB): best quality, but largest and slowest (recommended if you have the capacity, otherwise q8_0)
|
| 12 |
+
- `-q8_0` (3.0GB): minimal quality loss, smaller
|
| 13 |
+
- `-q5_k_m` (2.0GB): users have reported considerable quality loss in the chat `q5_k_m` version so you may want to avoid it
|
| 14 |
+
|
| 15 |
+
Also available on ollama:
|
| 16 |
+
|
| 17 |
+
```sh
|
| 18 |
+
# defaults to f16
|
| 19 |
+
ollama run bramvanroy/fietje-2b-chat
|
| 20 |
+
ollama run bramvanroy/fietje-2b-chat:f16
|
| 21 |
+
ollama run bramvanroy/fietje-2b-chat:q8_0
|
| 22 |
+
ollama run bramvanroy/fietje-2b-chat:q5_k_m
|
| 23 |
+
```
|