cgus commited on
Commit
0fe98d2
·
verified ·
1 Parent(s): 7f2f0b6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -5
README.md CHANGED
@@ -1,10 +1,7 @@
1
  ---
2
  base_model:
3
- - IlyaGusev/saiga_nemo_12b
4
- - elinas/Chronos-Gold-12B-1.0
5
- - Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24
6
- - MarinaraSpaghetti/NemoMix-Unleashed-12B
7
- library_name: transformers
8
  tags:
9
  - mergekit
10
  - merge
@@ -15,6 +12,24 @@ language:
15
  - ru
16
  - en
17
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  # SAINEMO-reMIX
19
  ![SAINEMO-reMIX](./remixwife.webp)
20
 
 
1
  ---
2
  base_model:
3
+ - Moraliane/SAINEMO-reMIX
4
+ library_name: exllamav2
 
 
 
5
  tags:
6
  - mergekit
7
  - merge
 
12
  - ru
13
  - en
14
  ---
15
+ # SAINEMO-reMIX-exl2
16
+ Original model: [SAINEMO-reMIX](https://huggingface.co/Moraliane/SAINEMO-reMIX) by [Moraliane](https://huggingface.co/Moraliane)
17
+
18
+ ## Quants
19
+ [4bpw h6 (main)](https://huggingface.co/cgus/SAINEMO-reMIX-exl2/tree/main)
20
+ [4.5bpw h6](https://huggingface.co/cgus/SAINEMO-reMIX-exl2/tree/4.5bpw-h6)
21
+ [5bpw h6](https://huggingface.co/cgus/SAINEMO-reMIX-exl2/tree/5bpw-h6)
22
+ [6bpw h6](https://huggingface.co/cgus/SAINEMO-reMIX-exl2/tree/6bpw-h6)
23
+ [8bpw h8](https://huggingface.co/cgus/SAINEMO-reMIX-exl2/tree/8bpw-h8)
24
+
25
+ ## Quantization notes
26
+ Made with Exllamav2 0.2.8 with default dataset. This seems to be a Russian RP and perhaps a general-purpose model.
27
+ It can be used with TabbyAPI, Text-Generation-WebUI with RTX GPU (Windows) or RTX/ROCm (Linux).
28
+ The model has to fully fit your VRAM for optimal performance. If it doesn't fit, use GGUF version instead.
29
+
30
+ Для использования требуется TabbyAPI или Text-Generation-WebUI и видеокарта RTX (Windows) или RTX/ROCm (Linux).
31
+ Модели в exl2 формате должны помещаться в видеопамять видеокарты.
32
+ Если видеопамяти недостаточно, то лучше использовать модели в GGUF формате.
33
  # SAINEMO-reMIX
34
  ![SAINEMO-reMIX](./remixwife.webp)
35