|
|
--- |
|
|
license: mit |
|
|
base_model: zai-org/GLM-4.6V |
|
|
base_model_relation: quantized |
|
|
quantized_by: turboderp |
|
|
tags: |
|
|
- exl3 |
|
|
--- |
|
|
|
|
|
EXL3 quants of [GLM-4.6V](https://huggingface.co/zai-org/GLM-4.6V) |
|
|
|
|
|
⚠️ Requires ExLlamaV3 v0.0.18 (or v0.0.17 `dev` branch) |
|
|
|
|
|
Base bitrates: |
|
|
|
|
|
[2.00 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/2.00bpw) |
|
|
[3.00 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/3.00bpw) |
|
|
[4.00 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/4.00bpw) |
|
|
[5.00 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/5.00bpw) |
|
|
|
|
|
Optimized: |
|
|
|
|
|
[2.13 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/2.13bpw) |
|
|
[2.32 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/2.32bpw) |
|
|
[2.55 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/2.55bpw) |
|
|
[2.80 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/2.80bpw) |
|
|
[3.13 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/3.13bpw) |
|
|
[3.55 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/3.55bpw) |
|
|
[4.07 bits per weight](https://huggingface.co/turboderp/GLM-4.6V-exl3/tree/4.07bpw) |
|
|
|