cyankiwi
/

GLM-4.6V-AWQ-8bit

Image-Text-to-Text

compressed-tensors

Model card Files Files and versions

cpatonn commited on 6 days ago

Commit

c2925c8

·

verified ·

1 Parent(s): eff2e78

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ base_model: zai-org/GLM-4.6V
 ### Quantization Details
-- **Quantization Method:** AWQ
 - **Bits:** 8
 - **Group Size:** 32
 - **Calibration Dataset:** [5CD-AI/LLaVA-CoT-o1-Instruct](https://huggingface.co/datasets/5CD-AI/LLaVA-CoT-o1-Instruct)
@@ -45,6 +45,10 @@ vllm serve cyankiwi/GLM-4.6V-AWQ-8bit
 ## Additional Information
 ### Changelog
 - **v1.0.0** - Initial quantized release

 ### Quantization Details
+- **Quantization Method:** cyankiwi AWQ v1.0
 - **Bits:** 8
 - **Group Size:** 32
 - **Calibration Dataset:** [5CD-AI/LLaVA-CoT-o1-Instruct](https://huggingface.co/datasets/5CD-AI/LLaVA-CoT-o1-Instruct)
 ## Additional Information
+### Known Issues
+- `tensor-parallel-size > 2` requires `--enable-expert-parallel`
 ### Changelog
 - **v1.0.0** - Initial quantized release