🧠 Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2 GGUFs
Quantized version of: BasedBase/Qwen3-Coder-30B-A3B-Instruct-480B-Distill-V2-Fp32
📦 Available GGUFs
| Format | Description |
|---|---|
| F16 | Unquantized half precision (16-bit floating point); highest quality of the listed formats, largest size |
| Q3_K_XL | Quantized (3-bit XL variant, based on the quantization table of the unsloth model Qwen3-30B-A3B-Thinking-2507); smallest size, faster inference ⚡ |
| Q4_K_XL | Quantized (4-bit XL variant, based on the quantization table of the unsloth model Qwen3-30B-A3B-Thinking-2507); smaller size, faster inference ⚡ |
| Q5_K_XL | Quantized (5-bit XL variant, based on the quantization table of the unsloth model Qwen3-30B-A3B-Thinking-2507); medium size, faster inference ⚡ |
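As a rough guide to how the formats compare in size, the nominal footprint can be estimated as parameters times bits per weight. The sketch below is only back-of-the-envelope arithmetic: it assumes the "30B" in the model name is the parameter count, and `estimate_mb` is a hypothetical helper, not part of any tool. Actual GGUF files will differ, typically upward for the quantized variants, because mixed-precision quantization tables keep some tensors at more bits than the nominal width.

```shell
# Nominal size in MB = parameters (in billions) * bits per weight * 1000 / 8.
# Hypothetical helper for illustration; integer arithmetic only.
estimate_mb() {
  echo $(( $1 * $2 * 1000 / 8 ))
}

PARAMS_B=30  # assumption: taken from the "30B" in the model name

for bits in 3 4 5 16; do
  echo "${bits}-bit: ~$(estimate_mb "$PARAMS_B" "$bits") MB nominal"
done
```

For the 16-bit case this gives 60000 MB (about 60 GB), which is why the quantized files are the practical choice on most consumer hardware.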
Usage
Example with llama.cpp (current builds name the CLI binary `llama-cli`; older builds called it `main`):
./llama-cli -m ./gguf-file-name.gguf -p "Hello world!"
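A slightly fuller invocation might look like the sketch below. It assumes a recent llama.cpp build with `llama-cli` on the `PATH`; the file name is the same placeholder as above, and the values for `-n` (tokens to generate), `-c` (context size), and `-ngl` (layers offloaded to the GPU) are illustrative choices, not recommendations from the model author.

```shell
# Sketch only: the variable names and flag values are illustrative assumptions.
MODEL="${MODEL:-./gguf-file-name.gguf}"   # placeholder; point this at the downloaded GGUF
PROMPT="${PROMPT:-Hello world!}"
N_PREDICT=256     # -n: maximum number of tokens to generate
CTX_SIZE=4096     # -c: context window size in tokens
GPU_LAYERS=99     # -ngl: layers to offload to the GPU (use 0 for CPU only)

# Check the prerequisites first so a missing binary or file gives a clear message.
if ! command -v llama-cli >/dev/null 2>&1; then
  echo "llama-cli not found on PATH; build or install llama.cpp first" >&2
elif [ ! -f "$MODEL" ]; then
  echo "model file not found: $MODEL" >&2
else
  llama-cli -m "$MODEL" -p "$PROMPT" -n "$N_PREDICT" -c "$CTX_SIZE" -ngl "$GPU_LAYERS"
fi
```

Setting `MODEL` and `PROMPT` in the environment overrides the defaults, so the same script can be reused across the different quantizations.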