q4_1, q4_0, and q3_K_L quants for the GPU poor (me).

Format: GGUF
Model size: 4B params
Architecture: qwen3

Quantization levels: 3-bit (q3_K_L), 4-bit (q4_0, q4_1), 16-bit

Repository: tikeape/Qwen3-4B-Thinking-2507-Command-A-Reasoning-Distill-GGUF
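
Since the repo ships GGUF files only, one way to try a quant locally is with llama-cpp-python. The snippet below is a minimal sketch, not part of this repo: the GGUF filename is an assumption, so check the repository's file listing and substitute the exact name of the quant you want (q3_K_L, q4_0, or q4_1).

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Download one of the 4-bit quants from the Hub.
# NOTE: the filename below is a guess at the naming pattern; verify it
# against the actual files in the repository before running.
model_path = hf_hub_download(
    repo_id="tikeape/Qwen3-4B-Thinking-2507-Command-A-Reasoning-Distill-GGUF",
    filename="Qwen3-4B-Thinking-2507-Command-A-Reasoning-Distill-Q4_0.gguf",
)

# Modest context window so the model fits on small GPUs or CPU;
# n_gpu_layers=-1 offloads as many layers as the GPU can hold.
llm = Llama(model_path=model_path, n_ctx=4096, n_gpu_layers=-1)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain GGUF quantization in one sentence."}],
)
print(out["choices"][0]["message"]["content"])
```

The same GGUF file also works directly with the llama.cpp CLI or any other GGUF-compatible runtime; the 3-bit quant trades a little quality for a smaller memory footprint than the 4-bit ones.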