This is Qwen/Qwen3-VL-8B-Instruct quantized with AutoRound in W4A16 (GPTQ format). The model has been created, tested, and evaluated by The Kaitchup. The model is NOT compatible with vLLM (as of v0.11).

Developed by: The Kaitchup
License: Apache 2.0 license

How to Support My Work

Subscribe to The Kaitchup. This helps me a lot to continue quantizing and evaluating models for free. Or you prefer to give some GPU hours, "buy me a coffee"

Downloads last month: 11

Safetensors

Model size

2B params

Tensor type

I32

BF16

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kaitchup/Qwen3-VL-8B-Instruct-W4A16

Base model

Qwen/Qwen3-VL-8B-Instruct

Quantized

(33)

this model

Collection including kaitchup/Qwen3-VL-8B-Instruct-W4A16

Quantized Qwen3-VL

Collection

4 items • Updated 5 days ago