Has anyone successfully loaded q8_0 gguf?

#7
by rthug - opened

I see 19k downloads, but I can't find any information about how to get around the qwen3vlmoe architecture error. If anyone has a solution it would be much appreciated!

What error message?

🥲 Failed to load the model

Failed to load model

error loading model: error loading model architecture: unknown model architecture: 'qwen3vlmoe'

^ from LM Studio — I get a similar error in Ollama and llama.cpp.

LM Studio can run the MLX version of Qwen3 VL, but it doesn't currently support the GGUF version. LM Studio apparently hasn't updated its llama.cpp fork, and upstream llama.cpp appears to have never supported Qwen3 VL — hence the "unknown model architecture: 'qwen3vlmoe'" error.
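If you want to confirm that the file itself is fine and the problem really is loader support, you can read the `general.architecture` key straight out of the GGUF header. Here's a minimal Python sketch based on the GGUF container layout (magic, version, tensor count, KV count, then key/value pairs); it only inspects the header, it doesn't load the model, and it bails out early on array-typed values since `general.architecture` normally appears before any arrays:

```python
import struct

# Scalar value sizes in bytes, per the GGUF spec's value-type enum
# (uint8/int8, uint16/int16, uint32/int32/float32, bool, uint64/int64/float64).
_SCALAR_SIZES = {0: 1, 1: 1, 2: 2, 3: 2, 4: 4, 5: 4, 6: 4, 7: 1, 10: 8, 11: 8, 12: 8}

def gguf_architecture(path):
    """Return the general.architecture string from a GGUF file's header, or None."""
    with open(path, "rb") as f:
        if f.read(4) != b"GGUF":
            raise ValueError("not a GGUF file")
        version, = struct.unpack("<I", f.read(4))
        n_tensors, = struct.unpack("<Q", f.read(8))
        n_kv, = struct.unpack("<Q", f.read(8))
        for _ in range(n_kv):
            klen, = struct.unpack("<Q", f.read(8))
            key = f.read(klen).decode("utf-8")
            vtype, = struct.unpack("<I", f.read(4))
            if vtype == 8:  # string value: uint64 length + bytes
                vlen, = struct.unpack("<Q", f.read(8))
                value = f.read(vlen).decode("utf-8")
                if key == "general.architecture":
                    return value
            elif vtype in _SCALAR_SIZES:
                f.seek(_SCALAR_SIZES[vtype], 1)  # skip scalar value
            else:
                # Array values (type 9) need element-wise skipping; stop scanning.
                break
    return None
```

If this prints `qwen3vlmoe` for your q8_0 file, the GGUF is declaring exactly the architecture the error names, and the fix has to come from the loader side (a llama.cpp build that knows that string), not from re-downloading or re-quantizing.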
