These are quantizations of the model Gelato-30B-A3B

The imatrix has been used from mradermacher.
As most of the quants are available from the great mradermacher team, I will include here only the quants that are missing.

Usage Notes:

  • Download the latest llama.cpp to use these quantizations.
  • Try to use the best quality you can run.
  • For the mmproj file, the F32 version is recommended for best results (F32 > BF16 > F16).
Downloads last month
2,001
GGUF
Model size
31B params
Architecture
qwen3vlmoe
Hardware compatibility
Log In to view the estimation

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for noctrex/Gelato-30B-A3B-i1-GGUF

Quantized
(5)
this model