ggml-org/granite-4.0-h-small-Q8_0-GGUF
This model was converted to GGUF format from ibm-granite/granite-4.0-h-small using llama.cpp via the ggml.ai's GGUF-my-repo space.
Refer to the original model card for more details on the model.
- Downloads last month
- 320
Hardware compatibility
Log In
to view the estimation
8-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for ggml-org/granite-4.0-h-small-Q8_0-GGUF
Base model
ibm-granite/granite-4.0-h-small