This is Q8_0 quantization model of Llava1.6.
Run it by llama_cpp
# !pip install llama-cpp-python
from llama_cpp import Llama
llm = Llama.from_pretrained(
repo_id="Steven0090/Llama3.2-Instruct-1B-gguf",
filename="llama32_1B_q8_0.gguf",
)
- Downloads last month
- 16
Hardware compatibility
Log In
to view the estimation
8-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for Steven0090/Llama3.2-Instruct-1B-gguf
Base model
meta-llama/Llama-3.2-3B-Instruct