theprint-12B-MoE-3A
This repository contains GGUF quantizations of theprint-12B-MoE-3A, an experimental Mixture of Experts (MoE) model that combines four specialized Llama 3.2 3B fine-tunes into a single 12B-parameter model with roughly 3B active parameters per token.
For full details on the underlying model, see theprint/theprint-12B-MoE-3A.
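To run one of the GGUF files locally, a common route is llama.cpp via the llama-cpp-python bindings. The sketch below is a minimal example rather than an official recipe: the Q4_K_M filename is an assumption, so check the repository's file listing for the exact quantization you downloaded.

```python
# Minimal sketch: chat with a GGUF quantization of theprint-12B-MoE-3A
# using llama-cpp-python. The filename below is an assumption -- use the
# actual file you downloaded from this repository.
from llama_cpp import Llama

llm = Llama(
    model_path="theprint-12B-MoE-3A.Q4_K_M.gguf",  # assumed filename
    n_ctx=4096,        # context window; adjust for your hardware
    n_gpu_layers=-1,   # offload all layers to GPU; set 0 for CPU-only
)

response = llm.create_chat_completion(
    messages=[
        {"role": "user", "content": "Summarize mixture-of-experts routing in two sentences."}
    ],
    max_tokens=256,
)
print(response["choices"][0]["message"]["content"])
```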
GGUF quantizations are available in the following precisions (see the download sketch after this list):
- 2-bit
- 3-bit
- 4-bit
- 5-bit
- 6-bit
- 8-bit
- 16-bit
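To pull a single quantization from the Hub instead of cloning the whole repository, huggingface_hub can fetch one file at a time. This is a sketch under the assumption that files follow the usual model-name.quant.gguf naming; list the repo files first to confirm the exact names.

```python
# Minimal sketch: download one GGUF quantization from the Hub.
# The filename passed to hf_hub_download is a guess -- check the output of
# list_repo_files() for the names actually published in this repository.
from huggingface_hub import hf_hub_download, list_repo_files

repo_id = "theprint/theprint-12B-MoE-3A-GGUF"

# Show which quantized files are actually available.
print(list_repo_files(repo_id))

# Fetch one variant (assumed filename) and print its local cache path.
local_path = hf_hub_download(
    repo_id=repo_id,
    filename="theprint-12B-MoE-3A.Q4_K_M.gguf",  # assumed filename
)
print("Saved to", local_path)
```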
Model tree for theprint/theprint-12B-MoE-3A-GGUF
- Base model: meta-llama/Llama-3.2-3B-Instruct
- Finetuned: theprint/theprint-12B-MoE-3A