This repo contains several GGUF quants of Hestia-20b (4-bit, 5-bit, and 8-bit).
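A minimal sketch of loading one of these quants with llama-cpp-python; the filename below is hypothetical, so point `model_path` at whichever quant file you actually download:

```python
# Minimal sketch, assuming llama-cpp-python is installed (pip install llama-cpp-python).
# The filename below is hypothetical; substitute the quant file you downloaded.
from llama_cpp import Llama

llm = Llama(
    model_path="hestia-20b.Q5_K_M.gguf",  # hypothetical filename
    n_ctx=4096,       # context window
    n_gpu_layers=-1,  # offload all layers to GPU if built with GPU support; 0 for CPU-only
)

out = llm("Once upon a time,", max_tokens=64)
print(out["choices"][0]["text"])
```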

This is a task_arithmetic merge of Harmonia (my 20b faux base model) with Noromaid and my LoRA-glued Nethena. It solidly outperforms Harmonia.

merge_method: task_arithmetic
base_model: athirdpath/Harmonia-20b
models:
  - model: athirdpath/Harmonia-20b
  - model: NeverSleep/Noromaid-20b-v0.1.1
    parameters:
      weight: 0.25
  - model: athirdpath/Nethena-20b-Glued
    parameters:
      weight: 0.2
dtype: float16
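For readers unfamiliar with task_arithmetic: each model's weights are expressed as a delta from the base, and the weighted deltas are added back onto the base. Below is a rough, illustrative sketch of that idea over plain state dicts; it is not mergekit's actual implementation, and the variable names are assumptions.

```python
# Illustrative sketch of task-arithmetic merging over plain state dicts.
# Not mergekit's implementation; state-dict variable names are assumptions.
import torch

def task_arithmetic_merge(base_sd, model_sds, weights):
    """merged = base + sum_i weight_i * (model_i - base), applied per tensor."""
    merged = {}
    for name, base_tensor in base_sd.items():
        delta = torch.zeros_like(base_tensor)
        for sd, w in zip(model_sds, weights):
            delta += w * (sd[name] - base_tensor)
        merged[name] = base_tensor + delta
    return merged

# Mirroring the config above: Noromaid at 0.25, glued Nethena at 0.2.
# merged = task_arithmetic_merge(harmonia_sd, [noromaid_sd, nethena_sd], [0.25, 0.2])
```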

Thanks to Undi95 for pioneering the 20B recipe, and for most of the models involved.
