4-bit GGUF models for CPU+GPU inference
This model is the static version of moloras (Mixture-of-multi-LoRAs), built from 6 Mistral-based LoRA modules.
All 6 LoRA modules come from speechless-mistral-7b-dare-0.85.
The router of mixture-of-multi-loras assembles the LoRA modules automatically: it uses a gradient-free approach to obtain the coefficients of the LoRA modules and needs only a handful of inference steps for unseen tasks.
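To make the idea of gradient-free coefficient search concrete, here is a minimal toy sketch. It is not the code from the multi_loras repository: the function names (`combine`, `loss`, `search_coefficients`), the random hill-climbing search, and the squared-error objective are all assumptions chosen for illustration. In a real setup the objective would presumably be the model's loss on a few examples of the unseen task, and each module would be a set of LoRA weight matrices rather than a flat list of floats.

```python
import random

def combine(modules, coeffs):
    """Weighted sum of LoRA deltas; each module is a flat list of floats (toy stand-in)."""
    size = len(modules[0])
    return [sum(c * m[i] for c, m in zip(coeffs, modules)) for i in range(size)]

def loss(coeffs, modules, target):
    """Hypothetical objective: squared error between merged delta and a target delta."""
    merged = combine(modules, coeffs)
    return sum((a - b) ** 2 for a, b in zip(merged, target))

def search_coefficients(modules, target, steps=200, sigma=0.3, seed=0):
    """Random hill climbing: no gradients, only repeated evaluations of the objective."""
    rng = random.Random(seed)
    best = [1.0 / len(modules)] * len(modules)  # start from a uniform mixture
    best_loss = loss(best, modules, target)
    for _ in range(steps):
        candidate = [c + rng.gauss(0.0, sigma) for c in best]
        candidate_loss = loss(candidate, modules, target)
        if candidate_loss < best_loss:  # keep a candidate only if it improves
            best, best_loss = candidate, candidate_loss
    return best, best_loss

# Toy data: 6 modules of 8 parameters each, and a target built from known weights.
data_rng = random.Random(42)
modules = [[data_rng.uniform(-1.0, 1.0) for _ in range(8)] for _ in range(6)]
true_coeffs = [0.5, 0.0, 0.2, 0.1, 0.0, 0.2]
target = combine(modules, true_coeffs)

initial_loss = loss([1.0 / 6] * 6, modules, target)
coeffs, final_loss = search_coefficients(modules, target)
print(f"initial loss {initial_loss:.4f}, final loss {final_loss:.4f}")
```

Because the search only ever accepts improving candidates, the final loss can never exceed the loss of the uniform starting mixture. Each step costs one evaluation of the objective, which in this toy stands in for an inference pass.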
Code: https://github.com/uukuguy/multi_loras?tab=readme-ov-file#mixture-of-multi-loras
| Metric | Value |
|---|---|
| ARC | 59.98 |
| HellaSwag | 83.29 |
| MMLU | 64.12 |
| TruthfulQA | 42.15 |
| Winogrande | 78.37 |
| GSM8K | 37.68 |
| Average | 60.93 |
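
The Average row is the unweighted mean of the six benchmark scores above. A short check, with the values copied from the table:

```python
# Benchmark scores copied from the table above.
scores = {
    "ARC": 59.98,
    "HellaSwag": 83.29,
    "MMLU": 64.12,
    "TruthfulQA": 42.15,
    "Winogrande": 78.37,
    "GSM8K": 37.68,
}

# Unweighted mean, rounded to two decimals as in the table.
average = round(sum(scores.values()) / len(scores), 2)
print(average)
```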