|
|
--- |
|
|
license: mit |
|
|
language: |
|
|
- zho |
|
|
- eng |
|
|
- fra |
|
|
- spa |
|
|
- por |
|
|
- deu |
|
|
- ita |
|
|
- rus |
|
|
- jpn |
|
|
- kor |
|
|
- vie |
|
|
- tha |
|
|
- ara |
|
|
pipeline_tag: text-generation |
|
|
datasets: |
|
|
- DiamondGotCat/Zeta-2-Dataset |
|
|
new_version: Zeta-LLM/Zeta-3 |
|
|
--- |
|
|
|
|
|
 |
|
|
|
|
|
# Further into the future. Better. New achievement: Zeta 2. |
|
|
Zeta 2 is a new version of Zeta that aims to take the next step forward from Zeta 1, which proved that "ordinary consumers can create LLMs." |
|
|
|
|
|
**Ollama:** [DiamondGotCat/Zeta-2](https://ollama.com/DiamondGotCat/Zeta-2) |
|
|
|
|
|
## Quantized Model (GGUF) |
|
|
- [Static Quant](https://huggingface.co/mradermacher/Zeta-2-GGUF) |
|
|
- [Weighted/imatrix Quant](https://huggingface.co/mradermacher/Zeta-2-i1-GGUF) |
|
|
|
|
|
## About Zeta 2 |
|
|
Zeta 2 is an LLM with about 460 million parameters. |
|
|
|
|
|
It might be better to call it an SLM. |
|
|
|
|
|
Zeta 2 is an SLM that was painstakingly created on a consumer computer. |
|
|
|
|
|
## Prompt Template |
|
|
``` |
|
|
{{ if .System }}{{ .System }}{{ end }} |
|
|
{{ if .Prompt }}<USER>{{ .Prompt }}</USER>{{ end }} |
|
|
<ASSISTANT> |
|
|
``` |
|
|
|
|
|
## Computer Spec |
|
|
|
|
|
**Machine:** *Mac Mini (M2 Pro, 32GB RAM, 2023)* |
|
|
|
|
|
This is a technical matter, but on a Mac with Apple Silicon you can only choose between CPU or MPS (Apple's proprietary API). |
|
|
|
|
|
CPU has some compatibility but is too slow. |
|
|
|
|
|
MPS is somewhat faster but doesn't allow the use of optimization systems such as fp16. |
|
|
|
|
|
Zeta 2 was built using MPS. |
|
|
|
|
|
## Arguments of Trainer |
|
|
- **train epochs:** 3 |
|
|
- **warmup steps:** 100 |
|
|
|
|
|
## Datasets |
|
|
Details of the dataset used can be found [here](https://github.com/DiamondGotCat/Zeta-Dataset/releases/tag/zeta-2) |
|
|
|
|
|
## Links |
|
|
GitHub: [Zeta](https://github.com/DiamondGotCat/Zeta) |
|
|
|
|
|
--- |
|
|
|
|
|
Zeta is just a small SLM. |
|
|
But don't forget that it has big dreams inside. |