Update README.md
README.md
CHANGED
@@ -7,7 +7,8 @@ library_name: transformers
 tags:
 - mergekit
 - merge
-
+- qwen2
+- dpo
 ---
 # merge

@@ -21,7 +22,7 @@ This model was merged using the SLERP merge method.
 ### Models Merged

 The following models were included in the merge:
-* [v000000/Qwen2.5-14B-Gutenberg-
+* [v000000/Qwen2.5-14B-Gutenberg-1e-Delta](https://huggingface.co/v000000/Qwen2.5-14B-Gutenberg-1e-Delta)
 * [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)

 ### Configuration
@@ -32,9 +33,9 @@ The following YAML configuration was used to produce this model:
 models:
   - model: Qwen/Qwen2.5-14B-Instruct
 merge_method: slerp
-base_model: v000000/Qwen2.5-14B-Gutenberg-
+base_model: v000000/Qwen2.5-14B-Gutenberg-1e-Delta
 parameters:
   t:
     - value: [0, 0, 0.3, 0.4, 0.5, 0.6, 0.5, 0.4, 0.3, 0, 0]
 dtype: bfloat16
-```
+```
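Note on the configuration touched by this diff: the README names SLERP as the merge method and gives `t` as a list of eleven values. The sketch below illustrates what those two pieces could mean numerically. It is not mergekit's code. The helper names `slerp` and `layer_t` are hypothetical, and two things are assumptions: that the list is stretched evenly across the layer stack with linear interpolation, and that `t = 0` corresponds to the `base_model` weights.

```python
import math
from typing import List, Sequence


def slerp(t: float, v0: Sequence[float], v1: Sequence[float], eps: float = 1e-8) -> List[float]:
    """Spherical linear interpolation between two flat weight vectors.

    t = 0 returns v0 and t = 1 returns v1 (ASSUMPTION: v0 plays the role
    of base_model in the YAML above).
    """
    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    linear = [(1.0 - t) * a + t * b for a, b in zip(v0, v1)]
    if norm0 < eps or norm1 < eps:
        # A zero vector has no direction, so fall back to plain interpolation.
        return linear

    # Angle between the two directions, clamped against rounding error.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    theta = math.acos(max(-1.0, min(1.0, cos_theta)))
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        # Nearly parallel (or anti-parallel) vectors: the formula is ill-conditioned.
        return linear

    w0 = math.sin((1.0 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def layer_t(anchors: Sequence[float], layer_index: int, num_layers: int) -> float:
    """Pick the interpolation factor for one layer from a list of anchor values.

    ASSUMPTION: anchors are spaced evenly from the first to the last layer
    and values in between are linearly interpolated.
    """
    if num_layers == 1 or len(anchors) == 1:
        return float(anchors[0])
    pos = layer_index / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1.0 - frac) * anchors[lo] + frac * anchors[hi]


# Usage: the anchors are the `t` values from the configuration in this diff.
# The layer count of 48 is an illustrative value, not taken from the README.
anchors = [0, 0, 0.3, 0.4, 0.5, 0.6, 0.5, 0.4, 0.3, 0, 0]
per_layer = [layer_t(anchors, i, 48) for i in range(48)]
blended = slerp(per_layer[24], [1.0, 0.0], [0.0, 1.0])
```

Under these assumptions the first and last layers get `t = 0` and stay at the base model's weights, while the middle layers are blended most strongly toward the other model, with the factor peaking at 0.6.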