Files changed (1) hide show
  1. README.md +42 -30
README.md CHANGED
@@ -1,31 +1,43 @@
1
- ---
2
- license: mit
3
- datasets:
4
- - kxdw2580/catgirl-dataset
5
- language:
6
- - zh
7
- base_model:
8
- - Qwen/Qwen2.5-0.5B-Instruct
9
- ---
10
-
11
- # kxdw2580/Qwen2.5-0.5B-Catgirl-test0426
12
-
13
- This model is a test model, designed for phased lightweight testing of the dataset.
14
-
15
- The test dataset has fixed this issue:
16
-
17
- - Model's outputs "~" causing rendering errors.
18
-
19
- After testing, the objectives of the dataset fixes have been achieved.
20
-
21
- ## Other
22
-
23
- As a 0.5b model, its performance is very poor, especially in the missing English part of the dataset. We do not recommend using this model unless there is a specific need.
24
-
25
- We are working hard to improve the training results on smaller models, but it is obviously unlikely for the 0.5b model.
26
-
27
- Specific training results can be seen at [swanlab](https://swanlab.cn/@shadow01a/qwen-catgirl/runs/q58lh8yf6itgoamcoq4q8/chart)
28
-
29
- Additionally, I have observed that with models of this size, a smaller training loss does not always indicate better model performance, and sometimes even leads to a decline in performance. [This swanlab record](https://swanlab.cn/@shadow01a/qwen-catgirl/runs/d29hi6y9d7g772ib5vbtx/chart) is the result of further training for this model. After testing, I found that its performance is even worse than the original model. This model has not been publicly released and has been deleted.
30
-
 
 
 
 
 
 
 
 
 
 
 
 
31
  I would be very happy to communicate if you wish to!
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - kxdw2580/catgirl-dataset
5
+ language:
6
+ - zho
7
+ - eng
8
+ - fra
9
+ - spa
10
+ - por
11
+ - deu
12
+ - ita
13
+ - rus
14
+ - jpn
15
+ - kor
16
+ - vie
17
+ - tha
18
+ - ara
19
+ base_model:
20
+ - Qwen/Qwen2.5-0.5B-Instruct
21
+ ---
22
+
23
+ # kxdw2580/Qwen2.5-0.5B-Catgirl-test0426
24
+
25
+ This model is a test model, designed for phased lightweight testing of the dataset.
26
+
27
+ The test dataset has fixed this issue:
28
+
29
+ - Model's outputs "~" causing rendering errors.
30
+
31
+ After testing, the objectives of the dataset fixes have been achieved.
32
+
33
+ ## Other
34
+
35
+ As a 0.5b model, its performance is very poor, especially in the missing English part of the dataset. We do not recommend using this model unless there is a specific need.
36
+
37
+ We are working hard to improve the training results on smaller models, but it is obviously unlikely for the 0.5b model.
38
+
39
+ Specific training results can be seen at [swanlab](https://swanlab.cn/@shadow01a/qwen-catgirl/runs/q58lh8yf6itgoamcoq4q8/chart)
40
+
41
+ Additionally, I have observed that with models of this size, a smaller training loss does not always indicate better model performance, and sometimes even leads to a decline in performance. [This swanlab record](https://swanlab.cn/@shadow01a/qwen-catgirl/runs/d29hi6y9d7g772ib5vbtx/chart) is the result of further training for this model. After testing, I found that its performance is even worse than the original model. This model has not been publicly released and has been deleted.
42
+
43
  I would be very happy to communicate if you wish to!