Files changed (1) hide show
  1. README.md +31 -19
README.md CHANGED
@@ -1,20 +1,32 @@
1
- ---
2
- library_name: transformers
3
- license: apache-2.0
4
- datasets:
5
- - Tesslate/Gradient-Reasoning
6
- language:
7
- - en
8
- base_model:
9
- - Qwen/Qwen2.5-3B-Instruct
10
- ---
11
-
12
- # Model Card for Gradience-3B
13
-
14
- This model is still in preview/beta. We're still working on it! This is just so the community can try out our new "Gradient Reasoning" that intends to break problems down and reason faster.
15
-
16
-
17
- You can use a system prompt to enable thinking:
18
- "First, think step-by-step to reach the solution. Enclose your entire reasoning process within <|begin_of_thought|> and <|end_of_thought|> tags."
19
- You can try sampling params:
 
 
 
 
 
 
 
 
 
 
 
 
20
  Temp: 0.76, TopP: 0.62, Topk 30-68, Rep: 1.0, minp: 0.05
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ datasets:
5
+ - Tesslate/Gradient-Reasoning
6
+ language:
7
+ - zho
8
+ - eng
9
+ - fra
10
+ - spa
11
+ - por
12
+ - deu
13
+ - ita
14
+ - rus
15
+ - jpn
16
+ - kor
17
+ - vie
18
+ - tha
19
+ - ara
20
+ base_model:
21
+ - Qwen/Qwen2.5-3B-Instruct
22
+ ---
23
+
24
+ # Model Card for Gradience-3B
25
+
26
+ This model is still in preview/beta. We're still working on it! This is just so the community can try out our new "Gradient Reasoning" that intends to break problems down and reason faster.
27
+
28
+
29
+ You can use a system prompt to enable thinking:
30
+ "First, think step-by-step to reach the solution. Enclose your entire reasoning process within <|begin_of_thought|> and <|end_of_thought|> tags."
31
+ You can try sampling params:
32
  Temp: 0.76, TopP: 0.62, Topk 30-68, Rep: 1.0, minp: 0.05