|
|
---
|
|
|
library_name: transformers
|
|
|
license: apache-2.0
|
|
|
datasets:
|
|
|
- Tesslate/Gradient-Reasoning
|
|
|
language:
|
|
|
- zho
|
|
|
- eng
|
|
|
- fra
|
|
|
- spa
|
|
|
- por
|
|
|
- deu
|
|
|
- ita
|
|
|
- rus
|
|
|
- jpn
|
|
|
- kor
|
|
|
- vie
|
|
|
- tha
|
|
|
- ara
|
|
|
base_model:
|
|
|
- Qwen/Qwen2.5-3B-Instruct
|
|
|
---
|
|
|
|
|
|
# Model Card for Gradience-3B
|
|
|
|
|
|
This model is still in preview/beta. We're still working on it! This is just so the community can try out our new "Gradient Reasoning" that intends to break problems down and reason faster.
|
|
|
|
|
|
|
|
|
You can use a system prompt to enable thinking:
|
|
|
"First, think step-by-step to reach the solution. Enclose your entire reasoning process within <|begin_of_thought|> and <|end_of_thought|> tags."
|
|
|
You can try sampling params:
|
|
|
Temp: 0.76, TopP: 0.62, Topk 30-68, Rep: 1.0, minp: 0.05 |