Update README.md
Browse files
README.md
CHANGED
|
@@ -8,6 +8,8 @@ pipeline_tag: text-generation
|
|
| 8 |
tags:
|
| 9 |
- qwen3
|
| 10 |
- newstar
|
|
|
|
|
|
|
| 11 |
---
|
| 12 |
## Overview
|
| 13 |
Newstar-Qwen3-0.6B is a finetuned version of the Qwen3-0.6B base model. It uses Newstar’s instruction tuning on top of Qwen3’s pretrained weights. The tuning was done using the ITP-v2 dataset.
|
|
@@ -59,4 +61,4 @@ This model is designed **without thinking capabilities**. It intentionally avoid
|
|
| 59 |
primaryClass={cs.CL},
|
| 60 |
url={https://arxiv.org/abs/2505.09388},
|
| 61 |
}
|
| 62 |
-
```
|
|
|
|
| 8 |
tags:
|
| 9 |
- qwen3
|
| 10 |
- newstar
|
| 11 |
+
datasets:
|
| 12 |
+
- marcuscedricridia/itp-v2
|
| 13 |
---
|
| 14 |
## Overview
|
| 15 |
Newstar-Qwen3-0.6B is a finetuned version of the Qwen3-0.6B base model. It uses Newstar’s instruction tuning on top of Qwen3’s pretrained weights. The tuning was done using the ITP-v2 dataset.
|
|
|
|
| 61 |
primaryClass={cs.CL},
|
| 62 |
url={https://arxiv.org/abs/2505.09388},
|
| 63 |
}
|
| 64 |
+
```
|