Add link to Kaiyuan-Spark repo on GitHub
README.md
CHANGED
@@ -22,9 +22,11 @@ it achieves performance competitive with current state-of-the-art fully open mod
 <img alt="Model Performance Comparison" src="model_performance_comparison.svg"/>
 </center>
 
-
+Other artifacts:
 
-The
+* The dataset used to train Kaiyuan-2B is published at <https://huggingface.co/datasets/thu-pacman/PCMind-2.1-Kaiyuan-2B>.
+* The _PCMind-2.1-Kaiyuan-2B Technical Report_ is published at <https://arxiv.org/abs/2512.07612>.
+* The data preprocessing framework built on PySpark is published at <https://github.com/thu-pacman/Kaiyuan-Spark>.
 
 ## Introduction
 
@@ -78,7 +80,8 @@ Please cite [our technical report](https://arxiv.org/abs/2512.07612) if you use
 
 ## License
 
-
+All artifacts (including code, model weights, and training data) of Kaiyuan-2B
+are licensed under [Apache-2.0 License](LICENSE) with the following copyright notice:
 
 ```text
 Copyright 2025 Tsinghua University & Peng Cheng Laboratory