harryleafchen commited on
Commit
be2d39f
·
verified ·
1 Parent(s): 5af9ddb

Add link to Kaiyuan-Spark repo on GitHub

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -22,9 +22,11 @@ it achieves performance competitive with current state-of-the-art fully open mod
22
  <img alt="Model Performance Comparison" src="model_performance_comparison.svg"/>
23
  </center>
24
 
25
- The dataset used to train Kaiyuan-2B is published at <https://huggingface.co/datasets/thu-pacman/PCMind-2.1-Kaiyuan-2B>.
26
 
27
- The _PCMind-2.1-Kaiyuan-2B Technical Report_ is published at <https://arxiv.org/abs/2512.07612>.
 
 
28
 
29
  ## Introduction
30
 
@@ -78,7 +80,8 @@ Please cite [our technical report](https://arxiv.org/abs/2512.07612) if you use
78
 
79
  ## License
80
 
81
- The code and model weights of Kaiyuan-2B are licensed under [Apache-2.0 License](LICENSE) with the following copyright notice.
 
82
 
83
  ```text
84
  Copyright 2025 Tsinghua University & Peng Cheng Laboratory
 
22
  <img alt="Model Performance Comparison" src="model_performance_comparison.svg"/>
23
  </center>
24
 
25
+ Other artifacts:
26
 
27
+ * The dataset used to train Kaiyuan-2B is published at <https://huggingface.co/datasets/thu-pacman/PCMind-2.1-Kaiyuan-2B>.
28
+ * The _PCMind-2.1-Kaiyuan-2B Technical Report_ is published at <https://arxiv.org/abs/2512.07612>.
29
+ * The data preprocessing framework built on PySpark is published at <https://github.com/thu-pacman/Kaiyuan-Spark>.
30
 
31
  ## Introduction
32
 
 
80
 
81
  ## License
82
 
83
+ All artifacts (including code, model weights, and training data) of Kaiyuan-2B
84
+ are licensed under [Apache-2.0 License](LICENSE) with the following copyright notice:
85
 
86
  ```text
87
  Copyright 2025 Tsinghua University & Peng Cheng Laboratory