Update README.md
Browse files
README.md
CHANGED
|
@@ -3,4 +3,9 @@ library_name: transformers
|
|
| 3 |
tags: []
|
| 4 |
---
|
| 5 |
|
| 6 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
tags: []
|
| 4 |
---
|
| 5 |
|
| 6 |
+
## Description
|
| 7 |
+
|
| 8 |
+
Llama3-Instruct-8B model finetuned by off-polciy WPO. Details in [WPO: Enhancing RLHF with Weighted Preference Optimization](https://arxiv.org/abs/2406.11827).
|
| 9 |
+
|
| 10 |
+
## License
|
| 11 |
+
This model is licensed under the Zoom software license and is permitted for use only for noncommercial, educational, or academic research purposes.
|