dyyyyyyyy
/

FAPO-GenRM-4B

Text Generation

text-generation-inference

Model card Files Files and versions

dyyyyyyyy commited on Oct 28, 2025

Commit

d525519

·

verified ·

1 Parent(s): 9062fda

Update README.md

Files changed (1) hide show

README.md +23 -18

README.md CHANGED Viewed

@@ -1,18 +1,23 @@
----
-license: apache-2.0
----
-Generative Reward Model trained with [FAPO-Critic](https://huggingface.co/datasets/dyyyyyyyy/FAPO-Critic)
----
-Project Homepage: https://fapo-rl.github.io/
-Code Implementation: https://github.com/volcengine/verl/tree/main/recipe/fapo
-Welcome to follow and cite our works!
-BibTeX citation:
-```bibtex
-comming soon
-```

+---
+license: apache-2.0
+---
+Generative Reward Model trained with [FAPO-Critic](https://huggingface.co/datasets/dyyyyyyyy/FAPO-Critic)
+---
+Project Homepage: https://fapo-rl.github.io/
+Code Implementation: https://github.com/volcengine/verl/tree/main/recipe/fapo
+Welcome to follow and cite our works!
+BibTeX citation:
+```bibtex
+@article{ding2025fapo,
+  title={FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning},
+  author={Ding, Yuyang and Zhang, Chi and Li, Juntao and Lin, Haibin and Liu, Xin and Zhang, Min},
+  journal={arXiv preprint arXiv:2510.22543},
+  year={2025}
+}
+```