This repository hosts a Janus-Pro 7B model trained with GCPO. The reward model is GenEval.
The training code is available in the GCPO repository.