alvarobartt/ultrafeedback-multi-binarized-quality-preferences-cleaned Viewer • Updated Dec 20, 2023 • 155k • 13
alvarobartt/social-reasoning-rlhf-ULTRAFEEDBACK-honesty Viewer • Updated Nov 7, 2023 • 100 • 8 • 1