rulins/rl_rag_surveyqa_validation_longform_finegrained_with_system_prompt Viewer • Updated Aug 21 • 703 • 11
rulins/rl_rag_surveyqa_validation_longform_averaged_outcome_with_system_prompt Viewer • Updated Aug 20 • 703 • 17
rulins/4math-openthiner3-n64-filtered-10_max_each-openthoughts3-searched-results-merged Updated Jul 21 • 1.02k
rulins/rl_rag_surveyqa_validation_longform_rubrics_only_with_system_prompt Viewer • Updated Jul 20 • 703 • 9
rulins/4math-openthiner3-n64-filtered-5each-openthoughts3-searched-results-merged Updated Jul 17 • 17
rulins/5math-5assessors-n64-filtered-20each-openthoughts3-searched-results-merged Updated Jul 17 • 92
rulins/math-qwen25-math-7b-n64-filtered-10each-openthoughts3-searched-results-merged Updated Jul 17 • 57
rulins/5math-qwen-math-n64-mv-filtered-10each-openthoughts3-searched-results-merged Updated Jul 16 • 59