rubricreward/arena-human-preference-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
61.1k
•
7
rubricreward/arena-human-preference-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
60.7k
•
10
rubricreward/arena-human-preference-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
60.3k
•
11
rubricreward/arena-human-preference-tgt_prompt_tgt_thinking
Viewer
•
Updated
•
120k
•
17
rubricreward/arena-human-preference-tgt_prompt_en_thinking
Viewer
•
Updated
•
120k
•
10
rubricreward/arena-human-preference-en_prompt_en_thinking
Viewer
•
Updated
•
120k
•
13
rubricreward/PolyGuardMix
Viewer
•
Updated
•
2.93M
•
14
rubricreward/mR3-Dataset-Filtered1
Viewer
•
Updated
•
696k
•
41
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
624k
•
9
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
631k
•
12
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
638k
•
9
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking
Viewer
•
Updated
•
890k
•
15
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking
Viewer
•
Updated
•
904k
•
16
•
1
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking
Viewer
•
Updated
•
903k
•
22
Viewer
•
Updated
•
40.5k
•
8
rubricreward/HelpSteer3-en_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
12.7k
•
11
rubricreward/HumanEval-XL-Python-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
3.1k
•
7
rubricreward/MATH-500-Multilingual-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
4.57k
•
7
rubricreward/MMMLU-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
148k
•
8
rubricreward/HumanEval-XL-Python-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
3.2k
•
8
rubricreward/MATH-500-Multilingual-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
4.83k
•
8
rubricreward/MMMLU-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
158k
•
11
rubricreward/HumanEval-XL-Python-en_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
3.09k
•
8
rubricreward/MATH-500-Multilingual-en_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
4.59k
•
8
rubricreward/MMMLU-en_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
148k
•
6
rubricreward/arena-human-preference-en_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
51.4k
•
8
rubricreward/HumanEval-XL-Python-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
3.25k
•
7
rubricreward/MATH-500-Multilingual-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
4.83k
•
6
rubricreward/MMMLU-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
158k
•
10
rubricreward/HumanEval-XL-Python-tgt_prompt_tgt_thinking
Viewer
•
Updated
•
3.65k
•
8