arxiv:2601.19532
Marthe Ballon
martheballon
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
Benchmarks Saturate When The Model Gets Smarter Than The Judge
updated
a dataset
1 day ago
martheballon/Omni-MATH-2
submitted
a paper
1 day ago
Benchmarks Saturate When The Model Gets Smarter Than The Judge
Organizations
None yet