arXiv:2510.23393
Evgeniy Glukhov
jenyag
AI & ML interests
None yet
Recent Activity
authored
a paper
16 days ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N
Sampling via max@k Optimisation
commented on
a paper
16 days ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N
Sampling via max@k Optimisation