Mikhail Terekhov's picture

1 5

Mikhail Terekhov

terekhov

·

MikhailTerekhov

AI & ML interests

Reinforcement Learning, Multi-objective Reinforcement Learning, RLHF

Recent Activity

upvoted a paper about 1 month ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

authored a paper about 1 month ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

authored a paper about 1 month ago

Control Tax: The Price of Keeping AI in Check

View all activity

Organizations

Papers 3

arxiv:2510.09462

arxiv:2506.05296

arxiv:2410.22366

models 0

None public yet

datasets 0

None public yet