Yaqi Li

lyq9811

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 months ago

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4 • 36

upvoted a paper 6 months ago

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published May 26 • 43