Kai Yang

yangkaiSIGS

https://yk7333.github.io/

yk7333

AI & ML interests

None yet

Recent Activity

updated a Space 4 days ago

yangkaiSIGS/entropic

authored a paper 21 days ago

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

authored a paper 21 days ago

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

upvoted a paper 21 days ago

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

Paper • 2511.15248 • Published 23 days ago • 6

upvoted a paper about 2 years ago

Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model

Paper • 2311.13231 • Published Nov 22, 2023 • 29