Xianyu's picture

1 4 8

Xianyu

catqaq

·

https://github.com/catqaq

AI & ML interests

Founder of OpenLLMAI, do something cool! Efficient LLM，Alignment，data efficiency，Parameter efficiency，RLHF

Organizations

upvoted a paper 5 months ago

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Paper • 2507.13158 • Published Jul 17 • 23

upvoted 2 articles over 1 year ago

Article

Mixture of Experts Explained

+4

Dec 11, 2023

•

1.02k

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

+6

Jul 23, 2024

•

241

upvoted a paper over 1 year ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 40