Wang's picture

2 23

Wang

VincentWang

·

VincentWong1

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago

OpenAssistant/reward-model-deberta-v3-large-v2

liked a dataset 7 days ago

Mxode/Chinese-Instruct

liked a dataset 8 days ago

BAAI/IndustryCorpus2

View all activity

Organizations

None yet

upvoted an article 8 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11

•

94

upvoted a paper over 1 year ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 51