gggg

justin6667

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80

upvoted an article 9 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 886

upvoted a paper 10 months ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6

upvoted 3 collections over 1 year ago

CoT

101 items • Updated Oct 4 • 8

Evaluation

6 items • Updated Aug 23 • 4

TIGERScore

List of model variants of TIGERScore checkpoints and the associated dataset • 8 items • Updated Sep 26, 2024 • 5

upvoted 2 papers over 1 year ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 48

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 107