Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gggg's picture
3 8 6

gggg

justin6667
·

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80
upvoted an article 9 months ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28
•
886
upvoted a paper 10 months ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 6
upvoted 3 collections over 1 year ago

CoT

Collection
101 items • Updated Oct 4 • 8

Evaluation

Collection
6 items • Updated Aug 23 • 4

TIGERScore

Collection
List of model variates of TIGEREScore checkpoints and the associated dataset • 8 items • Updated Sep 26, 2024 • 5
upvoted 2 papers over 1 year ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 48

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 107
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs