Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ByRookie 's Collections
kd
pretrain data selectection
llm math
llm engineer
llm length control
reward model
dataset

kd

updated Oct 23, 2024
Upvote
-

  • Aligning Teacher with Student Preferences for Tailored Training Data Generation

    Paper • 2406.19227 • Published Jun 27, 2024 • 25

  • Pre-training Distillation for Large Language Models: A Design Space Exploration

    Paper • 2410.16215 • Published Oct 21, 2024 • 16

  • Baichuan Alignment Technical Report

    Paper • 2410.14940 • Published Oct 19, 2024 • 51

  • MiniPLM: Knowledge Distillation for Pre-Training Language Models

    Paper • 2410.17215 • Published Oct 22, 2024 • 17
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs