Collections

Discover the best community collections!

Collections including paper arxiv:2409.02097
23\ Info - pages
1. TinyGSM: achieving >80% on GSM8k with small language models
Collection by Apr 7
  • TinyGSM: achieving >80% on GSM8k with small language models

    Paper • 2312.09241 • Published Dec 14, 2023 • 40
  • ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

    Paper • 2403.03853 • Published Mar 6, 2024 • 66
  • Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

    Paper • 2403.18795 • Published Mar 27, 2024 • 20
  • Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models

    Paper • 2404.04478 • Published Apr 6, 2024 • 13