Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2406.09308
Transformers
Collection by Aug 9, 2024
-
  • Uncovering mesa-optimization algorithms in Transformers

    Paper • 2309.05858 • Published Sep 11, 2023 • 13
  • ProPainter: Improving Propagation and Transformer for Video Inpainting

    Paper • 2309.03897 • Published Sep 7, 2023 • 27
  • Approximating Two-Layer Feedforward Networks for Efficient Transformers

    Paper • 2310.10837 • Published Oct 16, 2023 • 11
  • CLEX: Continuous Length Extrapolation for Large Language Models

    Paper • 2310.16450 • Published Oct 25, 2023 • 10
Transformers
Collection by Aug 9, 2024
-
  • Uncovering mesa-optimization algorithms in Transformers

    Paper • 2309.05858 • Published Sep 11, 2023 • 13
  • ProPainter: Improving Propagation and Transformer for Video Inpainting

    Paper • 2309.03897 • Published Sep 7, 2023 • 27
  • Approximating Two-Layer Feedforward Networks for Efficient Transformers

    Paper • 2310.10837 • Published Oct 16, 2023 • 11
  • CLEX: Continuous Length Extrapolation for Large Language Models

    Paper • 2310.16450 • Published Oct 25, 2023 • 10
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs