Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kazuuu 's Collections
llm app
Vision and Language
RL
Vision
Archtecture

RL

updated Mar 11, 2024
Upvote
-

  • Teaching Large Language Models to Reason with Reinforcement Learning

    Paper • 2403.04642 • Published Mar 7, 2024 • 50

  • Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

    Paper • 2403.03950 • Published Mar 6, 2024 • 16

  • RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches

    Paper • 2403.02709 • Published Mar 5, 2024 • 9
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs