Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ikarth 's Collections
Training Research

Training Research

updated Oct 14
Upvote
-

  • Language Models Can Learn from Verbal Feedback Without Scalar Rewards

    Paper • 2509.22638 • Published Sep 26 • 67

  • Don't Just Fine-tune the Agent, Tune the Environment

    Paper • 2510.10197 • Published Oct 11 • 28

  • Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

    Paper • 2510.08673 • Published Oct 9 • 121

  • Agent Learning via Early Experience

    Paper • 2510.08558 • Published Oct 9 • 262

  • Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models

    Paper • 2510.08492 • Published Oct 9 • 8
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs