rayruiyang's Collections

VST

updated 12 days ago

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.

Upvotes: 6

  • rayruiyang/VST-3B-RL

    Image-Text-to-Text • 4B params • Updated 13 days ago • 446 downloads • 2 likes

  • rayruiyang/VST-3B-SFT

    Image-Text-to-Text • 4B params • Updated 13 days ago • 1.4k downloads

  • rayruiyang/VST-7B-SFT

    Image-Text-to-Text • 8B params • Updated 13 days ago • 1.92k downloads

  • rayruiyang/VST-7B-RL

    Image-Text-to-Text • 8B params • Updated 13 days ago • 579 downloads

  • Visual Spatial Tuning

    Paper • 2511.05491 • Published 17 days ago • 49 upvotes