sikang99's Collections
SLAM
3DGS NeRF
Diffusion Models
VLM, MLLM
Diffusion Model
Reinforcement Learning
Vision Processing
Simulation
VLA Models
AI Agents
3D Generation
Video Generation

VLM, MLLM

updated Jul 1

  • UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding

    Paper • 2506.23219 • Published Jun 29 • 7