bilickiv's Collections

Video analysis

updated Dec 4, 2024

  • Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

    Paper • 2410.03290 • Published Oct 4, 2024 • 7

  • TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

    Paper • 2411.18671 • Published Nov 27, 2024 • 20

  • VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

    Paper • 2412.00927 • Published Dec 1, 2024 • 29