Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Daniel Gao's picture
6 2

Daniel Gao

maybelu9
·

AI & ML interests

None yet

Organizations

maybelu9's profile picture

upvoted 3 papers 9 months ago

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published Mar 3 • 44

WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation

Paper • 2502.08047 • Published Feb 12 • 28

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11 • 46
upvoted a paper 12 months ago

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 90
upvoted a paper about 1 year ago

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published Nov 15, 2024 • 34
upvoted a collection about 1 year ago

Qwen2-VL

Collection
Vision-language model series based on Qwen2 • 16 items • Updated Jul 21 • 226
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs