Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
13
Xi Yang
xiyang99
Follow
SteveSHEN's profile picture
Stars321123's profile picture
Fishtiks's profile picture
7 followers
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
10 days ago
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
authored
a paper
about 2 months ago
CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning
authored
a paper
about 2 months ago
HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation
View all activity
Organizations
Articles
1
Article
33
Letting Large Models Debate: The First Multilingual LLM Debate Competition
Papers
13
arxiv:
2509.17177
arxiv:
2508.11252
arxiv:
2508.10015
arxiv:
2508.02178
Expand 13 papers
models
0
None public yet
datasets
0
None public yet