Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhiyuan Li's picture
1 4 2

Zhiyuan Li

zhiyuan218
·
  • ZhiyuanLi218

AI & ML interests

None yet

Organizations

jilin university's profile picture

upvoted 2 papers 3 months ago

THINK-Bench: Evaluating Thinking Efficiency and Chain-of-Thought Quality of Large Reasoning Models

Paper • 2505.22113 • Published May 28 • 1

Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability

Paper • 2508.04017 • Published Aug 6 • 11
upvoted a paper 9 months ago

StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following

Paper • 2502.14494 • Published Feb 20 • 15
upvoted a paper about 1 year ago

Large Language Model Evaluation via Matrix Nuclear-Norm

Paper • 2410.10672 • Published Oct 14, 2024 • 19
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs