Chenyang Li's picture

42 10

Chenyang Li

MorningsunLee

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

upvoted a paper 10 days ago

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

upvoted a paper 10 days ago

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

View all activity

Organizations

None yet

upvoted 4 papers 10 days ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

Paper • 2511.03146 • Published 14 days ago • 7

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

Paper • 2511.04655 • Published 12 days ago • 7

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Paper • 2511.03774 • Published 13 days ago • 12

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published 13 days ago • 26

upvoted 4 papers 11 days ago

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published 12 days ago • 94

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 12 days ago • 191

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Paper • 2511.04307 • Published 12 days ago • 14

HoneyBee: Data Recipes for Vision-Language Reasoners

Paper • 2510.12225 • Published Oct 14 • 10

upvoted 4 papers 19 days ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published 21 days ago • 92

LongCat-Video Technical Report

Paper • 2510.22200 • Published 24 days ago • 24

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published 22 days ago • 20

A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

Paper • 2510.23587 • Published 22 days ago • 65

upvoted a paper 22 days ago

Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation

Paper • 2510.21583 • Published 25 days ago • 30

upvoted 2 papers 23 days ago

A Definition of AGI

Paper • 2510.18212 • Published 29 days ago • 33

UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

Paper • 2510.20286 • Published 26 days ago • 23

upvoted a collection 24 days ago

Qwen3-VL

37 items • Updated 17 days ago • 411

upvoted a paper 25 days ago

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published 26 days ago • 38

upvoted a paper 30 days ago

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

Paper • 2510.14616 • Published Oct 16 • 11

upvoted a paper about 2 months ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36

upvoted a paper 2 months ago

LLM-I: LLMs are Naturally Interleaved Multimodal Creators

Paper • 2509.13642 • Published Sep 17 • 8