Shudong Liu's picture

Shudong Liu

Sudanl

·

http://sudanl.github.io

AI & ML interests

NLP, LLM

Recent Activity

upvoted a paper 6 days ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

authored a paper 11 days ago

Kimi K2.5: Visual Agentic Intelligence

upvoted a paper 12 days ago

Kimi K2.5: Visual Agentic Intelligence

View all activity

Organizations

upvoted a paper 6 days ago

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published 10 days ago • 28

authored a paper 11 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 13 days ago • 232

upvoted a paper 12 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 13 days ago • 232

liked a model 20 days ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 171B • Updated 11 days ago • 757k • • 2.19k

upvoted a paper 25 days ago

Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics

Paper • 2601.14027 • Published 26 days ago • 12

updated 2 models about 1 month ago

opencompass/CompassVerifier-3B

3B • Updated Jan 4 • 922 • 7

opencompass/CompassVerifier-32B

33B • Updated Jan 4 • 2 • 7

liked a Space about 2 months ago

ATLAS Benchmark

ATLAS for Frontier Scientific Benchmark

authored a paper 2 months ago

How Far Are We from Genuinely Useful Deep Research Agents?

Paper • 2512.01948 • Published Dec 1, 2025 • 56

upvoted a paper 3 months ago

How Far Are We from Genuinely Useful Deep Research Agents?

Paper • 2512.01948 • Published Dec 1, 2025 • 56

updated a model 3 months ago

opencompass/CompassVerifier-7B

8B • Updated Nov 26, 2025 • 1.31k • 4

authored a paper 3 months ago

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Paper • 2511.08487 • Published Nov 11, 2025 • 3

upvoted a paper 3 months ago

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Paper • 2511.08487 • Published Nov 11, 2025 • 3

updated a Space 3 months ago

ATLAS Benchmark

ATLAS for Frontier Scientific Benchmark

authored a paper 3 months ago

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Paper • 2511.14366 • Published Nov 18, 2025 • 17

upvoted a paper 3 months ago

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Paper • 2511.14366 • Published Nov 18, 2025 • 17

published a Space 3 months ago

ATLAS Benchmark

ATLAS for Frontier Scientific Benchmark

updated a Space 4 months ago

SAGE Benchmark

SAGE Scientific Reasoning Benchmark Leaderboard

upvoted a paper 5 months ago

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 80

liked a model 5 months ago

opencompass/CompassVerifier-3B

3B • Updated Jan 4 • 922 • 7