XiaoranShang
xrose
AI & ML interests
LLM attack and defense
Recent Activity
upvoted
a
paper
3 days ago
OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
upvoted
a
paper
about 1 month ago
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
upvoted
a
paper
about 2 months ago
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model
Reasoning