Zhaorun Chen

Zhaorun

https://billchan226.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published 8 days ago • 72

upvoted a paper 8 days ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published 9 days ago • 53

upvoted a paper 10 days ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published 11 days ago • 31

upvoted a paper 20 days ago

Thought Communication in Multiagent Collaboration

Paper • 2510.20733 • Published 21 days ago • 13

upvoted a paper 28 days ago

AI for Service: Proactive Assistance with AI Glasses

Paper • 2510.14359 • Published 28 days ago • 71

upvoted 3 papers about 1 month ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 262

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1 • 39

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1 • 58

upvoted a paper 3 months ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7 • 138

upvoted a paper 4 months ago

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15 • 64

upvoted 3 papers 5 months ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16 • 18

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Paper • 2506.10128 • Published Jun 11 • 22

SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations

Paper • 2412.06878 • Published Dec 9, 2024 • 1

upvoted a collection 7 months ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.42k

upvoted a paper 7 months ago

ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning

Paper • 2503.22738 • Published Mar 26 • 17

upvoted a paper 8 months ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31 • 30

upvoted 2 papers 9 months ago

Mobius: Text to Seamless Looping Video Generation via Latent Shift

Paper • 2502.20307 • Published Feb 27 • 19

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

upvoted a paper 11 months ago

GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Paper • 2412.04440 • Published Dec 5, 2024 • 22

upvoted a paper 12 months ago

GRAPE: Generalizing Robot Policy via Preference Alignment

Paper • 2411.19309 • Published Nov 28, 2024 • 47
