Zikun Li's picture

140 9

Zikun Li

zikun-li

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

The Art of Scaling Reinforcement Learning Compute for LLMs

upvoted a paper 23 days ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

upvoted a paper 24 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

View all activity

Organizations

None yet

upvoted 2 papers 23 days ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published about 1 month ago • 30

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 24 days ago • 111

upvoted a paper 24 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published 26 days ago • 117

upvoted 3 papers 25 days ago

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 76

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 92

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 262

upvoted a paper about 1 month ago

Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?

Paper • 2510.01161 • Published Oct 1 • 13

upvoted 6 papers about 2 months ago

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published Sep 16 • 78

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Paper • 2509.13309 • Published Sep 16 • 67

Towards General Agentic Intelligence via Environment Scaling

Paper • 2509.13311 • Published Sep 16 • 70

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16 • 89

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 115

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Paper • 2509.13312 • Published Sep 16 • 105

upvoted 4 papers 3 months ago

Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13 • 56

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

Paper • 2508.10751 • Published Aug 14 • 28

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 258

upvoted a paper 4 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 92

upvoted 2 papers 8 months ago

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

Paper • 2503.21460 • Published Mar 27 • 83

Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models

Paper • 2503.21380 • Published Mar 27 • 38