Li Dong's picture

Li Dong

unilm

·

AI & ML interests

Language Model Pre-Training

Recent Activity

authored a paper 8 days ago

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

authored a paper 8 days ago

DocReward: A Document Reward Model for Structuring and Stylizing

authored a paper 8 days ago

Information-Preserving Reformulation of Reasoning Traces for Antidistillation

View all activity

Organizations

upvoted 3 papers 9 days ago

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published 9 days ago • 23

The Principles of Diffusion Models

Paper • 2510.21890 • Published 16 days ago • 51

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published 10 days ago • 44

upvoted a paper 15 days ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 17 days ago • 110

upvoted 2 papers 16 days ago

Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

Paper • 2510.19808 • Published 17 days ago • 28

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published 17 days ago • 58

upvoted 3 papers 18 days ago

BitNet Distillation

Paper • 2510.13998 • Published 24 days ago • 52

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published 19 days ago • 62

QueST: Incentivizing LLMs to Generate Difficult Problems

Paper • 2510.17715 • Published 19 days ago • 32

upvoted 2 papers 25 days ago

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published Oct 7 • 31

DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published 26 days ago • 26

upvoted 3 papers 29 days ago

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 51

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 468

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3 • 49

upvoted 4 papers about 1 month ago

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26 • 39

Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 76

Thinking Augmented Pre-training

Paper • 2509.20186 • Published Sep 24 • 23

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 181

upvoted 2 papers 2 months ago

Fantastic Pretraining Optimizers and Where to Find Them

Paper • 2509.02046 • Published Sep 2 • 12

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 123