Yinmin Zhong

PKUFlyingPig

https://yinminzhong.com

AI & ML interests

Machine Learning Systems

Organizations

None yet

Recent Activity

authored a paper 5 days ago

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

Paper • 2602.21548 • Published 10 days ago • 38

upvoted a paper 5 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20, 2025 • 123

liked a model 5 months ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • Updated Nov 18, 2025 • 52.8k • 967

upvoted a collection 5 months ago

DeepSeek-V3.2

Collection

4 items • Updated Dec 1, 2025 • 531

liked a Space 12 months ago

The Ultra-Scale Playbook

🌌

3.72k

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper about 1 year ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6, 2025 • 51

updated a model about 1 year ago

PKUFlyingPig/gpt175b-config

Updated Feb 6, 2025 • 4

published a model about 1 year ago

PKUFlyingPig/gpt175b-config

Updated Feb 6, 2025 • 4

upvoted a paper about 1 year ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 36

updated a model almost 2 years ago

PKUFlyingPig/ppo-LunarLander-v2

Reinforcement Learning • Updated May 13, 2024

upvoted a paper about 2 years ago

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Paper • 2402.15627 • Published Feb 23, 2024 • 36

authored a paper about 2 years ago