44 57 119

Junlin Zhou

jlzhou

edwardzjl

AI & ML interests

None yet

Recent Activity

upvoted an article 2 days ago

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

liked a model 6 days ago

inclusionAI/LLaDA2.0-flash-preview

liked a model about 1 month ago

inclusionAI/LLaDA2.0-mini-preview

View all activity

Organizations

upvoted an article 2 days ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

Oct 24, 2024

•

upvoted an article 2 months ago

Article

Diffusion Language Models: The New Paradigm

Jun 10

•

upvoted a paper 2 months ago

Rope to Nope and Back Again: A New Hybrid Attention Strategy

Paper • 2501.18795 • Published Jan 30 • 12

upvoted an article 3 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

•

264

upvoted a paper 4 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

upvoted 2 articles 4 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

725

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

717

upvoted 3 papers 5 months ago

Don't Pay Attention

Paper • 2506.11305 • Published Jun 12 • 7

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Paper • 2506.06205 • Published Jun 6 • 30

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

upvoted 2 papers 6 months ago

Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods

Paper • 2505.17870 • Published May 23 • 5

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Paper • 2312.03209 • Published Dec 6, 2023 • 22

upvoted an article 7 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

721

upvoted a paper 7 months ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14 • 10

upvoted an article 8 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

397

upvoted a paper 8 months ago

Min P Sampling: Balancing Creativity and Coherence at High Temperature

Paper • 2407.01082 • Published Jul 1, 2024 • 1

upvoted an article 8 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

•

470

upvoted a paper 8 months ago

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Paper • 2502.18080 • Published Feb 25 • 2

upvoted 2 articles 8 months ago

Article

Open R1: Update #3

Mar 11

•

296

Article

From Files to Chunks: Improving HF Storage Efficiency

Nov 20, 2024

•

Junlin Zhou

AI & ML interests

Recent Activity

Organizations

jlzhou's activity

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

Diffusion Language Models: The New Paradigm

How to generate text: using different decoding methods for language generation with Transformers

SmolLM3: smol, multilingual, long-context reasoner

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Uncensor any LLM with abliteration

You could have designed state of the art positional encoding

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Open R1: Update #3

From Files to Chunks: Improving HF Storage Efficiency