Gaetan Lopez

gaetanlop

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Continuous batching from first principles

upvoted a paper 5 months ago

Group Sequence Policy Optimization

upvoted an article 6 months ago

SmolLM3: smol, multilingual, long-context reasoner

Organizations

None yet

upvoted an article about 1 month ago

Article

Continuous batching from first principles

Nov 25 • 288

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 316

upvoted 3 articles 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8 • 740

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 263

Article

KV Cache from scratch in nanoVLM

Jun 4 • 106

upvoted 2 articles 8 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

Aug 21, 2024

upvoted an article 9 months ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Mar 17 • 348

upvoted 4 articles 10 months ago

Article

Open R1: Update #3

Mar 11 • 296

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Mar 10 • 146

Article

Process Reinforcement through Implicit Rewards

Jan 3

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024 • 436

upvoted 4 articles 11 months ago

Article

1 Billion Classifications

Feb 13

Article

Open-R1: Update #1

Feb 2 • 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28 • 887

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Jan 20

upvoted a paper 12 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

upvoted an article about 1 year ago

Article

A Complete Guide to Audio Datasets

Dec 15, 2022

upvoted a paper about 1 year ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 32

updated a dataset about 1 year ago

gaetanlop/openai-prm800k-15k-stage2-conversational

Viewer • Updated Oct 13, 2024 • 17.7k • 21
