leoking

leokmax

AI & ML interests

None yet

Recent Activity

liked a dataset 18 days ago

cassanof/CodeEditSearch

updated a dataset 19 days ago

leokmax/coder-sft

published a dataset 19 days ago

leokmax/coder-sft

View all activity

Organizations

None yet

liked a dataset 18 days ago

cassanof/CodeEditSearch

Viewer • Updated Apr 28, 2024 • 21.5k • 574 • 6

updated a dataset 19 days ago

leokmax/coder-sft

Viewer • Updated 19 days ago • 88 • 23

published a dataset 19 days ago

leokmax/coder-sft

Viewer • Updated 19 days ago • 88 • 23

updated a model 20 days ago

leokmax/zeta-sft

Text Generation • 0.5B • Updated 20 days ago • 48

liked a dataset 24 days ago

m-a-p/FineFineWeb

Viewer • Updated Dec 19, 2024 • 4.89B • 439k • 84

updated 2 models about 1 month ago

leokmax/Zeta-7B-gptqmodel-4bit

8B • Updated Oct 10 • 95

leokmax/Qwen2.5-Coder-1.5B-gptqmodel-4bit

2B • Updated Oct 10 • 5

published 2 models about 1 month ago

leokmax/Zeta-7B-gptqmodel-4bit

8B • Updated Oct 10 • 95

leokmax/Qwen2.5-Coder-1.5B-gptqmodel-4bit

2B • Updated Oct 10 • 5

updated a model about 2 months ago

leokmax/Zeta-7B-nf4

4B • Updated Sep 25 • 2

published a model about 2 months ago

leokmax/Zeta-7B-nf4

4B • Updated Sep 25 • 2

updated a model about 2 months ago

leokmax/QWen-Coder-1.5B-nf4

0.9B • Updated Sep 25 • 2

published a model about 2 months ago

leokmax/QWen-Coder-1.5B-nf4

0.9B • Updated Sep 25 • 2

published a model 2 months ago

leokmax/zeta-sft

Text Generation • 0.5B • Updated 20 days ago • 48

upvoted an article 5 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 369

liked a model 8 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27 • 239k • • 3.08k

liked a Space 9 months ago

3.46k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 9 months ago

Deepseek Papers

Collection

Deepseek papers collection • 25 items • Updated about 14 hours ago • 282

upvoted 2 collections about 1 year ago

LLM Pre-Train

Collection

16 items • Updated Jan 20 • 1

LLM Post Training

Collection

15 items • Updated Feb 1 • 1

leoking

AI & ML interests

Recent Activity

Organizations

leokmax's activity

Illustrating Reinforcement Learning from Human Feedback (RLHF)

The Ultra-Scale Playbook