view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 369
Running 3.46k 3.46k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Deepseek Papers Collection Deepseek papers collection • 25 items • Updated about 14 hours ago • 282