360Zhinao2 Collection: 360Zhinao2 language models, including both base and chat models • 7 items • Updated Oct 15, 2025
Article: Illustrating Reinforcement Learning from Human Feedback (RLHF) • Dec 9, 2022