Is WSM strategy used here for RLVR?

#3
by adamo1139 - opened

Hi, are you using Warmup-Stable and Merge strategy for training this model, even in the RLVR stage?

inclusionAI org

We adopt WSM LR scheduler in the pre-training stage of Ling-1T-base, not in RLVR stage.

adamo1139 changed discussion status to closed

Sign up or log in to comment