peak-reasoning
Collection
⚠️DEPRECATED: Please switch to the Steiner-preview series models, which are trained with reinforcement learning and backtrack-able synthetic datasets. • 3 items • Updated
• 1
⚠️DEPRECATED: Please switch to the Steiner-preview series models, which are trained with reinforcement learning and backtrack-able synthetic datasets.
Base model
peakji/peak-reasoning-7b