sarashina-post-training-practice
Collection
experimental post-trained models of sbintuitions/sarashina2.2-3b and sbintuitions/sarashina2.2-3b-instruct-v0.1
•
5 items
•
Updated
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
Base model
sbintuitions/sarashina2.2-3b