🇩🇪German SFT and DPO datasets Collection Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 33 items • Updated Jan 23, 2025 • 13
Article Transformers v5: Simple model definitions powering the AI ecosystem Dec 1, 2025 • 274
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 132
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 181
Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 179
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 • 222
Language Models are Realistic Tabular Data Generators Paper • 2210.06280 • Published Oct 12, 2022 • 1
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 263
Article How we leveraged distilabel to create an Argilla 2.0 Chatbot Jul 16, 2024 • 33