In a Training Loop 🔄

14 14 75

Vadim Borisov PRO

vdmbrsv

http://www.tabularis.ai

AI & ML interests

Specialized AI Models, Edge AI, Synthetic Data

Recent Activity

updated a dataset 3 days ago

vdmbrsv/deepmath_de

published a dataset 4 days ago

vdmbrsv/deepmath_de

upvoted a collection 6 days ago

🇩🇪German SFT and DPO datasets

View all activity

Organizations

upvoted a collection 6 days ago

🇩🇪German SFT and DPO datasets

Collection

Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 33 items • Updated Jan 23, 2025 • 13

upvoted 2 articles 22 days ago

Article

Deriving the DPO Loss from First Principles

22 days ago

•

Article

SYNTH: the new data frontier

Nov 10, 2025

•

upvoted an article about 2 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025

•

274

upvoted a paper 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

upvoted an article 3 months ago

Article

Introducing the Massive Legal Embedding Benchmark (MLEB)

Oct 17, 2025

•

upvoted a paper 4 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 181

upvoted 2 articles 8 months ago

Article

🐯 Liger GRPO meets TRL

May 25, 2025

•

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26, 2025

•

179

upvoted an article about 1 year ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15, 2025

•

222

upvoted 2 papers over 1 year ago

Language Models are Realistic Tabular Data Generators

Paper • 2210.06280 • Published Oct 12, 2022 • 1

Open Artificial Knowledge

Paper • 2407.14371 • Published Jul 19, 2024 • 1

upvoted 2 articles over 1 year ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

•

263

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16, 2024

•

Vadim Borisov PRO

AI & ML interests

Recent Activity

Organizations

vdmbrsv's activity

Deriving the DPO Loss from First Principles

SYNTH: the new data frontier

Transformers v5: Simple model definitions powering the AI ecosystem

Introducing the Massive Legal Embedding Benchmark (MLEB)

🐯 Liger GRPO meets TRL

Training and Finetuning Reranker Models with Sentence Transformers v4

Train 400x faster Static Embedding Models with Sentence Transformers

Training and Finetuning Embedding Models with Sentence Transformers v3

How we leveraged distilabel to create an Argilla 2.0 Chatbot