anbinx

AI & ML interests

None yet

Recent Activity

upvoted an article 5 days ago

🌳 QAT: The Art of Growing a Bonsai Model

upvoted a paper 6 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

liked a Space 15 days ago

HuggingFaceTB/smol-training-playbook

Organizations

None yet

upvoted an article 5 days ago

Article

🌳 QAT: The Art of Growing a Bonsai Model

8 days ago

upvoted a paper 6 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published 9 days ago • 100

upvoted a paper 3 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

upvoted a paper 4 months ago

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 35

upvoted a paper 5 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 79

upvoted 2 papers 6 months ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4 • 25

upvoted an article 7 months ago

Article

I trained a Language Model to schedule events with GRPO!

Apr 29

upvoted 2 papers 9 months ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 50

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 210

upvoted a paper 11 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 90

upvoted 3 papers about 1 year ago

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 19

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 93

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 30
