anbinx

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 3 hours ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published 3 days ago • 52

liked a Space 10 days ago

HuggingFaceTB/smol-training-playbook

The Smol Training Playbook

The secrets to building world-class LLMs

updated a collection about 2 months ago

大模型idea (Large Model Ideas)

18 items • Updated Sep 15 • 1

liked a dataset 2 months ago

HuggingFaceM4/FineVision

Viewer • Updated 22 days ago • 24.2M • 277k • 436

upvoted a paper 3 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 87

updated a collection 3 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted a paper 4 months ago

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 35

updated a collection 4 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted a paper 4 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 79

updated a collection 4 months ago

大模型idea

18 items • Updated Sep 15 • 1

liked a model 6 months ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated May 29 • 130k • 976

upvoted a paper 6 months ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

updated a collection 6 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted a paper 6 months ago

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4 • 25

updated a collection 6 months ago

大模型idea

18 items • Updated Sep 15 • 1

upvoted an article 6 months ago

I trained a Language Model to schedule events with GRPO!

Article • Apr 29 • 90

liked a dataset 7 months ago

nvidia/describe-anything-dataset

Viewer • Updated Apr 24 • 916k • 3.17k • 48

liked a model 7 months ago

zai-org/GLM-Z1-32B-0414

Text Generation • 33B • Updated Apr 28 • 1.66k • 184