Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
39
184
46
KABI
dongguanting
Follow
BingSol's profile picture
asusevski's profile picture
John6666's profile picture
54 followers
·
90 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
2 days ago
Scaling Agent Learning via Experience Synthesis
upvoted
a
paper
3 days ago
V-Thinker: Interactive Thinking with Images
upvoted
a
paper
3 days ago
LiveTradeBench: Seeking Real-World Alpha with Large Language Models
View all activity
Organizations
dongguanting
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
3 datasets
3 days ago
We-Math/VTBench
Viewer
•
Updated
3 days ago
•
500
•
12
•
3
We-Math/V-Perception-40K
Viewer
•
Updated
3 days ago
•
36.7k
•
13
•
3
We-Math/V-Interaction-400K
Viewer
•
Updated
3 days ago
•
253k
•
103
•
3
liked
a model
2 months ago
meituan-longcat/LongCat-Flash-Chat
Text Generation
•
562B
•
Updated
Sep 24
•
21.9k
•
501
liked
a dataset
2 months ago
inclusionAI/ASearcher-train-data
Preview
•
Updated
Aug 13
•
106
•
20
liked
2 datasets
3 months ago
We-Math/We-Math2.0-Pro
Viewer
•
Updated
Aug 19
•
4.55k
•
118
•
20
We-Math/We-Math2.0-Standard
Viewer
•
Updated
Aug 19
•
5.84k
•
138
•
21
liked
2 models
3 months ago
Kwai-Klear/Klear-Reasoner-8B
8B
•
Updated
Sep 27
•
15
•
17
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28
•
32
•
3
liked
3 datasets
4 months ago
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
24 days ago
•
54.6k
•
284
•
13
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
24 days ago
•
1.07k
•
128
•
5
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
24 days ago
•
10k
•
160
•
3
liked
8 models
4 months ago
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
Aug 12
•
8
•
1
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
Aug 12
•
14
•
5
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
Aug 19
•
7
•
2
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29
•
25
•
2
dongguanting/Qwen2.5-3B-ARPO
Text Generation
•
3B
•
Updated
Aug 12
•
17
•
3
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
•
2B
•
Updated
Jun 6
•
53
•
2
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
•
0.6B
•
Updated
Jun 6
•
1
dongguanting/Tool-Star-Qwen-7B
Text Generation
•
8B
•
Updated
Jun 30
•
32
•
2
Load more