Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dongguanting
's Collections
AEPO
ARPO
Tool-Star
RAG-Critic
AEPO
updated
19 days ago
The official datasets and model checkpoints of AEPO
Upvote
3
Agentic Entropy-Balanced Policy Optimization
Paper
•
2510.14545
•
Published
24 days ago
•
101
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
14 days ago
•
54
dongguanting/Qwen3-14B-AEPO-DeepSearch
Robotics
•
15B
•
Updated
19 days ago
•
46
•
1
dongguanting/Qwen2.5-7B-AEPO
Text Generation
•
8B
•
Updated
14 days ago
•
47
Upvote
3
Share collection
View history
Collection guide
Browse collections