Hugging Face
wenlong deng
dwenlong
0 followers · 3 following
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
authored a paper 3 days ago
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
authored a paper 3 days ago
Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning
Organizations
dwenlong's activity
liked 2 models 7 days ago
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF
8B • Updated 25 days ago • 7.15k • 2
SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated 25 days ago • 35 • 1
liked 2 models 25 days ago
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins
Reinforcement Learning • 8B • Updated 25 days ago • 93 • 2
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base
Reinforcement Learning • 8B • Updated 25 days ago • 71 • 2
liked a model 10 months ago
UCSC-VLAA/MedReason-8B
Question Answering • 8B • Updated Jul 30, 2025 • 689 • 14
liked a dataset 10 months ago
UCSC-VLAA/MedReason
Viewer • Updated May 27, 2025 • 32.7k • 534 • 82
liked 2 models 11 months ago
deepseek-ai/DeepSeek-V3-0324
Text Generation • 685B • Updated Mar 27, 2025 • 242k • 3.09k
junnyu/DeepScaleR-1.5B-Preview-Reproduce
Text Generation • 2B • Updated Feb 26, 2025 • 10 • 4