wenlong deng's picture

5 8

wenlong deng

dwenlong

·

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models

authored a paper 3 days ago

MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

authored a paper 3 days ago

Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning

View all activity

Organizations

liked 2 models 7 days ago

mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF

8B • Updated 25 days ago • 7.15k • 2

SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins

Reinforcement Learning • 3B • Updated 25 days ago • 35 • 1

liked 2 models 25 days ago

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins

Reinforcement Learning • 8B • Updated 25 days ago • 93 • 2

SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base

Reinforcement Learning • 8B • Updated 25 days ago • 71 • 2

liked a model 10 months ago

UCSC-VLAA/MedReason-8B

Question Answering • 8B • Updated Jul 30, 2025 • 689 • 14

liked a dataset 10 months ago

UCSC-VLAA/MedReason

Viewer • Updated May 27, 2025 • 32.7k • 534 • 82

liked 2 models 11 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27, 2025 • 242k • • 3.09k

junnyu/DeepScaleR-1.5B-Preview-Reproduce

Text Generation • 2B • Updated Feb 26, 2025 • 10 • 4