Diversity-Incentivized Exploration for Versatile Reasoning
Zican Hu
huzican
AI & ML interests
None yet
Recent Activity
upvoted a paper about 7 hours ago
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
upvoted a paper 2 days ago
P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads
upvoted a paper 8 days ago
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations