Guanhua Huang's picture

1 3

Guanhua Huang

Carlanlarkk

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

upvoted a paper about 1 month ago

Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning

upvoted a paper about 1 month ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

View all activity

Organizations

None yet

authored 7 papers about 1 month ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3 • 63

Are AI-Generated Text Detectors Robust to Adversarial Perturbations?

Paper • 2406.01179 • Published Jun 3, 2024

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 52

AGILE: A Novel Reinforcement Learning Framework of LLM Agents

Paper • 2405.14751 • Published May 23, 2024

ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation

Paper • 2507.04952 • Published Jul 7 • 9

Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework

Paper • 2507.06829 • Published Jul 9

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23 • 67