GuoLiangTang
Tommy930
AI & ML interests
LLM,NLP,ML
Recent Activity
upvoted
a
paper
about 14 hours ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
upvoted
a
paper
about 15 hours ago
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies
upvoted
a
paper
about 15 hours ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
Organizations
None yet