Heming Xia

hemingkx

https://hemingkx.github.io/

AI & ML interests

Efficient and Effective NLP, Tool Learning, and Vision-Language Understanding.

Organizations

None yet

Recent Activity

upvoted a paper 11 days ago

The Station: An Open-World Environment for AI-Driven Discovery

Paper • 2511.06309 • Published 13 days ago • 34

upvoted a collection about 1 month ago

Qwen3-VL

37 items • Updated 21 days ago • 423

upvoted a paper about 1 month ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26 • 67

upvoted 2 papers 3 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118

upvoted a paper 4 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30 • 46

upvoted a paper 6 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 262

upvoted a paper 8 months ago

GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

Paper • 2412.16855 • Published Dec 22, 2024 • 5

upvoted 2 papers 9 months ago

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors

Paper • 2502.13311 • Published Feb 18 • 2

Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region

Paper • 2502.13946 • Published Feb 19 • 10