2 6

Yan Yang PRO

HelloKKMe

AI & ML interests

None yet

Recent Activity

upvoted a paper 27 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 27 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

updated a dataset 3 months ago

HelloKKMe/h

View all activity

Organizations

upvoted 2 papers 27 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 29 days ago • 147

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 28 days ago • 89

updated a dataset 3 months ago

HelloKKMe/h

Preview • Updated Nov 22, 2025 • 1 • 1

published a dataset 3 months ago

HelloKKMe/h

Preview • Updated Nov 22, 2025 • 1 • 1

updated 3 models 4 months ago

published a dataset 4 months ago

Salesforce/grounding_dataset

Viewer • Updated Oct 3, 2025 • 70.7k • 476 • 4

published a model 4 months ago

Salesforce/GTA1-7B

Image-Text-to-Text • 8B • Updated Oct 3, 2025 • 291 • 3

updated a dataset 4 months ago

Salesforce/grounding_dataset

Viewer • Updated Oct 3, 2025 • 70.7k • 476 • 4

updated a collection 4 months ago

GTA1

Collection

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 5

updated a collection 5 months ago

GTA1

Collection

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 5

published a model 5 months ago

Salesforce/GTA1-7B-2507

Image-Text-to-Text • 8B • Updated Oct 3, 2025 • 1.15k • 3

updated a collection 5 months ago

GTA1

Collection

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 5

published a model 5 months ago

Salesforce/GTA1-32B

Image-Text-to-Text • 33B • Updated Oct 3, 2025 • 253 • 6

Yan Yang PRO

AI & ML interests

Recent Activity

Organizations

HelloKKMe's activity