12 18 1

Jinyang Wu

Jinyang23

https://orcid.org/my-orcid?orcid=0009-0006-0220-616X

jinyangwu

AI & ML interests

large language models, reasoning, agentic rl

Recent Activity

upvoted a paper 2 days ago

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

upvoted a paper 2 days ago

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

upvoted a paper 2 days ago

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

View all activity

Organizations

None yet

upvoted 3 papers 2 days ago

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published 9 days ago • 9

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 5 days ago • 31

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Paper • 2602.02419 • Published 5 days ago • 4

upvoted 2 papers 4 days ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 9 days ago • 147

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 5 days ago • 202

upvoted a paper 5 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 8 days ago • 12

submitted a paper to Daily Papers 5 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 8 days ago • 12

New activity in Jinyang23/Spark-1.5B-ScienceWorld 8 days ago

Update README.md

#2 opened 8 days ago by

shuo-yan

New activity in Jinyang23/Spark-1.5B-WebShop 8 days ago

Update README.md

#2 opened 8 days ago by

shuo-yan

New activity in Jinyang23/Spark-1.5B-ALFWorld 8 days ago

Update README.md

#2 opened 8 days ago by

shuo-yan

authored 2 papers 9 days ago

Double: Breaking the Acceleration Limit via Double Retrieval Speculative Parallelism

Paper • 2601.05524 • Published 29 days ago

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 10 days ago • 22

updated 3 models 9 days ago

published 3 models 9 days ago

Jinyang23/Spark-1.5B-ALFWorld

Reinforcement Learning • 2B • Updated 8 days ago • 9

Jinyang23/Spark-1.5B-WebShop

Reinforcement Learning • 2B • Updated 8 days ago • 19

Jinyang23/Spark-1.5B-ScienceWorld

Reinforcement Learning • 2B • Updated 8 days ago • 9

upvoted a paper 9 days ago

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 10 days ago • 22

submitted a paper to Daily Papers 9 days ago

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 10 days ago • 22

Jinyang Wu

AI & ML interests

Recent Activity

Organizations

Jinyang23's activity

Update README.md

Update README.md

Update README.md