
Zihan Liu

LiuZH-19

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 3 days ago

Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes

Paper • 2601.02356 • Published 3 days ago • 12

upvoted a paper 25 days ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published 27 days ago • 29

upvoted 2 papers about 1 month ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 47

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 21

New activity in internlm/STAR-Bench 2 months ago

Improve dataset card: Add task categories, language, tags, paper links, sample usage, and citation

#2 opened 2 months ago by nielsr

liked a model 2 months ago

tencent/SongGeneration

Text-to-Audio • Updated Oct 23, 2025 • 417 • 291

upvoted a paper 2 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 28

commented on a paper 2 months ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 18

liked a model 3 months ago

internlm/Spark-VL-7B

Video-Text-to-Text • 8B • Updated Oct 23, 2025 • 56 • 10

upvoted 2 papers 3 months ago

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26, 2025 • 17

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26, 2025 • 32

upvoted 3 papers 4 months ago

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 41

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 89

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27, 2025 • 36

liked a Space 5 months ago

3DGen Leaderboard

😻

Display 3D model evaluation leaderboard

upvoted 2 papers 5 months ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6, 2025 • 52

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1, 2025 • 62

upvoted a paper 6 months ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published Jul 21, 2025 • 38

updated a collection 6 months ago

SongGen: A Single Stage Auto-regressive Transformer for Text

Collection

A storage repo for SongGen. • 3 items • Updated Jul 4, 2025 • 1