28 6 1

Zhilin Wang

zhilinw

AI & ML interests

None yet

Recent Activity

updated a Space 4 days ago

nvidia/ProfBench

updated a collection 3 months ago

Reward Models 10-2025

updated a dataset 3 months ago

nvidia/HelpSteer3

View all activity

Organizations

updated a Space 4 days ago

ProfBench

🦀

Human-annotated rubrics in Professional Tasks

updated a collection 3 months ago

Reward Models 10-2025

Collection

A collection of great reward models for research and production • 7 items • Updated 11 days ago • 12

updated a dataset 3 months ago

nvidia/HelpSteer3

Viewer • Updated Nov 16, 2025 • 133k • 2.59k • 96

New activity in nvidia/ProfBench 3 months ago

Full Set of Tasks and Rubrics

#3 opened 4 months ago by

post-train

updated a model 4 months ago

nvidia/Qwen3-Nemotron-32B-RLBFF

Text Generation • 33B • Updated Oct 31, 2025 • 35 • 27

liked a Space 4 months ago

ProfBench

🦀

Human-annotated rubrics in Professional Tasks

updated a dataset 4 months ago

nvidia/ProfBench

Viewer • Updated Oct 30, 2025 • 40 • 676 • 19

published a Space 4 months ago

ProfBench

🦀

Human-annotated rubrics in Professional Tasks

updated a collection 4 months ago

Reward Models 10-2025

Collection

A collection of great reward models for research and production • 7 items • Updated 11 days ago • 12

upvoted a collection 4 months ago

Reward Models 10-2025

Collection

A collection of great reward models for research and production • 7 items • Updated 11 days ago • 12

updated a model 4 months ago

nvidia/Qwen3-Nemotron-32B-GenRM-Principle

Text Generation • 33B • Updated Oct 30, 2025 • 274 • 12

upvoted an article 4 months ago

Article

Can Your LLM Think Like a Professional? Introducing ProfBench

Oct 28, 2025

•

published an article 4 months ago

Article

Can Your LLM Think Like a Professional? Introducing ProfBench

Oct 28, 2025

•

authored a paper 4 months ago

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Paper • 2510.18941 • Published Oct 21, 2025 • 8

upvoted a paper 4 months ago

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Paper • 2510.18941 • Published Oct 21, 2025 • 8

published a dataset 4 months ago

nvidia/ProfBench

Viewer • Updated Oct 30, 2025 • 40 • 676 • 19

updated a model 4 months ago

nvidia/Llama-3.3-Nemotron-70B-Reward-Principle

Text Generation • 71B • Updated Oct 30, 2025 • 201 • 6

authored a paper 5 months ago

RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards

Paper • 2509.21319 • Published Sep 25, 2025 • 8

upvoted a paper 5 months ago

RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards

Paper • 2509.21319 • Published Sep 25, 2025 • 8

commented a paper 5 months ago

RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards

Paper • 2509.21319 • Published Sep 25, 2025 • 8 •

Zhilin Wang

AI & ML interests

Recent Activity

Organizations

zhilinw's activity

ProfBench

Full Set of Tasks and Rubrics

ProfBench

ProfBench

Can Your LLM Think Like a Professional? Introducing ProfBench

Can Your LLM Think Like a Professional? Introducing ProfBench