Yantao Liu

RicardoL1u

https://scholar.google.com/citations?user=CKieAy4AAAAJ&hl=en

AI & ML interests

NLP

Recent Activity

New activity in THU-KEG/RM-Bench 27 days ago

Many chosen rows are truncated

#3 opened about 1 month ago by

AlexShengzhiMeta

upvoted a paper about 1 month ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2 • 52

updated a dataset 4 months ago

THU-KEG/RM-Bench

Viewer • Updated Jul 12 • 1.33k • 740 • 7

commented a paper 6 months ago

Are Reasoning Models More Prone to Hallucination?

Paper • 2505.23646 • Published May 29 • 24

upvoted a paper 6 months ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19 • 82

upvoted a paper 9 months ago

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26 • 23

updated a dataset 9 months ago

THU-KEG/PairJudge-432K

Viewer • Updated Feb 19 • 432k • 75 • 1

updated a model 9 months ago

THU-KEG/PairJudge-RM

8B • Updated Feb 19 • 2 • 1

upvoted a paper 9 months ago

ADELIE: Aligning Large Language Models on Information Extraction

Paper • 2405.05008 • Published May 8, 2024 • 2

upvoted a collection 9 months ago

OpenSAE-LLaMA-3.1-8B

Collection

OpenSAE checkpoints for LLaMA 3.1 8B base model • 38 items • Updated Jan 29 • 5

published a model 10 months ago

THU-KEG/PairJudge-RM

8B • Updated Feb 19 • 2 • 1

published a dataset 10 months ago

THU-KEG/PairJudge-432K

Viewer • Updated Feb 19 • 432k • 75 • 1

commented a paper 10 months ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22 • 20

upvoted a paper 10 months ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22 • 20

commented a paper 10 months ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22 • 20

New activity in THU-KEG/RM-Bench about 1 year ago

Add link to paper

#2 opened about 1 year ago by

nielsr

upvoted 2 papers about 1 year ago

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published Oct 21, 2024 • 16

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Paper • 2410.16184 • Published Oct 21, 2024 • 25

commented a paper about 1 year ago

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Paper • 2410.16184 • Published Oct 21, 2024 • 25

liked a model about 1 year ago

sfairXC/FsfairX-Zephyr-Chat-v0.1

Text Generation • 7B • Updated Apr 24, 2024 • 8
