shawnxzhu's picture

4 1

shawnxzhu

shawnxzhu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

QueST: Incentivizing LLMs to Generate Difficult Problems

upvoted a paper 2 months ago

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration

upvoted a paper 3 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

View all activity

Organizations

None yet

Collections 1

models 3

shawnxzhu/CHARM-calibrated-Skywork-Reward-Llama-3.1-8B-v0.2

Text Classification • 8B • Updated Apr 14 • 4

shawnxzhu/Llama-2-7b-hf-backward-finetuned

shawnxzhu/Llama-2-7b-hf-backward

datasets 10

shawnxzhu/DSAA6000Q-Mistral-7B-Instruct-v0.2-lima-dpo

Viewer • Updated May 11 • 1.03k • 6

shawnxzhu/CHARM-preference20K

Viewer • Updated Apr 12 • 20k • 3

shawnxzhu/CHARM-preference20K-Qwen2.5-72B-Instruct

Viewer • Updated Apr 12 • 20k • 2

shawnxzhu/CHARM-preference20K-Llama-3.1-70B-Instruct

Viewer • Updated Apr 12 • 20k • 19

shawnxzhu/CHARM-preference20K-Llama-3.1-8B-Instruct

Viewer • Updated Apr 12 • 20k • 5

shawnxzhu/CHARM-preference20K-GPT-4o-mini-2024-07-18

Viewer • Updated Apr 12 • 20k • 5

shawnxzhu/CHARM-preference20K-gemma-2-27b-it

Viewer • Updated Apr 12 • 20k • 9

shawnxzhu/CHARM-preference20K-gemma-2-9b-it

Viewer • Updated Apr 12 • 20k • 7

shawnxzhu/CHARM-preference20K-gemma-2-9b-it-SimPO

Viewer • Updated Apr 12 • 20k • 16

shawnxzhu/backward-curation

Preview • Updated Apr 8 • 2