Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning Paper • 2601.20829 • Published 5 days ago • 5
guactastesgood/DeepSeek-R1-Distill-Qwen-1.5B-failure-prefix-conditioning-iteration1 Updated about 10 hours ago
Failure-Prefix Conditioning Collection Collection for the paper: Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning • 1 item • Updated about 10 hours ago
Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning Paper • 2505.14216 • Published May 20, 2025 • 2
Warm Up Before You Train: Unlocking General Reasoning in Resource-Constrained Settings Paper • 2505.13718 • Published May 19, 2025 • 7
Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges Paper • 2502.08680 • Published Feb 12, 2025 • 11