Hyunwoo Ko's picture

Hyunwoo Ko

Cartinoe5930

·

https://cartinoe5930.tistory.com/

AI & ML interests

NLP(LLM)

Recent Activity

upvoted a paper 1 day ago

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

updated a dataset 9 days ago

Cartinoe5930/realmath_result

updated a dataset 21 days ago

Cartinoe5930/realmath_result

View all activity

Organizations

upvoted a paper 1 day ago

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

Paper • 2601.01836 • Published 3 days ago • 5

upvoted 3 papers 3 months ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Paper • 2510.04230 • Published Oct 5, 2025 • 26

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1, 2025 • 58

Variational Reasoning for Language Models

Paper • 2509.22637 • Published Sep 26, 2025 • 69

upvoted 2 papers 8 months ago

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research

Paper • 2505.11855 • Published May 17, 2025 • 10

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8, 2025 • 86

upvoted a paper 10 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26, 2025 • 65

upvoted 2 papers 11 months ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24, 2025 • 26

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published Feb 20, 2025 • 47

upvoted an article 11 months ago

Article

Open R1: Update #2

Feb 10, 2025

•

218

upvoted 2 papers 11 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 124

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 61

upvoted a collection 12 months ago

DeepSeek-R1

10 items • Updated Nov 27, 2025 • 826

upvoted 2 papers 12 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 99

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 287

upvoted 3 articles about 1 year ago

Article

Releasing QwQ-LongCoT-130K

Dec 5, 2024

•

10

Article

Navigating Korean LLM Research #2: Evaluation Tools

Oct 23, 2024

•

8

Article

Navigating Korean LLM Research #1: Models

Oct 22, 2024

•

26

upvoted a paper over 1 year ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

upvoted an article over 1 year ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Aug 19, 2024

•

79