Cornell-AGI

university

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

GitBag authored a paper about 1 month ago

Prompt Curriculum Learning for Efficient LLM Post-Training

GitBag authored a paper 5 months ago

Pre-trained Large Language Models Learn Hidden Markov Models In-context

GitBag updated a collection 6 months ago

Accelerating RL for LLM Reasoning with Optimal Advantage Reg

View all activity

Cornell-AGI 's datasets 15

Cornell-AGI/math_size_qwen2.5_7b_eval

Viewer • Updated May 29 • 7.5k • 28

Cornell-AGI/math_size_qwen2.5_3b_eval

Viewer • Updated May 29 • 7.5k • 10

Cornell-AGI/math_size_qwen2.5_1.5b_eval

Viewer • Updated May 29 • 7.5k • 120

Cornell-AGI/gsm8k_size_qwen2.5_7b_eval

Viewer • Updated May 29 • 7.47k • 10

Cornell-AGI/gsm8k_size_qwen2.5_3b_eval

Viewer • Updated May 29 • 7.47k • 8

Cornell-AGI/gsm8k_size_qwen2.5_1.5b_eval

Viewer • Updated May 29 • 7.47k • 39

Cornell-AGI/amazon_movie_tv_item_mxbai

Viewer • Updated Dec 2, 2024 • 10.5k • 18

Cornell-AGI/amazon_movie_tv_llama_mxbai

Viewer • Updated Oct 23, 2024 • 17.1k • 121

Cornell-AGI/REFUEL-Ultrainteract-Llama-3-Armo-iter_2

Viewer • Updated Oct 8, 2024 • 116k • 35 • 1

Cornell-AGI/REFUEL-Ultrainteract-Llama-3-Armo-iter_1

Viewer • Updated Oct 8, 2024 • 64.6k • 16 • 2

Cornell-AGI/REFUEL-UltraInteract-setting-two

Viewer • Updated Oct 5, 2024 • 106k • 17 • 1

Cornell-AGI/REFUEL-hh-setting-two

Viewer • Updated Oct 5, 2024 • 165k • 11

Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_1

Viewer • Updated Sep 2, 2024 • 56.1k • 61

Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_3

Viewer • Updated Sep 2, 2024 • 44.6k • 12 • 1

Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_2

Viewer • Updated Sep 2, 2024 • 55.1k • 81