Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Cornell-AGI
university
Activity Feed
Follow
9
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
GitBag
authored
a paper
about 1 month ago
Prompt Curriculum Learning for Efficient LLM Post-Training
GitBag
authored
a paper
5 months ago
Pre-trained Large Language Models Learn Hidden Markov Models In-context
GitBag
updated
a collection
6 months ago
Accelerating RL for LLM Reasoning with Optimal Advantage Reg
View all activity
Team members
1
Cornell-AGI
's datasets
15
Sort: Recently updated
Cornell-AGI/math_size_qwen2.5_7b_eval
Viewer
•
Updated
May 29
•
7.5k
•
28
Cornell-AGI/math_size_qwen2.5_3b_eval
Viewer
•
Updated
May 29
•
7.5k
•
10
Cornell-AGI/math_size_qwen2.5_1.5b_eval
Viewer
•
Updated
May 29
•
7.5k
•
120
Cornell-AGI/gsm8k_size_qwen2.5_7b_eval
Viewer
•
Updated
May 29
•
7.47k
•
10
Cornell-AGI/gsm8k_size_qwen2.5_3b_eval
Viewer
•
Updated
May 29
•
7.47k
•
8
Cornell-AGI/gsm8k_size_qwen2.5_1.5b_eval
Viewer
•
Updated
May 29
•
7.47k
•
39
Cornell-AGI/amazon_movie_tv_item_mxbai
Viewer
•
Updated
Dec 2, 2024
•
10.5k
•
18
Cornell-AGI/amazon_movie_tv_llama_mxbai
Viewer
•
Updated
Oct 23, 2024
•
17.1k
•
121
Cornell-AGI/REFUEL-Ultrainteract-Llama-3-Armo-iter_2
Viewer
•
Updated
Oct 8, 2024
•
116k
•
35
•
1
Cornell-AGI/REFUEL-Ultrainteract-Llama-3-Armo-iter_1
Viewer
•
Updated
Oct 8, 2024
•
64.6k
•
16
•
2
Cornell-AGI/REFUEL-UltraInteract-setting-two
Viewer
•
Updated
Oct 5, 2024
•
106k
•
17
•
1
Cornell-AGI/REFUEL-hh-setting-two
Viewer
•
Updated
Oct 5, 2024
•
165k
•
11
Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_1
Viewer
•
Updated
Sep 2, 2024
•
56.1k
•
61
Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_3
Viewer
•
Updated
Sep 2, 2024
•
44.6k
•
12
•
1
Cornell-AGI/Ultrafeedback-Llama-3-Armo-iter_2
Viewer
•
Updated
Sep 2, 2024
•
55.1k
•
81