Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Hiring 💼
1218
146
114
Quentin Gallouédec
PRO
qgallouedec
Follow
traversaro's profile picture
Chinez-dev's profile picture
DerekLiu35's profile picture
578 followers
·
335 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
liked
a Space
about 9 hours ago
Chunte/HFBA
upvoted
a
paper
2 days ago
Rethinking the Trust Region in LLM Reinforcement Learning
upvoted
a
paper
2 days ago
Defeating the Training-Inference Mismatch via FP16
View all activity
Organizations
qgallouedec
's datasets
83
Sort: Recently updated
qgallouedec/deepmath-completions-logs2
Viewer
•
Updated
18 days ago
•
48
•
16
qgallouedec/deepmath-completions-logs
Viewer
•
Updated
27 days ago
•
232
•
127
•
1
qgallouedec/Dolci-Think-DPO-7B
Viewer
•
Updated
Nov 28, 2025
•
150k
•
2
qgallouedec/biogrid_qa
Viewer
•
Updated
Nov 18, 2025
•
59.4k
•
353
qgallouedec/human_gene_interaction_qa_v2
Viewer
•
Updated
Nov 18, 2025
•
79.2k
•
5
qgallouedec/human_gene_interaction_qa
Viewer
•
Updated
Nov 17, 2025
•
1.84M
•
6
qgallouedec/biogrid
Viewer
•
Updated
Nov 17, 2025
•
2.82M
•
100
qgallouedec/trl-metrics
Viewer
•
Updated
Oct 7, 2025
•
148k
•
57
•
1
qgallouedec/rick
Viewer
•
Updated
Sep 11, 2025
•
1.18k
•
2
qgallouedec/OpenMathReasoning
Viewer
•
Updated
Sep 10, 2025
•
10k
•
15
qgallouedec/math-lvl3to5-8k
Viewer
•
Updated
Aug 22, 2025
•
8.52k
•
8
qgallouedec/svg
Viewer
•
Updated
Aug 2, 2025
•
900
•
47
•
1
qgallouedec/rick-physics-grpo
Viewer
•
Updated
May 22, 2025
•
1.79k
•
22
•
1
qgallouedec/rick-science
Viewer
•
Updated
May 16, 2025
•
1.18k
•
8
•
3
qgallouedec/physics-problems
Viewer
•
Updated
May 10, 2025
•
247
•
13
qgallouedec/rick-teaches-math
Viewer
•
Updated
May 10, 2025
•
6.8k
•
7
qgallouedec/DAPO-Math-17k-Processed-Scored
Viewer
•
Updated
Apr 29, 2025
•
16.4k
•
8
•
3
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
2
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
1
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
9
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
2
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
3
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9, 2024
•
179k
•
2
qgallouedec/tldr
Viewer
•
Updated
Sep 9, 2024
•
130k
•
3
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
3
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
12
qgallouedec/suap_essentials
Viewer
•
Updated
Aug 6, 2024
•
30
•
5
qgallouedec/qa_suap
Viewer
•
Updated
Jul 14, 2024
•
270
•
1
qgallouedec/amber_results
Viewer
•
Updated
Jul 11, 2024
•
30.4k
•
8
qgallouedec/amber
Viewer
•
Updated
Jul 11, 2024
•
15.2k
•
12
Previous
1
2
3
Next