Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ubermenchh
/
llama3.1-8B-gsm8k-grpo
like
0
PyTorch
Safetensors
GGUF
llama
unsloth
trl
grpo
conversational
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
e6b220a
llama3.1-8B-gsm8k-grpo
/
special_tokens_map.json
Commit History
Upload tokenizer
b4d6dde
verified
ubermenchh
commited on
Feb 13