Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
shanchen
/
math-500-base-japanese-lora
like
0
Transformers
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
math-500-base-japanese-lora
/
tokenizer_config.json
Commit History
Upload MATH-500 Japanese LoRA model (GRPO fine-tuned)
ba9028b
verified
shanchen
commited on
Sep 10