Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
apriasmoro
/
b0704929-9dfc-4456-a36b-3d478481e86f
like
0
Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
axolotl
trl
grpo
conversational
custom_code
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
b0704929-9dfc-4456-a36b-3d478481e86f
/
last-checkpoint
/
rng_state_0.pth
Commit History
Training in progress, step 364, checkpoint
8613701
verified
apriasmoro
commited on
Jul 11
Training in progress, step 324, checkpoint
1b0a560
verified
apriasmoro
commited on
Jul 11
Training in progress, step 270, checkpoint
944b5c6
verified
apriasmoro
commited on
Jul 11
Training in progress, step 216, checkpoint
3cc2e11
verified
apriasmoro
commited on
Jul 11
Training in progress, step 162, checkpoint
ed9fd0f
verified
apriasmoro
commited on
Jul 11
Training in progress, step 108, checkpoint
e209b1f
verified
apriasmoro
commited on
Jul 11
Training in progress, step 54, checkpoint
95afb11
verified
apriasmoro
commited on
Jul 10
Training in progress, step 324, checkpoint
5b8242d
verified
apriasmoro
commited on
Jul 10
Training in progress, step 270, checkpoint
a30f141
verified
apriasmoro
commited on
Jul 10
Training in progress, step 216, checkpoint
d730c56
verified
apriasmoro
commited on
Jul 10
Training in progress, step 162, checkpoint
32a93b2
verified
apriasmoro
commited on
Jul 10
Training in progress, step 108, checkpoint
f17fb56
verified
apriasmoro
commited on
Jul 10
Training in progress, step 54, checkpoint
5a5b4e7
verified
apriasmoro
commited on
Jul 10