dada22231/9673be0a-6c60-4635-b850-f4bc6dd20a2f
Text Generation, Transformers, TensorBoard, Safetensors, llama, Generated from Trainer, axolotl, trl, grpo, conversational, text-generation-inference, arxiv:2402.03300
763ca09
1.98 GB
1 contributor
History: 31 commits
Latest commit: 763ca09 (verified) by dada22231, "Training in progress, step 1125, checkpoint", 6 months ago
Name                       Size       Storage  Last commit                                  Updated
last-checkpoint            -          -        Training in progress, step 1125, checkpoint  6 months ago
runs                       -          -        Training in progress, step 75                6 months ago
.gitattributes             1.52 kB    -        initial commit                               6 months ago
README.md                  4.04 kB    -        Training in progress, step 75                6 months ago
adapter_config.json        897 Bytes  -        Training in progress, step 75                6 months ago
adapter_model.safetensors  982 MB     xet      Training in progress, step 1125              6 months ago
added_tokens.json          80 Bytes   -        Training in progress, step 75                6 months ago
chat_template.jinja        484 Bytes  -        Training in progress, step 75                6 months ago
config.json                697 Bytes  -        Training in progress, step 75                6 months ago
generation_config.json     140 Bytes  -        Training in progress, step 75                6 months ago
merges.txt                 466 kB     -        Training in progress, step 75                6 months ago
special_tokens_map.json    659 Bytes  -        Training in progress, step 75                6 months ago
tokenizer.json             3.52 MB    -        Training in progress, step 75                6 months ago
tokenizer.model            493 kB     xet      Training in progress, step 75                6 months ago
tokenizer_config.json      3.43 kB    -        Training in progress, step 75                6 months ago
training_args.bin          8.27 kB    xet      Training in progress, step 75                6 months ago
vocab.json                 801 kB     -        Training in progress, step 75                6 months ago