Jaehwisong/POC_Qwen3_backbone_all
Tags: Transformers, Safetensors, Generated from Trainer, sft, trl
Branch: main
Path: POC_Qwen3_backbone_all/checkpoint-2500
Total size: 12.8 GB
1 contributor
History: 1 commit
Latest commit: 04e249e (verified) by Jaehwisong, 22 days ago: Upload trained Qwen3-VL model
Every file below was last changed by the commit "Upload trained Qwen3-VL model", 22 days ago. Files marked "xet" carry the xet label on the page.

File                              Size         Storage
added_tokens.json                 707 Bytes    -
chat_template.jinja               5.29 kB      -
config.json                       1.6 kB       -
generation_config.json            199 Bytes    -
merges.txt                        1.67 MB      -
model.safetensors                 4.26 GB      xet
optimizer.pt                      8.51 GB      xet
preprocessor_config.json          782 Bytes    -
rng_state_0.pth                   16.4 kB      xet
rng_state_1.pth                   16.4 kB      xet
rng_state_2.pth                   16.4 kB      xet
rng_state_3.pth                   16.4 kB      xet
rng_state_4.pth                   16.4 kB      xet
rng_state_5.pth                   16.4 kB      xet
rng_state_6.pth                   16.4 kB      xet
rng_state_7.pth                   16.4 kB      xet
scheduler.pt                      1.47 kB      xet
special_tokens_map.json           613 Bytes    -
tokenizer.json                    11.4 MB      xet
tokenizer_config.json             5.45 kB      -
trainer_state.json                75.7 kB      -
training_args.bin                 6.29 kB      xet
video_preprocessor_config.json    817 Bytes    -
vocab.json                        2.78 MB      -
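The listing above mixes files needed to load the model with files that only matter for resuming training. The sketch below is one way to separate the two and check the sizes against the 12.8 GB folder total. It rests on assumptions that are not stated on the page: that sizes use decimal units (1 GB = 10^9 bytes), and that `optimizer.pt`, `scheduler.pt`, `rng_state_*.pth`, `trainer_state.json`, and `training_args.bin` are training-state files, as is usual for checkpoints written by a Trainer. The names `LISTING`, `TRAINING_STATE`, `to_bytes`, and `is_training_state` are illustrative only.

```python
from fnmatch import fnmatch

# Assumption: the page reports sizes in decimal units.
UNITS = {"Bytes": 1, "kB": 10**3, "MB": 10**6, "GB": 10**9}

# File names and sizes copied from the listing.
LISTING = {
    "added_tokens.json": "707 Bytes",
    "chat_template.jinja": "5.29 kB",
    "config.json": "1.6 kB",
    "generation_config.json": "199 Bytes",
    "merges.txt": "1.67 MB",
    "model.safetensors": "4.26 GB",
    "optimizer.pt": "8.51 GB",
    "preprocessor_config.json": "782 Bytes",
    "scheduler.pt": "1.47 kB",
    "special_tokens_map.json": "613 Bytes",
    "tokenizer.json": "11.4 MB",
    "tokenizer_config.json": "5.45 kB",
    "trainer_state.json": "75.7 kB",
    "training_args.bin": "6.29 kB",
    "video_preprocessor_config.json": "817 Bytes",
    "vocab.json": "2.78 MB",
}
# The eight per-process RNG state files all have the same listed size.
LISTING.update({f"rng_state_{i}.pth": "16.4 kB" for i in range(8)})

# Assumption: these patterns cover the files used only to resume training.
TRAINING_STATE = [
    "optimizer.pt",
    "scheduler.pt",
    "rng_state_*.pth",
    "trainer_state.json",
    "training_args.bin",
]


def to_bytes(size: str) -> float:
    """Convert a size string such as '4.26 GB' to a byte count."""
    value, unit = size.split()
    return float(value) * UNITS[unit]


def is_training_state(name: str) -> bool:
    """Return True if the file name matches a training-state pattern."""
    return any(fnmatch(name, pattern) for pattern in TRAINING_STATE)


total_bytes = sum(to_bytes(size) for size in LISTING.values())
inference_bytes = sum(
    to_bytes(size) for name, size in LISTING.items() if not is_training_state(name)
)

print(f"all files:       {total_bytes / 1e9:.2f} GB")
print(f"inference files: {inference_bytes / 1e9:.2f} GB")
```

Under these assumptions the per-file sizes sum to a value that rounds to the 12.8 GB shown for the folder, and about two thirds of it is `optimizer.pt`. Similar glob patterns, prefixed with the checkpoint folder name, could be given to a download tool to skip the training state when only inference is needed; that step is not shown here.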