yuyijiong
/

Qwen3-SWA-adaptation

Text Generation

Model card Files Files and versions

Qwen3-SWA-adaptation / Qwen3-4B-Thinking-2507-sft-fusang-swa=2k_sink=100_falayer=25_from1_step2_fadec /checkpoint-926

47.8 MB

2 contributors

History: 4 commits

yuyijiong's picture

Add files using upload-large-folder tool

2898f7c verified 27 days ago

README.md
5.23 kB

Add files using upload-large-folder tool 27 days ago
adapter_config.json
883 Bytes

Add files using upload-large-folder tool 27 days ago
adapter_model.safetensors
31.9 MB
xet

Add files using upload-large-folder tool 27 days ago
added_tokens.json
707 Bytes

Add files using upload-large-folder tool 27 days ago
chat_template.jinja
4.05 kB

Add files using upload-large-folder tool 27 days ago
merges.txt
1.67 MB

Add files using upload-large-folder tool 27 days ago
special_tokens_map.json
613 Bytes

Add files using upload-large-folder tool 27 days ago
tokenizer.json
11.4 MB
xet

Add files using upload-large-folder tool 27 days ago
tokenizer_config.json
5.4 kB

Add files using upload-large-folder tool 27 days ago
trainer_state.json
33 kB

Add files using upload-large-folder tool 27 days ago
training_args.bin
5.97 kB
xet

Add files using upload-large-folder tool 27 days ago
vocab.json
2.78 MB

Add files using upload-large-folder tool 27 days ago