MathewShen/qwen3-0.6B-tldr-adapter


MathewShen committed on May 13

Commit 74d9c95 · verified · 1 parent: c6f2624

End of training

Files changed (4)

README.md +1 -1
adapter_config.json +3 -3
adapter_model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -27,7 +27,7 @@ print(output["generated_text"])
 ## Training procedure
 This model was trained with SFT.

 ## Training procedure
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/mathewshen/huggingface/runs/xqdk2ol2)
 This model was trained with SFT.

adapter_config.json CHANGED

@@ -24,12 +24,12 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k_proj",
     "gate_proj",
-    "v_proj",
     "down_proj",
     "up_proj",
-    "q_proj",
     "o_proj"
   ],
   "task_type": "CAUSAL_LM",

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "gate_proj",
+    "q_proj",
     "down_proj",
+    "k_proj",
+    "v_proj",
     "up_proj",
     "o_proj"
   ],
   "task_type": "CAUSAL_LM",
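
The removed and added lines in this hunk name the same seven modules, so the change to `target_modules` appears to be a reordering only. A minimal sketch of that check, with both lists copied from the two sides of the hunk; the helper `diff_modules` and the variable names are illustrative and not part of the repository:

```python
# Lists copied from the old and new sides of the adapter_config.json hunk.
old_target_modules = [
    "k_proj", "gate_proj", "v_proj", "down_proj", "up_proj", "q_proj", "o_proj",
]
new_target_modules = [
    "gate_proj", "q_proj", "down_proj", "k_proj", "v_proj", "up_proj", "o_proj",
]


def diff_modules(old, new):
    """Return (removed, added) module names, ignoring list order."""
    old_set, new_set = set(old), set(new)
    return sorted(old_set - new_set), sorted(new_set - old_set)


removed, added = diff_modules(old_target_modules, new_target_modules)

# If nothing was removed or added, the two lists differ only in order.
order_only = not removed and not added
print("removed:", removed)
print("added:", added)
print("order-only change:", order_only)
```

Comparing the lists as sets is a deliberate choice here: it assumes the adapter treats `target_modules` as an unordered collection, so a reordering alone would not change which layers are adapted.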

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5f8529aebce087155a3d6933c44c2dea7d345b401691d7641ce0ae342c6dd539
 size 20236472

 version https://git-lfs.github.com/spec/v1
+oid sha256:0f29ee66927e5fd6559cb071e662bbbdf1ec1b6d4e9ffdf7ee96542fe8f2fd1d
 size 20236472

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3e87a279ff96b8ac5d6444813d0cb102c3d1306fdcd4a939462ea800c1eee992
 size 6033

 version https://git-lfs.github.com/spec/v1
+oid sha256:02e21d56bfba83d131555209fe64e5cffdb79be5464a26a5e005ea2ef157d8a0
 size 6033
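
Both binary files are stored as Git LFS pointers, so the diff shows only the pointer fields: the `oid` changes while the `size` stays the same. A minimal sketch of parsing and comparing two pointers, using the `training_args.bin` values from the hunk above; `parse_lfs_pointer` is a hypothetical helper written for this illustration, not a Git LFS or Hugging Face API:

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


# Pointer contents copied from the old and new sides of the diff.
old_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:3e87a279ff96b8ac5d6444813d0cb102c3d1306fdcd4a939462ea800c1eee992
size 6033"""

new_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:02e21d56bfba83d131555209fe64e5cffdb79be5464a26a5e005ea2ef157d8a0
size 6033"""

old = parse_lfs_pointer(old_pointer)
new = parse_lfs_pointer(new_pointer)

# A different oid means the file content changed, even at identical size.
content_changed = old["oid"] != new["oid"]
size_changed = int(old["size"]) != int(new["size"])
print("content changed:", content_changed)
print("size changed:", size_changed)
```

The same comparison applies to the `adapter_model.safetensors` pointers, where the size is 20236472 on both sides.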