PJMixers-Dev/Gemma-3-Starshine-Earthen-v0.4-12B-QLoRA

Text Generation

4-bit precision

xzuyn committed on Jun 12

Commit 38a1331 · verified · 1 Parent(s): 041355a

Update README.md

Files changed (1)

README.md +1 -1

@@ -52,7 +52,7 @@ datasets:
 ---
 # Gemma-3-Starshine-Earthen-v0.4-12B-QLoRA
-[`ToastyPigeon/Gemma-3-Starshine-12B`](https://huggingface.co/ToastyPigeon/Gemma-3-Starshine-12B) was trained at 8K with batch size 4 gradient accumulation 1, so each step was 32,768 tokens (including any padding tokens). It was trained for 40 steps, adding up to a total of 1,310,720 unique tokens seen.
+[`ToastyPigeon/Gemma-3-Starshine-12B`](https://huggingface.co/ToastyPigeon/Gemma-3-Starshine-12B) was trained at 8K with batch size 4 gradient accumulation 1, so each step was 32,768 tokens (including any padding tokens). It was trained for 100 steps, adding up to a total of 3,276,800 unique tokens seen.
 ## Quants
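
The token counts on both sides of this change can be checked with a short calculation. This is a minimal sketch, assuming "8K" means a sequence length of 8,192 tokens (consistent with the stated 32,768 tokens per step at batch size 4); the function names are hypothetical and not taken from any training script.

```python
def tokens_per_step(seq_len: int, batch_size: int, grad_accum: int) -> int:
    """Tokens consumed by one optimizer step, padding included."""
    return seq_len * batch_size * grad_accum


def total_tokens(steps: int, seq_len: int = 8192, batch_size: int = 4,
                 grad_accum: int = 1) -> int:
    """Total tokens seen over a run, assuming every step is a full batch."""
    return steps * tokens_per_step(seq_len, batch_size, grad_accum)


# Assumption: "8K" in the README means 8192 tokens per sequence.
per_step = tokens_per_step(8192, 4, 1)
old_total = total_tokens(40)    # value before this commit
new_total = total_tokens(100)   # value after this commit

print(f"{per_step:,}")   # 32,768
print(f"{old_total:,}")  # 1,310,720
print(f"{new_total:,}")  # 3,276,800
```

Both the removed and the added figures match this arithmetic, so the commit changes only the step count (40 to 100) and the total derived from it.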