Fix chat template including thinking token
The original chat template placed the {% generation %} tag incorrectly. I suspect it was copied from other Qwen3 models that support reasoning. This model, however, doesn't emit thinking tags, so finetuning with this tokenizer under --assistant_only_loss resulted in wrong assistant masking. This change fixes how the think token is handled for the instruct model while also allowing proper masking for instruction tuning.
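For context, here is a minimal, dependency-free sketch of what the {% generation %} tag does for assistant-only masking: everything rendered between the open and close tags gets loss, everything else is masked out. The sentinel strings and the `assistant_mask` function below are purely illustrative stand-ins, not the real transformers API; they just show why a misplaced tag (or a spurious think token inside the span) corrupts the mask.

```python
# Illustrative sentinels standing in for {% generation %} / {% endgeneration %}.
GEN_OPEN = "<GEN>"
GEN_CLOSE = "</GEN>"

def assistant_mask(rendered: str):
    """Strip the sentinels and return (text, per-character loss mask).

    Characters between GEN_OPEN and GEN_CLOSE get mask 1 (loss is
    computed on them); everything else gets mask 0.
    """
    text, mask = [], []
    inside = False
    i = 0
    while i < len(rendered):
        if rendered.startswith(GEN_OPEN, i):
            inside = True
            i += len(GEN_OPEN)
        elif rendered.startswith(GEN_CLOSE, i):
            inside = False
            i += len(GEN_CLOSE)
        else:
            text.append(rendered[i])
            mask.append(1 if inside else 0)
            i += 1
    return "".join(text), mask

# Correct placement: only the assistant's reply is inside the span.
good = "<|user|>Hi<|assistant|><GEN>Hello!</GEN>"
text, mask = assistant_mask(good)
# Only the 6 characters of "Hello!" carry loss; the role tags do not.
```

If the template instead wraps extra tokens (e.g. an empty think block the model never produces) inside the generation span, those tokens receive loss too, which is the mismatch this PR fixes.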
Yup. Not only that: I trained with the correct masking via `train_on_responses_only`, but the thinking tokens in the sequence made the loss much higher on the training set and the final performance on the eval dataset worse.
Obviously you can work around it with something like the following (here `tokenizer` is the tokenizer you already loaded for this model):

```python
from transformers import AutoTokenizer

# Borrow the chat template from the instruct tokenizer, which places
# the {% generation %} tags correctly.
_tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-4B-Instruct-2507")
tokenizer.chat_template = _tokenizer.chat_template
```
Btw, thanks a lot for the great work! I love you <3.