Fix generation config saving error

#38 by RaushanTurganbay

Simply loading the model with from_pretrained and saving it without changes throws an error on the latest transformers versions, because generation parameters are now strictly validated for consistency: the config carries sampling parameters while do_sample is off. Setting the sampling flag resolves the issue.
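The same fix can be applied at the config level alone, without loading the full model. A minimal sketch using GenerationConfig, assuming the repo's generation_config.json carries sampling parameters with do_sample unset (the local directory name is a placeholder):

from transformers import GenerationConfig

gen_config = GenerationConfig.from_pretrained(
    "mistralai/Mistral-Small-3.2-24B-Instruct-2506"
)
gen_config.do_sample = True  # make the flag consistent with the sampling params
gen_config.save_pretrained("fixed-config")  # placeholder directory; validation now passes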

The current workaround is to manually change the value after loading the model:

from transformers import Mistral3ForConditionalGeneration
import torch

model = Mistral3ForConditionalGeneration.from_pretrained(
    "mistralai/Mistral-Small-3.2-24B-Instruct-2506",
    dtype=torch.bfloat16,
).to("cuda")

model.generation_config.do_sample = True  # change the value so the config passes validation

output_dir = "mistral-small-3.2-fixed"  # placeholder path
model.save_pretrained(output_dir)  # SUCCESS!
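To confirm the fix round-trips, you can reload from the saved directory; a quick sketch reusing the placeholder output_dir from above:

reloaded = Mistral3ForConditionalGeneration.from_pretrained(output_dir)
assert reloaded.generation_config.do_sample  # flag persisted with the saved config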
