A.X-4.0-FP8-Dynamic / recipe.yaml
sh2orc's picture
Add files using upload-large-folder tool
f5b8a22 verified
default_stage:
default_modifiers:
QuantizationModifier:
targets: [Linear]
ignore: [lm_head, 're:.*mlp.gate$', 're:.*mlp.shared_expert_gate$']
scheme: FP8_dynamic