Remove the setting of _attn_implementation from llama_bidirectional_model
#3
by nvidia-oliver-holworthy
Remove _attn_implementation from the LlamaBidirectionalModel constructor.
This follows the transformers 4.48.0 release, which included a refactor of attention implementations (transformers/pull/35235, "🚨All attention refactor🚨"). In transformers 4.47.0 this line had no effect on the attention implementation, but since 4.48.0 it is picked up by the new attention initialization and activates "eager" attention instead of "sdpa".
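For context, a minimal sketch of the kind of change involved, assuming the constructor previously overrode the attention implementation on the config. The class skeleton and the exact attribute value here are illustrative, not the repository's actual source:

```python
from transformers import LlamaConfig, LlamaModel


class LlamaBidirectionalModel(LlamaModel):
    """Illustrative skeleton of a bidirectional Llama encoder variant."""

    def __init__(self, config: LlamaConfig):
        # Removed line (assumed form): before transformers 4.48.0 this override
        # was ignored, but the 4.48.0 attention refactor reads it during attention
        # initialization and would force "eager" attention instead of "sdpa".
        # config._attn_implementation = "eager"
        super().__init__(config)
```

With the override removed, loading the model under transformers >= 4.48.0 should leave the default backend in place; the active implementation can be checked via model.config._attn_implementation, which should report "sdpa" rather than "eager" when SDPA is available.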