Low step inference for speedups?

by prottropprot - opened 16 days ago

16 days ago

Is it possible to get consistent low-step (4- or 8-step) inference with the Lumina family of models? The current requirement of 30+ steps is a steep barrier to entry. Do approaches like DMD2 translate to this architecture?

duongve

Owner 15 days ago

Hi, there is currently no effective solution for reducing the step count for Lumina Image v2. I am still trying some approaches to handle this. When I finish, I will publish the results and add a note about it.

Y4iges

10 days ago

This comment has been hidden (marked as Resolved)

duongve

Owner 10 days ago

Hi @Y4iges. This problem is related to sage attention. If you are using it, please turn it off.
