Request for ONNX version
I have tried to convert this model to ONNX using the current Hugging Face ONNX conversion workflow (via transformers.js/scripts/convert.py and optimum.exporters.onnx). Unfortunately, the conversion fails because this model uses the idefics3 architecture, which is currently not supported by optimum.exporters.onnx.
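For reference, here is a minimal sketch of the export call that fails for me; the repo ID is an assumption on my part, so substitute the actual model ID:

```python
# Minimal sketch of the failing export attempt. The model ID below is an
# assumption; substitute the actual repository ID for this model.
from optimum.exporters.onnx import main_export

main_export(
    model_name_or_path="ibm-granite/granite-docling-258M",
    output="granite-docling-258M-onnx",
    task="auto",  # let optimum infer the task from the model config
)
# This fails because the idefics3 architecture has no ONNX export config
# registered in optimum.exporters.onnx.
```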
Could the maintainers or the model authors provide an official ONNX version? Having an ONNX version would greatly help with deployment in environments that require ONNX Runtime and improve inference efficiency.
Yes, that would be nice; I second the idea. It would let us run the model in a browser environment, for example.
I have some initial work on this up at: https://github.com/gabe-l-hart/optimum-onnx. It was done entirely with our internal developer assistant (see the commit message for the full prompt). I haven't validated either the code changes or the output model, but the auto-evaluated maximum delta is fairly low, so I think it should be reasonably close. I'm not at all familiar with how VLMs are handled in ONNX runtime environments, particularly the preprocessing portions. For llama.cpp, some significant preprocessing changes were needed before the model performed well, so the same may well be needed here.
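In case it helps anyone validating the export, here is a rough, unvalidated sketch of how one might spot-check the exported decoder against the PyTorch model. The repo ID, ONNX filename, input names, and Auto class are all assumptions; inspect the exported graph for the real ones:

```python
# Rough, unvalidated sketch: compare ONNX decoder logits against the PyTorch
# model for one text-only forward pass. The ONNX filename and input names are
# assumptions; print sess.get_inputs() to discover the real ones first.
import numpy as np
import onnxruntime as ort
import torch
from transformers import AutoModelForVision2Seq  # assumed Auto class

model = AutoModelForVision2Seq.from_pretrained(
    "ibm-granite/granite-docling-258M"  # assumed repo ID
)
model.eval()

sess = ort.InferenceSession("onnx/decoder_model.onnx")  # assumed filename
print([(i.name, i.shape) for i in sess.get_inputs()])  # check real inputs

input_ids = torch.tensor([[1, 2, 3, 4]])
attention_mask = torch.ones_like(input_ids)

with torch.no_grad():
    ref = model(input_ids=input_ids, attention_mask=attention_mask).logits.numpy()

onnx_out = sess.run(None, {
    "input_ids": input_ids.numpy(),
    "attention_mask": attention_mask.numpy(),
})[0]

# The maximum absolute delta gives a quick sanity check on export fidelity.
print("max abs delta:", float(np.max(np.abs(ref - onnx_out))))
```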
Hi, I have generated an ONNX version of this model here: https://huggingface.co/lamco-development/granite-docling-258M-onnx using @gabegoodhart's work. Check it out!
Posting here for others to take a look at: https://huggingface.co/onnx-community/granite-docling-258M-ONNX
:)
Shalom Joshua, thanks for the ONNX version!