Image maximum resolution

#37

by andi-at - opened 11 days ago

11 days ago

Hello,
I noticed that text recognition in images works extremely poorly for very high-resolution images (8k), scaling down to 1k gives me perfect results. I'm running the model with vLLM. Should this be automatically resized? What resolution does the model natively support?
thanks for this wonderful model!

SerialKicked

7 days ago

•

edited 7 days ago

Unless specified otherwise, most models (this one included) run on 1K resolution. They have no idea how to process a 8K image. Normally, good frontends will resize that for you, but if you're using the model more "directly", yep, that's your job to do so.

andi-at

2 days ago

oh perfect thank you! yeah openwebui seems to ignore it, but i´ve already implemented the resizeing in my interface.
Also funny things will happen when the image is rotated, but file explorer shows it correctly due to the metadata :D
Thanks a lot!

andi-at changed discussion status to closed 2 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment