Hi Vidore Team, good job.
I previously modified the pipeline of the vidore-benchmark codebase to fit my own research needs, so I am trying to evaluate the recently released vidore-v3 with vidore-benchmark. However, I get "DatasetGenerationError: An error occurred while generating the dataset". This error did not occur when I ran other benchmarks such as vidore-v1 and v2. Could you look into this, or is there a different setting we should know about if we still use vidore-benchmark for evaluation?