Dcas89 PRO
Dcas89
·
AI & ML interests
None yet
Recent Activity
liked
a dataset
6 days ago
minwoosun/CholecSeg8k
reacted
to
prithivMLmods's
post
with 👍
about 1 month ago
Try the Hugging Face Space demo for https://huggingface.co/Logics-MLLM/Logics-Parsing, the latest multimodal VLM from the Logics Team at Alibaba Group. It enables end-to-end document parsing with precise content extraction in markdown format, and it also generates a clean HTML representation of the document while preserving its logical structure. 🤗🔥
Additionally, I’ve integrated one of my recent works — https://huggingface.co/prithivMLmods/Gliese-OCR-7B-Post1.0 — which also excels at document comprehension.
⭐ Space / App : https://huggingface.co/spaces/prithivMLmods/VLM-Parsing
📄 Technical Report by the Logics Team, Alibaba Group : https://huggingface.co/papers/2509.19760
🖖 MM: VLM-Parsing: https://huggingface.co/collections/prithivMLmods/mm-vlm-parsing-68e33e52bfb9ae60b50602dc
⚡ Collections : https://huggingface.co/collections/prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
Other Pages:
➔ Multimodal VLMs - July'25 : https://huggingface.co/collections/prithivMLmods/multimodal-vlms-until-july25-688312e6b840e1e156f13027
➔ Multimodal VLMs - Aug'25 : https://huggingface.co/collections/prithivMLmods/multimodal-vlms-aug25-68a56aac39fe8084f3c168bd
➔ VL caption — < Sep 15 ’25 : https://huggingface.co/collections/prithivMLmods/vl-caption-sep-15-25-68c7f6d737985c63c13e2391
.
.
.
To know more about it, visit the app page or the respective model page!!
Organizations
None yet