Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published 25 days ago • 38
Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published 25 days ago • 38
view post Post 5529 deepseek-ai/DeepSeek-OCR is out! 🔥 my take ⤵️ > pretty insane it can parse and re-render charts in HTML> it uses CLIP and SAM features concatenated, so better grounding> very efficient per vision tokens/performance ratio> covers 100 languages See translation 3 replies · 🚀 9 9 👍 3 3 + Reply
view post Post 6654 large AI labs open-sourced a ton of models last week 🔥here's few picks, find even more here merve/sep-16-releases-68d13ea4c547f02f95842f05 🤝> IBM released a new Docling model with 258M params based on Granite (A2.0) 📝 ibm-granite/granite-docling-258M> Xiaomi released 7B audio LM with base and instruct variants (MIT) XiaomiMiMo/mimo-audio-68cc7202692c27dae881cce0> DecartAI released Lucy Edit, open Nano Banana 🍌 (NC) decart-ai/Lucy-Edit-Dev> OpenGVLab released a family of agentic computer use models (3B/7B/32B) with the dataset 💻 OpenGVLab/scalecua-68c912cf56f7ff4c8e034003> Meituan Longcat released thinking version of LongCat-Flash 💭 meituan-longcat/LongCat-Flash-Thinking See translation 2 replies · 🔥 7 7 🤗 2 2 + Reply