2 53 9

Pham Van Linh

phamvanlinh143

AI & ML interests

OCR, AI, DL

Recent Activity

upvoted an article about 4 hours ago

A Survey of Small Language Models in the Era of LLMs: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness

upvoted a paper about 5 hours ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

upvoted an article about 8 hours ago

The 4 Things Qwen-3’s Chat Template Teaches Us

View all activity

Organizations

None yet

upvoted an article about 4 hours ago

Article

A Survey of Small Language Models in the Era of LLMs: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness

Jul 16

•

upvoted a paper about 5 hours ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 238

upvoted an article about 8 hours ago

Article

The 4 Things Qwen-3’s Chat Template Teaches Us

Apr 30

•

upvoted 4 articles about 9 hours ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

•

470

Article

Visualizing How VLMs Work

Oct 7

•

Article

Welcome PaliGemma 2 – New vision language models by Google

Dec 5, 2024

•

162

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

•

301

upvoted a collection about 9 hours ago

Vision Language Models: 2025 Update

Collection

This collection includes all the models, datasets and Spaces mentioned in the blog Vision Language Models: 2025 Update • 67 items • Updated May 12 • 5

upvoted a collection about 10 hours ago

Qwen3-VL

Collection

37 items • Updated 18 days ago • 414

liked 2 models about 19 hours ago

Qwen/QVQ-72B-Preview

Image-Text-to-Text • 73B • Updated Jan 12 • 496 • 609

Alpha-VLLM/Lumina-mGPT-7B-768

Any-to-Any • 7B • Updated Apr 7 • 1.81k • 39

upvoted a collection about 19 hours ago

Any-to-Any Models, Datasets, Spaces

Collection

18 items • Updated Jun 20 • 27

liked a model about 19 hours ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 62.7k • 3.53k

upvoted 2 articles 1 day ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

725

Article

Transformers

Jul 2, 2024

•

upvoted 3 papers 2 days ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 169

MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27, 2024 • 32

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 132

upvoted 2 articles 2 days ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

•

190

Article

Vision Language Models Explained

Apr 11, 2024

•

492

Pham Van Linh

AI & ML interests

Recent Activity

Organizations

phamvanlinh143's activity

A Survey of Small Language Models in the Era of LLMs: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness

The 4 Things Qwen-3’s Chat Template Teaches Us

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Visualizing How VLMs Work

Welcome PaliGemma 2 – New vision language models by Google

ColPali: Efficient Document Retrieval with Vision Language Models 👀

SmolLM3: smol, multilingual, long-context reasoner

Transformers

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Vision Language Models Explained