Adnan Pen's picture

8 21

Adnan Pen

AdnanPen

·

adnan119

AI & ML interests

Computer Vision

Organizations

None yet

upvoted a collection 4 months ago

VisionLM

1867 items • Updated 11 days ago • 138

upvoted 3 collections 5 months ago

DINOv2

DINOv2: foundation models producing robust visual features suitable for image-level and pixel-level visual tasks - https://arxiv.org/abs/2304.07193 • 5 items • Updated Aug 13, 2025 • 30

Perception Encoder

17 items • Updated Jul 11, 2025 • 73

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 435

upvoted a paper 5 months ago

Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding

Paper • 2507.15028 • Published Jul 20, 2025 • 21

upvoted an article 6 months ago

Article

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

Jun 26, 2025

•

48

upvoted an article 8 months ago

Article

Vision Language Models (Better, faster, stronger)

+3

May 12, 2025

•

580

upvoted a paper about 1 year ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147