OpenGVLab

community

https://github.com/opengvlab

opengvlab

OpenGVLab

Activity Feed Request to join this org

AI & ML interests

Computer Vision

Recent Activity

vansin submitted a paper 2 days ago

End-to-End Video Character Replacement without Structural Guidance

heroding77 authored a paper 3 days ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

heroding77 submitted a paper 3 days ago

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

View all activity

Papers

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs

View all Papers

OpenGVLab 's models 286

OpenGVLab/InternOmni

Image-Text-to-Text • 9B • Updated Jan 20, 2025 • 130 • 6

OpenGVLab/VideoMAEv2-Large

Video Classification • 0.3B • Updated Jan 14, 2025 • 2.44k • 1

OpenGVLab/VideoMAEv2-Base

Video Classification • 86.2M • Updated Jan 14, 2025 • 6.54k • 9

OpenGVLab/InternViT-300M-448px

Image Feature Extraction • 0.3B • Updated Jan 8, 2025 • 6.4k • 62

OpenGVLab/InternVL2_5-78B-MPO-AWQ

Image-Text-to-Text • Updated Jan 6, 2025 • 60 • 9

OpenGVLab/VideoChat-TPO

Video-Text-to-Text • 8B • Updated Jan 2, 2025 • 34 • 5

OpenGVLab/InternVL

Updated Dec 25, 2024 • 37

OpenGVLab/HoVLE

Image-Text-to-Text • 3B • Updated Dec 24, 2024 • 54 • 13

OpenGVLab/InternVL2-8B-MPO

Image-Text-to-Text • 8B • Updated Dec 20, 2024 • 56 • 37

OpenGVLab/VideoChat2_HD_stage4_Mistral_7B_hf

Video-Text-to-Text • 8B • Updated Dec 19, 2024 • 82 • 3

OpenGVLab/InternVideo2_chat_8B_HD

Video-Text-to-Text • 8B • Updated Dec 18, 2024 • 94 • 18

OpenGVLab/PVC-InternVL2-8B

Image-Text-to-Text • 10B • Updated Dec 17, 2024 • 65 • 9

OpenGVLab/V2PE

Updated Dec 13, 2024 • 4

OpenGVLab/Mini-InternVL2-1B-DA-Medical

Image-Text-to-Text • 0.9B • Updated Dec 9, 2024 • 55 • 1

OpenGVLab/Mini-InternVL2-4B-DA-Medical

Image-Text-to-Text • 4B • Updated Dec 9, 2024 • 97 • 6

OpenGVLab/Mini-InternVL2-1B-DA-DriveLM

Image-Text-to-Text • 0.9B • Updated Dec 9, 2024 • 155 • 1

OpenGVLab/Mini-InternVL2-4B-DA-DriveLM

Image-Text-to-Text • 4B • Updated Dec 9, 2024 • 108 • 3

OpenGVLab/Mini-InternVL2-4B-DA-BDD

Image-Text-to-Text • 4B • Updated Dec 9, 2024 • 54

OpenGVLab/Mini-InternVL2-1B-DA-BDD

Image-Text-to-Text • 0.9B • Updated Dec 9, 2024 • 54

OpenGVLab/InternViT-6B-224px

Image Feature Extraction • Updated Dec 9, 2024 • 655 • 24

OpenGVLab/InternVL-14B-224px

Image Feature Extraction • 14B • Updated Dec 9, 2024 • 98 • 35

OpenGVLab/InternViT-6B-448px-V1-0

Image Feature Extraction • Updated Dec 9, 2024 • 54 • 9

OpenGVLab/InternViT-6B-448px-V1-2

Image Feature Extraction • 6B • Updated Dec 9, 2024 • 62 • 26

OpenGVLab/InternViT-6B-448px-V2_5

Image Feature Extraction • 6B • Updated Dec 9, 2024 • 1.25k • 48

OpenGVLab/InternViT-300M-448px-V2_5

Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 5.2k • 48

OpenGVLab/InternViT-6B-448px-V1-5

Image Feature Extraction • 6B • Updated Dec 9, 2024 • 521 • 77

OpenGVLab/InternVideo2-Stage2-6B-Audio

Updated Nov 27, 2024 • 2

OpenGVLab/InternVideo2-Chat-8B

Video-Text-to-Text • 8B • Updated Oct 10, 2024 • 686 • 23

OpenGVLab/InternVideo2_Chat_8B_InternLM2_5

Video-Text-to-Text • 9B • Updated Sep 19, 2024 • 40 • 7

OpenGVLab/ViCLIP-L-14-hf

0.4B • Updated Sep 17, 2024 • 19.7k • 1