LMMs-Lab-SI

community

https://www.lmms-lab.com/

lmmslab

EvolvingLMMs-Lab

AI & ML interests

Feeling and building the multimodal intelligence.

Recent Activity

yl-1993

authored a paper about 22 hours ago

Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals

Paper • 2510.27684 • Published 11 days ago • 21

edwardyzt

updated a Space 4 days ago

EASI Leaderboard

Evaluation and Analysis for Spatial Intelligence Made Easy

PeterStacy

updated a dataset 4 days ago

lmms-lab-si/EASI-Leaderboard-Data

Preview • Updated 4 days ago • 17

yl-1993

in lmms-lab-si/EASI-Leaderboard-Data 4 days ago

Add README.md

#1 opened 4 days ago by

PeterStacy

in lmms-lab-si/EASI-Leaderboard-Data 4 days ago

Add README.md

#1 opened 4 days ago by

edwardyzt

published a dataset 4 days ago

lmms-lab-si/EASI-Leaderboard-Requests

Updated 4 days ago • 94

edwardyzt

published a Space 4 days ago

EASI Leaderboard

Evaluation and Analysis for Spatial Intelligence Made Easy

PeterStacy

published a dataset 4 days ago

lmms-lab-si/EASI-Leaderboard-Data

Preview • Updated 4 days ago • 17

luodian

authored 4 papers about 1 month ago

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Paper • 2411.15296 • Published Nov 22, 2024 • 21

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 25

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28 • 44

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published Sep 29 • 35

yl-1993

authored 4 papers 3 months ago

SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation

Paper • 2411.19921 • Published Nov 29, 2024

TokensGen: Harnessing Condensed Tokens for Long Video Generation

Paper • 2507.15728 • Published Jul 21 • 7

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Paper • 2508.00599 • Published Aug 1 • 7

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Paper • 2508.13142 • Published Aug 18 • 34

luodian

authored 2 papers 5 months ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published Mar 5 • 46

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 64

yl-1993

authored 2 papers 8 months ago

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

Paper • 2301.07525 • Published Jan 18, 2023

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

Paper • 2308.11473 • Published Aug 22, 2023