VLMHardPrompt

community

Recent Activity

dxli1

authored a paper about 2 months ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19 • 54

teowu

authored a paper 6 months ago

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Paper • 2505.23359 • Published May 29 • 39

JunnanLi

authored 3 papers 6 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19 • 23

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8 • 26

guoyinwang

authored a paper 7 months ago

FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset

Paper • 2503.07091 • Published Mar 10 • 3

teowu

authored 3 papers 7 months ago

Teaching LMMs for Image Quality Scoring and Interpreting

Paper • 2503.09197 • Published Mar 12 • 1

Generative Frame Sampler for Long Video Understanding

Paper • 2503.09146 • Published Mar 12 • 1

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 132

guoyinwang

authored a paper 7 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

JunnanLi

authored a paper 8 months ago

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Paper • 2503.06885 • Published Mar 10 • 4

guoyinwang

authored 4 papers 9 months ago

15M Multimodal Facial Image-Text Dataset

Paper • 2407.08515 • Published Jul 11, 2024

Reinforcement Learning Enhanced LLMs: A Survey

Paper • 2412.10400 • Published Dec 5, 2024

Aligning Instruction Tuning with Pre-training

Paper • 2501.09368 • Published Jan 16

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 104

JunnanLi

authored a paper 10 months ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published Jan 31 • 39

guoyinwang

authored 4 papers 12 months ago

Deconvolutional Paragraph Representation Learning

Paper • 1708.04729 • Published Aug 16, 2017

Are Human-generated Demonstrations Necessary for In-context Learning?

Paper • 2309.14681 • Published Sep 26, 2023 • 1

Towards Building the Federated GPT: Federated Instruction Tuning

Paper • 2305.05644 • Published May 9, 2023 • 5

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

Paper • 2401.05507 • Published Jan 10, 2024 • 1