- Unified Reinforcement and Imitation Learning for Vision-Language Models • Paper • arXiv:2510.19307 • Published Oct 2025
- Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search • Paper • arXiv:2509.07969 • Published Sep 9, 2025
- AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning • Paper • arXiv:2507.12841 • Published Jul 17, 2025
- Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning • Paper • arXiv:2507.14137 • Published Jul 18, 2025
- ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning • Paper • arXiv:2507.16815 • Published Jul 22, 2025
- Pixels, Patterns, but No Poetry: To See The World like Humans • Paper • arXiv:2507.16863 • Published Jul 21, 2025
- Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models • Paper • arXiv:2507.07104 • Published Jul 9, 2025
- google/siglip-so400m-patch14-384 • Model • Zero-Shot Image Classification • 0.9B params • Updated Sep 26, 2024
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation • Paper • arXiv:2506.17202 • Published Jun 20, 2025
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation • Paper • arXiv:2506.03147 • Published Jun 3, 2025
- TIIF-Bench: How Does Your T2I Model Follow Your Instructions? • Paper • arXiv:2506.02161 • Published Jun 2, 2025
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models • Paper • arXiv:2504.11468 • Published Apr 10, 2025
- Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? • Paper • arXiv:2504.13837 • Published Apr 18, 2025
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs • Paper • arXiv:2504.15280 • Published Apr 21, 2025
- Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning • Paper • arXiv:2503.18013 • Published Mar 23, 2025