AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation
-
UCSC-VLAA/openvision-vit-tiny-patch16-224
Image Feature Extraction • Updated • 3 -
UCSC-VLAA/openvision-vit-tiny-patch8-224
Image Feature Extraction • Updated • 8 -
UCSC-VLAA/openvision-vit-tiny-patch16-384
Image Feature Extraction • Updated • 121 -
UCSC-VLAA/openvision-vit-tiny-patch8-160
Image Feature Extraction • Updated
-
UCSC-VLAA/MedReason-8B
Question Answering • 8B • Updated • 623 • 14 -
UCSC-VLAA/MedReason-Mistral
Question Answering • 266k • Updated • 13 -
UCSC-VLAA/MedReason
Viewer • Updated • 32.7k • 580 • 77 -
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Paper • 2504.00993 • Published • 2
-
UCSC-VLAA/ViT-bigG-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 911 • 4 -
UCSC-VLAA/ViT-bigG-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 103 • 4 -
UCSC-VLAA/ViT-L-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 571 • 2 -
UCSC-VLAA/ViT-L-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 9.93k • 2
-
UCSC-VLAA/HQ-Edit-ckpt
Text-to-Image • Updated • 86 • 13 -
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing
Paper • 2404.09990 • Published • 13 -
UCSC-VLAA/HQ-Edit
Viewer • Updated • 1.64k • 2.09k • 36 -
UCSC-VLAA/HQ-Edit-data-demo
Viewer • Updated • 162 • 82 • 2
-
UCSC-VLAA/gpt-image-edit-training
Image-to-Image • Updated • 25 -
UCSC-VLAA/GPT-Image-Edit-1.5M
Viewer • Updated • 2.78M • 6.37k • 68 -
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
Paper • 2507.21033 • Published • 21 -
UCSC-VLAA/gpt-image-edit-benchmark-results
Viewer • Updated • 1.21k • 2.28k • 1
-
UCSC-VLAA/MedVLThinker-3B-SFT_m23k
Image-Text-to-Text • 4B • Updated • 13 -
UCSC-VLAA/MedVLThinker-3B-SFT_PMC
Image-Text-to-Text • 4B • Updated • 6 -
UCSC-VLAA/MedVLThinker-7B-SFT_m23k
Image-Text-to-Text • 8B • Updated • 10 -
UCSC-VLAA/MedVLThinker-3B-SFT_m23k-RL_PMC
Image-Text-to-Text • 4B • Updated • 7 • 1
-
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B
Image-Text-to-Text • 4B • Updated • 1.01k • 5 -
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B
Image-Text-to-Text • 8B • Updated • 288 • 2 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-2B
Image-Text-to-Text • 2B • Updated • 9 • 1 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B
Image-Text-to-Text • 8B • Updated • 11
-
UCSC-VLAA/m1-7B-1K
Question Answering • 8B • Updated • 9 • 1 -
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models
Paper • 2504.00869 • Published • 10 -
UCSC-VLAA/m1-32B-1K
Question Answering • 33B • Updated • 9 -
UCSC-VLAA/m1-7B-23K
Question Answering • 8B • Updated • 18
CLIPS
-
UCSC-VLAA/Recap-DataComp-1B
Viewer • Updated • 1.88B • 5.66k • 193 -
UCSC-VLAA/Recap-COCO-30K
Viewer • Updated • 30.5k • 104 • 25 -
UCSC-VLAA/ViT-L-16-HTxt-Recap-CLIP
Zero-Shot Image Classification • Updated • 70 • 17 -
tennant/llava-llama-3-8b-hqedit
Text Generation • 8B • Updated • 10 • 17
-
UCSC-VLAA/gpt-image-edit-training
Image-to-Image • Updated • 25 -
UCSC-VLAA/GPT-Image-Edit-1.5M
Viewer • Updated • 2.78M • 6.37k • 68 -
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
Paper • 2507.21033 • Published • 21 -
UCSC-VLAA/gpt-image-edit-benchmark-results
Viewer • Updated • 1.21k • 2.28k • 1
-
UCSC-VLAA/openvision-vit-tiny-patch16-224
Image Feature Extraction • Updated • 3 -
UCSC-VLAA/openvision-vit-tiny-patch8-224
Image Feature Extraction • Updated • 8 -
UCSC-VLAA/openvision-vit-tiny-patch16-384
Image Feature Extraction • Updated • 121 -
UCSC-VLAA/openvision-vit-tiny-patch8-160
Image Feature Extraction • Updated
-
UCSC-VLAA/MedVLThinker-3B-SFT_m23k
Image-Text-to-Text • 4B • Updated • 13 -
UCSC-VLAA/MedVLThinker-3B-SFT_PMC
Image-Text-to-Text • 4B • Updated • 6 -
UCSC-VLAA/MedVLThinker-7B-SFT_m23k
Image-Text-to-Text • 8B • Updated • 10 -
UCSC-VLAA/MedVLThinker-3B-SFT_m23k-RL_PMC
Image-Text-to-Text • 4B • Updated • 7 • 1
-
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B
Image-Text-to-Text • 4B • Updated • 1.01k • 5 -
UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B
Image-Text-to-Text • 8B • Updated • 288 • 2 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-2B
Image-Text-to-Text • 2B • Updated • 9 • 1 -
UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B
Image-Text-to-Text • 8B • Updated • 11
-
UCSC-VLAA/MedReason-8B
Question Answering • 8B • Updated • 623 • 14 -
UCSC-VLAA/MedReason-Mistral
Question Answering • 266k • Updated • 13 -
UCSC-VLAA/MedReason
Viewer • Updated • 32.7k • 580 • 77 -
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Paper • 2504.00993 • Published • 2
-
UCSC-VLAA/m1-7B-1K
Question Answering • 8B • Updated • 9 • 1 -
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models
Paper • 2504.00869 • Published • 10 -
UCSC-VLAA/m1-32B-1K
Question Answering • 33B • Updated • 9 -
UCSC-VLAA/m1-7B-23K
Question Answering • 8B • Updated • 18
CLIPS
-
UCSC-VLAA/ViT-bigG-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 911 • 4 -
UCSC-VLAA/ViT-bigG-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 103 • 4 -
UCSC-VLAA/ViT-L-14-CLIPA-336-datacomp1B
Zero-Shot Image Classification • Updated • 571 • 2 -
UCSC-VLAA/ViT-L-14-CLIPA-datacomp1B
Zero-Shot Image Classification • Updated • 9.93k • 2
-
UCSC-VLAA/Recap-DataComp-1B
Viewer • Updated • 1.88B • 5.66k • 193 -
UCSC-VLAA/Recap-COCO-30K
Viewer • Updated • 30.5k • 104 • 25 -
UCSC-VLAA/ViT-L-16-HTxt-Recap-CLIP
Zero-Shot Image Classification • Updated • 70 • 17 -
tennant/llava-llama-3-8b-hqedit
Text Generation • 8B • Updated • 10 • 17
-
UCSC-VLAA/HQ-Edit-ckpt
Text-to-Image • Updated • 86 • 13 -
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing
Paper • 2404.09990 • Published • 13 -
UCSC-VLAA/HQ-Edit
Viewer • Updated • 1.64k • 2.09k • 36 -
UCSC-VLAA/HQ-Edit-data-demo
Viewer • Updated • 162 • 82 • 2