Papers
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation
- UCSC-VLAA/openvision-vit-tiny-patch16-224
  Image Feature Extraction • Updated • 28
- UCSC-VLAA/openvision-vit-tiny-patch8-224
  Image Feature Extraction • Updated • 15
- UCSC-VLAA/openvision-vit-tiny-patch16-384
  Image Feature Extraction • Updated • 392
- UCSC-VLAA/openvision-vit-tiny-patch8-160
  Image Feature Extraction • Updated
- UCSC-VLAA/MedReason-8B
  Question Answering • 8B • Updated • 750 • 14
- UCSC-VLAA/MedReason-Mistral
  Question Answering • 266k • Updated • 15
- UCSC-VLAA/MedReason
  Viewer • Updated • 32.7k • 714 • 76
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
  Paper • 2504.00993 • Published • 2
- UCSC-VLAA/ViT-bigG-14-CLIPA-datacomp1B
  Zero-Shot Image Classification • Updated • 1k • 4
- UCSC-VLAA/ViT-bigG-14-CLIPA-336-datacomp1B
  Zero-Shot Image Classification • Updated • 984 • 4
- UCSC-VLAA/ViT-L-14-CLIPA-336-datacomp1B
  Zero-Shot Image Classification • Updated • 62 • 2
- UCSC-VLAA/ViT-L-14-CLIPA-datacomp1B
  Zero-Shot Image Classification • Updated • 375 • 2
- UCSC-VLAA/gpt-image-edit-training
  Image-to-Image • Updated • 25
- UCSC-VLAA/GPT-Image-Edit-1.5M
  Viewer • Updated • 2.78M • 9.53k • 66
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
  Paper • 2507.21033 • Published • 20
- UCSC-VLAA/gpt-image-edit-benchmark-results
  Viewer • Updated • 1.21k • 162 • 1
- UCSC-VLAA/MedVLThinker-3B-SFT_m23k
  Image-Text-to-Text • 4B • Updated • 22
- UCSC-VLAA/MedVLThinker-3B-SFT_PMC
  Image-Text-to-Text • 4B • Updated • 9
- UCSC-VLAA/MedVLThinker-7B-SFT_m23k
  Image-Text-to-Text • 8B • Updated • 13
- UCSC-VLAA/MedVLThinker-3B-SFT_m23k-RL_PMC
  Image-Text-to-Text • 4B • Updated • 14 • 1
- UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B
  Image-Text-to-Text • 4B • Updated • 611 • 5
- UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-7B
  Image-Text-to-Text • 8B • Updated • 2.28k • 2
- UCSC-VLAA/VLAA-Thinker-Qwen2VL-2B
  Image-Text-to-Text • 2B • Updated • 58 • 1
- UCSC-VLAA/VLAA-Thinker-Qwen2VL-7B
  Image-Text-to-Text • 8B • Updated • 37
- UCSC-VLAA/m1-7B-1K
  Question Answering • 8B • Updated • 14 • 1
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models
  Paper • 2504.00869 • Published • 10
- UCSC-VLAA/m1-32B-1K
  Question Answering • 33B • Updated • 7
- UCSC-VLAA/m1-7B-23K
  Question Answering • 8B • Updated • 13
CLIPS