Batch vs individual inference output mismatch
#9 opened about 2 months ago by E1eMental
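A common cause of batch vs. single-prompt mismatches is right padding without an attention mask during generation. A minimal sketch, assuming a causal LM loaded through transformers ("org/model" is a placeholder, not this repo):

```python
# Sketch: left-pad and pass the attention mask so batched greedy decoding
# matches single-prompt runs. "org/model" is a placeholder model ID.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "org/model"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id, padding_side="left")
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token  # many chat models ship no pad token

model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16).cuda()

prompts = ["Describe a cat.", "Describe a dog."]
inputs = tokenizer(prompts, return_tensors="pt", padding=True).to(model.device)

# do_sample=False removes sampling noise; any remaining difference usually
# comes from the padding side or a missing attention_mask.
out = model.generate(**inputs, max_new_tokens=32, do_sample=False)
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```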
torch.OutOfMemoryError: CUDA out of memory
#8 opened about 2 months ago by shadowT
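For CUDA OOM errors, the usual first steps are half-precision weights, device_map offloading, and inference mode. A hedged sketch of those mitigations (placeholder model ID, not a confirmed fix for this repo):

```python
# Sketch of common OOM mitigations; "org/model" is a placeholder ID.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "org/model",
    torch_dtype=torch.bfloat16,  # half precision roughly halves weight memory
    device_map="auto",           # shard or offload layers across devices/CPU
)
model.eval()

# inference_mode() drops autograd state that would otherwise pile up on GPU.
with torch.inference_mode():
    # run model.generate(...) here with a small batch and max_new_tokens
    pass
```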
Inference seems to be very slow on A100 even when flash_attn is enabled
#7 opened about 2 months ago by boydcheung
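If throughput on an A100 looks low, one thing worth checking is whether flash attention is actually active: it only engages when the model is loaded with the flag below and run in fp16/bf16 on a supported GPU. A sketch under that assumption (placeholder model ID):

```python
# Sketch: verify flash attention is in use; "org/model" is a placeholder.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "org/model",
    torch_dtype=torch.bfloat16,               # flash_attn requires fp16/bf16
    attn_implementation="flash_attention_2",  # otherwise a default kernel is used
).cuda()

# If this prints "flash_attention_2", the kernel is wired in.
print(model.config._attn_implementation)
```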
Are these variables implicitly read by the transformers library, or do I need to pass them to the generate function?
#6 opened about 2 months ago by boydcheung
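In general, transformers picks up sampling defaults implicitly from the repo's generation_config.json, and keyword arguments to generate() override them per call. A small sketch of both paths (placeholder model ID):

```python
# Sketch: defaults come from generation_config.json; kwargs override per call.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("org/model")  # placeholder
print(model.generation_config)  # defaults read implicitly from the repo

# Per-call override, assuming `inputs` came from a tokenizer:
# out = model.generate(**inputs, do_sample=True, temperature=0.2, top_p=0.9)
```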
Why are the outputs different?
#5 opened 2 months ago by AAsuka
How different are its hardware requirements from those of the Qwen2-VL-2B?
#4 opened 3 months ago by likewendy
Finetune Its Brain On Text
#3 opened 4 months ago by VINAYU7
GGUFs are here. Tutorials to run locally.
#2 opened 4 months ago by alanzhuly
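Once a GGUF is downloaded, llama-cpp-python is one way to run it locally. A minimal sketch (the file path is a placeholder, and vision inputs may need extra setup beyond this):

```python
# Minimal local-inference sketch with llama-cpp-python; path is a placeholder.
from llama_cpp import Llama

llm = Llama(model_path="model-q4_k_m.gguf", n_ctx=2048)
out = llm("Q: What can this model do? A:", max_tokens=64)
print(out["choices"][0]["text"])
```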
Local Installation Video and Testing - Step by Step
#1 opened 4 months ago by fahdmirzac