Qwen3-vl-4b-fp8
#4 opened 27 days ago
by
Cia8868
vllm version for inference of Qwen/Qwen3-VL-4B-Instruct-FP8 and Qwen/Qwen3-VL-4B-Instruct
#3 opened 2 months ago
by
saiyanhuang
VRAM usage not making sense
1
#2 opened 3 months ago
by
spanspek