Instructions to use Qwen/Qwen3.6-27B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Qwen/Qwen3.6-27B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="Qwen/Qwen3.6-27B") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("Qwen/Qwen3.6-27B") model = AutoModelForImageTextToText.from_pretrained("Qwen/Qwen3.6-27B") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- AMD Developer Cloud
- Local Apps
- vLLM
How to use Qwen/Qwen3.6-27B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "Qwen/Qwen3.6-27B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.6-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/Qwen/Qwen3.6-27B
- SGLang
How to use Qwen/Qwen3.6-27B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "Qwen/Qwen3.6-27B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.6-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "Qwen/Qwen3.6-27B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "Qwen/Qwen3.6-27B", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use Qwen/Qwen3.6-27B with Docker Model Runner:
docker model run hf.co/Qwen/Qwen3.6-27B
Qwen3.7 27B without overthinking
#38 opened about 17 hours ago
by
Duonglv
We rewrote the multimodal processor for Qwen 3.6 and we reduced latency from 11s to 80ms
π 1
#37 opened 4 days ago
by
zakariaelhjouji
Add MMMU-Pro evaluation result
#36 opened 7 days ago
by
SaylorTwift
feat: Add native NVFP4 MTP GGUF community quantization link
#35 opened 7 days ago
by
nilayparikh
set enable thining False but response will output <think> or None ..
π₯ 1
#34 opened 13 days ago
by
titanium2000
90.0% on SWE-bench Verified using Qwen3.6-27B-FP8 β open-weight, consumer GPUs, full transparency
π 2
#33 opened 13 days ago
by
mrguo
When will QWEN 3.6 MAX be released?
π 2
#32 opened 19 days ago
by
ZeroWw
Possible future 32 GB single-GPU variant?
π₯ 3
3
#31 opened 22 days ago
by
vanbukin
Qwen3.7-20B-A4B
π 6
2
#30 opened 22 days ago
by
fenglui
When Qwen 3.6 2b / 4b?
π₯ 1
#29 opened 23 days ago
by
sprapp
GGUF available β Cerebellum v4 (ablation-guided mixed-precision)
π 2
#28 opened 25 days ago
by
deucebucket
function_call do not give me report usually
2
#27 opened 27 days ago
by
HeisenbergQ
reasoning_effort ?
1
#26 opened 27 days ago
by
supersmeh
When is 122B-A10B coming?
β 29
1
#25 opened 28 days ago
by
KeilahElla
Fine-tuning Best Practices
#24 opened 28 days ago
by
study-hjt
It seemed to me...
#23 opened 28 days ago
by
EvilinaMaller
Where can I try this?
7
#21 opened 30 days ago
by
mindplay
Update chat_template.jinja
π 1
#20 opened 30 days ago
by
dluthyc
Qwen3.6-27B-AWQ-INT4 for running in vLLM
β 6
2
#19 opened about 1 month ago
by
Duonglv
Qwen3.6 seem to fake record too often
4
#18 opened about 1 month ago
by
anitman
MTP Speculative Decoding absolutely no performance gains
#17 opened about 1 month ago
by
Manuun1
Qwen3.6 27b has template issue in ralph loop on opencode: `No user query found in messages`
1
#16 opened about 1 month ago
by
cropse
Base model release
π 4
#14 opened about 1 month ago
by
aimeri
Anyone is having issues with tool calling with the 3.6 family ? (not just 35b or 27b)
3
#13 opened about 1 month ago
by
LPN64
Qwen3.6-27B is suprisingly good for coding
π₯ 1
1
#12 opened about 1 month ago
by
weisunding
what kind of effect if I increase the expert number used in lm studio?
2
#11 opened about 1 month ago
by
Raffaelelu
Anyone noticed the diffrence of sampling parameters between 27B and 35B-A3B (both 3.6)
π 1
1
#10 opened about 1 month ago
by
bash99
Great model, please release ~16B model with FP8
2
#9 opened about 1 month ago
by
Duonglv
Installation Video and Testing - Step by Step
π 1
#8 opened about 1 month ago
by
fahdmirzac
MOE 122b and 397b please!
π 23
14
#7 opened about 1 month ago
by
jesleocizi
feedback. I just can't hold back my comment!
π₯ 11
#6 opened about 1 month ago
by
EvilinaMaller
Deploying with sglang, weight name not matching
π 1
2
#5 opened about 1 month ago
by
Kyoma001
10/10
π₯ 6
1
#4 opened about 1 month ago
by
qenme
Wan when?
π 1
#3 opened about 1 month ago
by
kabachuha
Add community evaluation results for AIME_2026, GPQA, HLE, HMMT_FEB_2026, MMLU-PRO, SWE-BENCH_PRO, SWE-BENCH_VERIFIED, TERMINAL-BENCH-2.0
1
#2 opened about 1 month ago
by
nielsr
My deepest respect and admiration goes out to the Qwen team!!
πβ€οΈ 62
14
#1 opened about 1 month ago
by
testamentaddress01