Qwen3 4B Instruct 2507 x Polaris Alpha

This is a non-reasoning model trained on 1,000 examples from Polaris Alpha, an early snapshot of GPT-5.1 with reasoning effort set to minimal.

GGUFs available here

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Safetensors

Model size

4B params

Tensor type

BF16

Model tree for TeichAI/Qwen3-4B-Instruct-2507-Polaris-Alpha-Distill

Base model

Finetuned

Finetuned

(194)

this model

Finetunes

Merges

Quantizations