Latest commit: Update README.md (f950c8a, verified)

Files (size, last commit message):
- 1.68 kB Rename phi4-mini-8dq4w.pte to phi4-mini-8da4w.pte
- 1.04 kB Create LICENSE
- 10.3 kB Update README.md
- 249 Bytes Upload tokenizer
- 423 Bytes Upload tokenizer
- 4.85 kB Upload Phi3ForCausalLM
- 174 Bytes Upload Phi3ForCausalLM
- 2.42 MB Upload tokenizer
- 3.02 GB Rename phi4-mini-8dq4w.pte to phi4-mini-8da4w.pte
- 4.81 GB Upload Phi3ForCausalLM (pytorch_model.bin)
- 4.81 GB Rename phi4-mini-8da4w-converted.bin to pytorch_model_converted.bin (pytorch_model_converted.bin)
- 587 Bytes Upload tokenizer
- 15.5 MB Upload tokenizer
- 2.52 kB Upload tokenizer
- 3.91 MB Upload tokenizer

The pickle scanner detects 17 imports in pytorch_model.bin; pytorch_model_converted.bin references the same 17:

- "torchao.dtypes.uintx.q_dq_layout.QDQTensorImpl"
- "torchao.dtypes.uintx.q_dq_layout.QDQLayout"
- "torchao.dtypes.affine_quantized_tensor.AffineQuantizedTensor"
- "torchao.quantization.linear_activation_quantized_tensor.LinearActivationQuantizedTensor"
- "torchao.quantization.quant_api._int8_asymm_per_token_quant"
- "torchao.quantization.quant_primitives.ZeroPointDomain"
- "torch._utils._rebuild_tensor_v2"
- "torch._utils._rebuild_wrapper_subclass"
- "torch._tensor._rebuild_from_type_v2"
- "torch.serialization._get_layout"
- "torch.CharStorage"
- "torch.FloatStorage"
- "torch.BFloat16Storage"
- "torch.int8"
- "torch.float32"
- "torch.device"
- "collections.OrderedDict"
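The pickle-import list above is what a scanner can recover by walking a checkpoint's pickle opcode stream without ever unpickling it, which is why it can be reported safely for an untrusted file. A minimal standard-library sketch of the same idea (the helper name `pickle_imports` is ours, and the `STACK_GLOBAL` handling is a simplification, not the Hub's actual implementation):

```python
import pickletools

def pickle_imports(data: bytes) -> set[str]:
    """Collect the module.name globals a pickle stream references,
    without executing (unpickling) it."""
    imports = set()
    strings = []  # recently pushed string constants, used by STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # protocols <= 3: argument is "module name" in one string
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name in ("SHORT_BINUNICODE", "BINUNICODE", "UNICODE"):
            strings.append(arg)
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # protocols >= 4: module and name were pushed as the two
            # most recently seen string constants
            imports.add(f"{strings[-2]}.{strings[-1]}")
    return imports
```

For a real `.bin` checkpoint you would feed this the bytes of the pickle member inside the `torch.save` zip archive. Imports such as the `torchao` tensor subclasses listed above mean the file can only be loaded in an environment where those packages are importable, and only via code paths that trust arbitrary pickled globals.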