Instructions to use OpenVINO/pythia-1b-int4-ov with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenVINO/pythia-1b-int4-ov with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="OpenVINO/pythia-1b-int4-ov")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("OpenVINO/pythia-1b-int4-ov")
model = AutoModelForCausalLM.from_pretrained("OpenVINO/pythia-1b-int4-ov")
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use OpenVINO/pythia-1b-int4-ov with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "OpenVINO/pythia-1b-int4-ov"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "OpenVINO/pythia-1b-int4-ov",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
Use Docker
docker model run hf.co/OpenVINO/pythia-1b-int4-ov
- SGLang
How to use OpenVINO/pythia-1b-int4-ov with SGLang:
Install from pip and serve model
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "OpenVINO/pythia-1b-int4-ov" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "OpenVINO/pythia-1b-int4-ov",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
Use Docker images
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "OpenVINO/pythia-1b-int4-ov" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "OpenVINO/pythia-1b-int4-ov",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'
- Docker Model Runner
How to use OpenVINO/pythia-1b-int4-ov with Docker Model Runner:
docker model run hf.co/OpenVINO/pythia-1b-int4-ov
| <net name="tokenizer" version="11"> | |
| <layers> | |
| <layer id="0" name="Parameter_155250" type="Parameter" version="opset1"> | |
| <data shape="?" element_type="string" /> | |
| <output> | |
| <port id="0" precision="STRING" names="Parameter_155250"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="1" name="Constant_155257" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="0" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="2" name="StringTensorUnpack_155251" type="StringTensorUnpack" version="extension"> | |
| <data mode="begins_ends" /> | |
| <input> | |
| <port id="0" precision="STRING"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="3" name="NormalizeUnicode_155252" type="NormalizeUnicode" version="extension"> | |
| <data normalization_form="NFC" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="4" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="5" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="4" name="ShapeOf_155253" type="ShapeOf" version="opset3"> | |
| <data output_type="i64" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I64"> | |
| <dim>1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="5" name="Constant_155254" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="0" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="6" name="Constant_155255" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="0" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="7" name="Gather_155256" type="Gather" version="opset8"> | |
| <data batch_dims="0" /> | |
| <input> | |
| <port id="0" precision="I64"> | |
| <dim>1</dim> | |
| </port> | |
| <port id="1" precision="I64" /> | |
| <port id="2" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="3" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="8" name="Constant_155258" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="9" name="Range_155259" type="Range" version="opset4"> | |
| <data output_type="i32" /> | |
| <input> | |
| <port id="0" precision="I64" /> | |
| <port id="1" precision="I64" /> | |
| <port id="2" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="10" name="Constant_155261" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="11" name="Constant_155262" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="12" name="Add_155263" type="Add" version="opset1"> | |
| <data auto_broadcast="numpy" /> | |
| <input> | |
| <port id="0" precision="I64" /> | |
| <port id="1" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="2" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="13" name="Constant_155264" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="14" name="Range_155265" type="Range" version="opset4"> | |
| <data output_type="i32" /> | |
| <input> | |
| <port id="0" precision="I64" /> | |
| <port id="1" precision="I64" /> | |
| <port id="2" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="15" name="Constant_155328" type="Const" version="opset1"> | |
| <data element_type="u8" shape="620" offset="16" size="620" /> | |
| <output> | |
| <port id="0" precision="U8"> | |
| <dim>620</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="16" name="RegexSplit_155329" type="RegexSplit" version="extension"> | |
| <data behaviour="isolate" invert="false" max_splits="-1" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="4" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="5" precision="U8"> | |
| <dim>620</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="6" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="7" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="8" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="9" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="10" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="17" name="Constant_155334" type="Const" version="opset1"> | |
| <data element_type="u8" shape="64" offset="636" size="64" /> | |
| <output> | |
| <port id="0" precision="U8"> | |
| <dim>64</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="18" name="Constant_155331" type="Const" version="opset1"> | |
| <data element_type="u8" shape="399" offset="700" size="399" /> | |
| <output> | |
| <port id="0" precision="U8"> | |
| <dim>399</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="19" name="StringTensorUnpack_155332" type="StringTensorUnpack" version="extension"> | |
| <data mode="begins_ends" /> | |
| <input> | |
| <port id="0" precision="U8"> | |
| <dim>399</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="20" name="RegexSplit_155335" type="RegexSplit" version="extension"> | |
| <data behaviour="isolate" invert="false" max_splits="-1" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="4" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="5" precision="U8"> | |
| <dim>64</dim> | |
| </port> | |
| <port id="6" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="7" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="8" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="9" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="10" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="11" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="12" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="13" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="21" name="BytesToChars_155336" type="BytesToChars" version="extension"> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="4" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="5" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="6" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="7" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="8" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="9" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="22" name="Constant_155338" type="Const" version="opset1"> | |
| <data element_type="u8" shape="558397" offset="1099" size="558397" /> | |
| <output> | |
| <port id="0" precision="U8"> | |
| <dim>558397</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="23" name="StringTensorUnpack_155339" type="StringTensorUnpack" version="extension"> | |
| <data mode="begins_ends" /> | |
| <input> | |
| <port id="0" precision="U8"> | |
| <dim>558397</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="24" name="Constant_155419" type="Const" version="opset1"> | |
| <data element_type="u8" shape="606619" offset="559496" size="606619" /> | |
| <output> | |
| <port id="0" precision="U8"> | |
| <dim>606619</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="25" name="StringTensorUnpack_155420" type="StringTensorUnpack" version="extension"> | |
| <data mode="begins_ends" /> | |
| <input> | |
| <port id="0" precision="U8"> | |
| <dim>606619</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="26" name="Constant_155347" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="0" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="27" name="Constant_155341" type="Const" version="opset1"> | |
| <data element_type="u8" shape="399" offset="700" size="399" /> | |
| <output> | |
| <port id="0" precision="U8"> | |
| <dim>399</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="28" name="StringTensorUnpack_155342" type="StringTensorUnpack" version="extension"> | |
| <data mode="begins_ends" /> | |
| <input> | |
| <port id="0" precision="U8"> | |
| <dim>399</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="29" name="ShapeOf_155343" type="ShapeOf" version="opset3"> | |
| <data output_type="i64" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I64"> | |
| <dim>1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="30" name="Constant_155344" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="0" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="31" name="Constant_155345" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="0" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="32" name="Gather_155346" type="Gather" version="opset8"> | |
| <data batch_dims="0" /> | |
| <input> | |
| <port id="0" precision="I64"> | |
| <dim>1</dim> | |
| </port> | |
| <port id="1" precision="I64" /> | |
| <port id="2" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="3" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="33" name="Constant_155348" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="34" name="Range_155349" type="Range" version="opset4"> | |
| <data output_type="i32" /> | |
| <input> | |
| <port id="0" precision="I64" /> | |
| <port id="1" precision="I64" /> | |
| <port id="2" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="35" name="Constant_155351" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="36" name="Constant_155352" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="37" name="Add_155353" type="Add" version="opset1"> | |
| <data auto_broadcast="numpy" /> | |
| <input> | |
| <port id="0" precision="I64" /> | |
| <port id="1" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="2" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="38" name="Constant_155354" type="Const" version="opset1"> | |
| <data element_type="i64" shape="" offset="8" size="8" /> | |
| <output> | |
| <port id="0" precision="I64" /> | |
| </output> | |
| </layer> | |
| <layer id="39" name="Range_155355" type="Range" version="opset4"> | |
| <data output_type="i32" /> | |
| <input> | |
| <port id="0" precision="I64" /> | |
| <port id="1" precision="I64" /> | |
| <port id="2" precision="I64" /> | |
| </input> | |
| <output> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="40" name="BytesToChars_155417" type="BytesToChars" version="extension"> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="4" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="5" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="6" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="7" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="8" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="9" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="41" name="Constant_155421" type="Const" version="opset1"> | |
| <data element_type="i32" shape="23" offset="1166115" size="92" /> | |
| <output> | |
| <port id="0" precision="I32"> | |
| <dim>23</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="42" name="BPETokenizer_155422" type="BPETokenizer" version="extension"> | |
| <data unk_token="" fuse_unk="false" suffix_indicator="" end_suffix="" byte_fallback="false" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="4" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="5" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="6" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="7" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="8" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="9" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="10" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="11" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="12" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="13" precision="U8"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="14" precision="I32"> | |
| <dim>23</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="15" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="16" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="17" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="43" name="Subtract_155423" type="Subtract" version="opset1"> | |
| <data auto_broadcast="numpy" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="44" name="Constant_155424" type="Const" version="opset1"> | |
| <data element_type="i32" shape="" offset="1166207" size="4" /> | |
| <output> | |
| <port id="0" precision="I32" /> | |
| </output> | |
| </layer> | |
| <layer id="45" name="Minimum_155425" type="Minimum" version="opset1"> | |
| <data auto_broadcast="numpy" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32" /> | |
| </input> | |
| <output> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="46" name="Add_155426" type="Add" version="opset1"> | |
| <data auto_broadcast="numpy" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="47" name="Constant_155427" type="Const" version="opset1"> | |
| <data element_type="i32" shape="1" offset="1166211" size="4" /> | |
| <output> | |
| <port id="0" precision="I32"> | |
| <dim>1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="48" name="CombineSegments_155428" type="CombineSegments" version="extension"> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="I32"> | |
| <dim>1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="4" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="5" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="6" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="7" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="8" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="9" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="49" name="Subtract_155429" type="Subtract" version="opset1"> | |
| <data auto_broadcast="numpy" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="50" name="Constant_155430" type="Const" version="opset1"> | |
| <data element_type="i32" shape="" offset="1166211" size="4" /> | |
| <output> | |
| <port id="0" precision="I32" /> | |
| </output> | |
| </layer> | |
| <layer id="51" name="ReduceMax_155431" type="ReduceMax" version="opset1"> | |
| <data keep_dims="false" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32" /> | |
| </input> | |
| <output> | |
| <port id="2" precision="I32" /> | |
| </output> | |
| </layer> | |
| <layer id="52" name="Constant_155432" type="Const" version="opset1"> | |
| <data element_type="i32" shape="" offset="1166211" size="4" /> | |
| <output> | |
| <port id="0" precision="I32" /> | |
| </output> | |
| </layer> | |
| <layer id="53" name="RaggedToDense_155433" type="RaggedToDense" version="extension"> | |
| <data pad_right="true" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="2" precision="I32"> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="3" precision="I32" /> | |
| <port id="4" precision="I32" /> | |
| </input> | |
| <output> | |
| <port id="5" precision="I32"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| <port id="6" precision="BOOL"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="54" name="Convert_155434" type="Convert" version="opset1"> | |
| <data destination_type="i32" /> | |
| <input> | |
| <port id="0" precision="BOOL"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I32"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="55" name="Convert_155434" type="Convert" version="opset1"> | |
| <data destination_type="i64" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I64" names="attention_mask"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="57" name="RaggedToDense_155433.0" type="Convert" version="opset1"> | |
| <data destination_type="i64" /> | |
| <input> | |
| <port id="0" precision="I32"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| <output> | |
| <port id="1" precision="I64" names="input_ids"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </output> | |
| </layer> | |
| <layer id="58" name="Result_155437" type="Result" version="opset1"> | |
| <input> | |
| <port id="0" precision="I64"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| </layer> | |
| <layer id="56" name="Result_155439" type="Result" version="opset1"> | |
| <input> | |
| <port id="0" precision="I64"> | |
| <dim>-1</dim> | |
| <dim>-1</dim> | |
| </port> | |
| </input> | |
| </layer> | |
| </layers> | |
| <edges> | |
| <edge from-layer="0" from-port="0" to-layer="2" to-port="0" /> | |
| <edge from-layer="1" from-port="0" to-layer="9" to-port="0" /> | |
| <edge from-layer="2" from-port="1" to-layer="3" to-port="0" /> | |
| <edge from-layer="2" from-port="2" to-layer="3" to-port="1" /> | |
| <edge from-layer="2" from-port="3" to-layer="3" to-port="2" /> | |
| <edge from-layer="3" from-port="5" to-layer="16" to-port="4" /> | |
| <edge from-layer="3" from-port="4" to-layer="16" to-port="3" /> | |
| <edge from-layer="3" from-port="3" to-layer="16" to-port="2" /> | |
| <edge from-layer="3" from-port="3" to-layer="4" to-port="0" /> | |
| <edge from-layer="4" from-port="1" to-layer="7" to-port="0" /> | |
| <edge from-layer="5" from-port="0" to-layer="7" to-port="1" /> | |
| <edge from-layer="6" from-port="0" to-layer="7" to-port="2" /> | |
| <edge from-layer="7" from-port="3" to-layer="9" to-port="1" /> | |
| <edge from-layer="7" from-port="3" to-layer="12" to-port="0" /> | |
| <edge from-layer="8" from-port="0" to-layer="9" to-port="2" /> | |
| <edge from-layer="9" from-port="3" to-layer="16" to-port="0" /> | |
| <edge from-layer="10" from-port="0" to-layer="14" to-port="0" /> | |
| <edge from-layer="11" from-port="0" to-layer="12" to-port="1" /> | |
| <edge from-layer="12" from-port="2" to-layer="14" to-port="1" /> | |
| <edge from-layer="13" from-port="0" to-layer="14" to-port="2" /> | |
| <edge from-layer="14" from-port="3" to-layer="16" to-port="1" /> | |
| <edge from-layer="15" from-port="0" to-layer="16" to-port="5" /> | |
| <edge from-layer="16" from-port="6" to-layer="20" to-port="0" /> | |
| <edge from-layer="16" from-port="7" to-layer="20" to-port="1" /> | |
| <edge from-layer="16" from-port="8" to-layer="20" to-port="2" /> | |
| <edge from-layer="16" from-port="9" to-layer="20" to-port="3" /> | |
| <edge from-layer="16" from-port="10" to-layer="20" to-port="4" /> | |
| <edge from-layer="17" from-port="0" to-layer="20" to-port="5" /> | |
| <edge from-layer="18" from-port="0" to-layer="19" to-port="0" /> | |
| <edge from-layer="19" from-port="1" to-layer="20" to-port="6" /> | |
| <edge from-layer="19" from-port="2" to-layer="20" to-port="7" /> | |
| <edge from-layer="19" from-port="3" to-layer="20" to-port="8" /> | |
| <edge from-layer="20" from-port="9" to-layer="21" to-port="0" /> | |
| <edge from-layer="20" from-port="13" to-layer="21" to-port="4" /> | |
| <edge from-layer="20" from-port="12" to-layer="21" to-port="3" /> | |
| <edge from-layer="20" from-port="10" to-layer="21" to-port="1" /> | |
| <edge from-layer="20" from-port="11" to-layer="21" to-port="2" /> | |
| <edge from-layer="21" from-port="9" to-layer="42" to-port="4" /> | |
| <edge from-layer="21" from-port="8" to-layer="42" to-port="3" /> | |
| <edge from-layer="21" from-port="7" to-layer="42" to-port="2" /> | |
| <edge from-layer="21" from-port="6" to-layer="42" to-port="1" /> | |
| <edge from-layer="21" from-port="5" to-layer="42" to-port="0" /> | |
| <edge from-layer="22" from-port="0" to-layer="23" to-port="0" /> | |
| <edge from-layer="23" from-port="3" to-layer="42" to-port="7" /> | |
| <edge from-layer="23" from-port="2" to-layer="42" to-port="6" /> | |
| <edge from-layer="23" from-port="1" to-layer="42" to-port="5" /> | |
| <edge from-layer="24" from-port="0" to-layer="25" to-port="0" /> | |
| <edge from-layer="25" from-port="3" to-layer="42" to-port="10" /> | |
| <edge from-layer="25" from-port="2" to-layer="42" to-port="9" /> | |
| <edge from-layer="25" from-port="1" to-layer="42" to-port="8" /> | |
| <edge from-layer="26" from-port="0" to-layer="34" to-port="0" /> | |
| <edge from-layer="27" from-port="0" to-layer="28" to-port="0" /> | |
| <edge from-layer="28" from-port="2" to-layer="40" to-port="3" /> | |
| <edge from-layer="28" from-port="1" to-layer="29" to-port="0" /> | |
| <edge from-layer="28" from-port="3" to-layer="40" to-port="4" /> | |
| <edge from-layer="28" from-port="1" to-layer="40" to-port="2" /> | |
| <edge from-layer="29" from-port="1" to-layer="32" to-port="0" /> | |
| <edge from-layer="30" from-port="0" to-layer="32" to-port="1" /> | |
| <edge from-layer="31" from-port="0" to-layer="32" to-port="2" /> | |
| <edge from-layer="32" from-port="3" to-layer="34" to-port="1" /> | |
| <edge from-layer="32" from-port="3" to-layer="37" to-port="0" /> | |
| <edge from-layer="33" from-port="0" to-layer="34" to-port="2" /> | |
| <edge from-layer="34" from-port="3" to-layer="40" to-port="0" /> | |
| <edge from-layer="35" from-port="0" to-layer="39" to-port="0" /> | |
| <edge from-layer="36" from-port="0" to-layer="37" to-port="1" /> | |
| <edge from-layer="37" from-port="2" to-layer="39" to-port="1" /> | |
| <edge from-layer="38" from-port="0" to-layer="39" to-port="2" /> | |
| <edge from-layer="39" from-port="3" to-layer="40" to-port="1" /> | |
| <edge from-layer="40" from-port="7" to-layer="42" to-port="11" /> | |
| <edge from-layer="40" from-port="8" to-layer="42" to-port="12" /> | |
| <edge from-layer="40" from-port="9" to-layer="42" to-port="13" /> | |
| <edge from-layer="41" from-port="0" to-layer="42" to-port="14" /> | |
| <edge from-layer="42" from-port="17" to-layer="48" to-port="2" /> | |
| <edge from-layer="42" from-port="15" to-layer="48" to-port="0" /> | |
| <edge from-layer="42" from-port="15" to-layer="46" to-port="0" /> | |
| <edge from-layer="42" from-port="15" to-layer="43" to-port="1" /> | |
| <edge from-layer="42" from-port="16" to-layer="43" to-port="0" /> | |
| <edge from-layer="43" from-port="2" to-layer="45" to-port="0" /> | |
| <edge from-layer="44" from-port="0" to-layer="45" to-port="1" /> | |
| <edge from-layer="45" from-port="2" to-layer="46" to-port="1" /> | |
| <edge from-layer="46" from-port="2" to-layer="48" to-port="1" /> | |
| <edge from-layer="47" from-port="0" to-layer="48" to-port="3" /> | |
| <edge from-layer="48" from-port="5" to-layer="49" to-port="0" /> | |
| <edge from-layer="48" from-port="4" to-layer="49" to-port="1" /> | |
| <edge from-layer="48" from-port="4" to-layer="53" to-port="0" /> | |
| <edge from-layer="48" from-port="5" to-layer="53" to-port="1" /> | |
| <edge from-layer="48" from-port="6" to-layer="53" to-port="2" /> | |
| <edge from-layer="49" from-port="2" to-layer="51" to-port="0" /> | |
| <edge from-layer="50" from-port="0" to-layer="51" to-port="1" /> | |
| <edge from-layer="51" from-port="2" to-layer="53" to-port="3" /> | |
| <edge from-layer="52" from-port="0" to-layer="53" to-port="4" /> | |
| <edge from-layer="53" from-port="6" to-layer="54" to-port="0" /> | |
| <edge from-layer="53" from-port="5" to-layer="57" to-port="0" /> | |
| <edge from-layer="54" from-port="1" to-layer="55" to-port="0" /> | |
| <edge from-layer="55" from-port="1" to-layer="56" to-port="0" /> | |
| <edge from-layer="57" from-port="1" to-layer="58" to-port="0" /> | |
| </edges> | |
| <rt_info> | |
| <eos_token_id value="0" /> | |
| </rt_info> | |
| </net> | |
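
The listing above follows a regular shape: a `<layers>` block of `<layer>` elements (each with an `id`, `name`, and `type`) followed by an `<edges>` block that wires output ports to input ports. As a rough illustration of how such a graph could be inspected, the sketch below parses a short excerpt with Python's standard `xml.etree.ElementTree`. The names `IR_EXCERPT` and `summarize_ir` are hypothetical, the excerpt is a hand-trimmed copy of the first few layers rather than the full file, and the code only reads the XML; it does not run the tokenizer.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hand-trimmed excerpt of the listing above (table pipes removed).
# Only layers 0, 2, and 3 and the edges between them are kept.
IR_EXCERPT = """
<net name="tokenizer" version="11">
  <layers>
    <layer id="0" name="Parameter_155250" type="Parameter" version="opset1">
      <data shape="?" element_type="string" />
      <output>
        <port id="0" precision="STRING" names="Parameter_155250">
          <dim>-1</dim>
        </port>
      </output>
    </layer>
    <layer id="2" name="StringTensorUnpack_155251" type="StringTensorUnpack" version="extension">
      <data mode="begins_ends" />
      <input>
        <port id="0" precision="STRING"><dim>-1</dim></port>
      </input>
      <output>
        <port id="1" precision="I32"><dim>-1</dim></port>
        <port id="2" precision="I32"><dim>-1</dim></port>
        <port id="3" precision="U8"><dim>-1</dim></port>
      </output>
    </layer>
    <layer id="3" name="NormalizeUnicode_155252" type="NormalizeUnicode" version="extension">
      <data normalization_form="NFC" />
      <input>
        <port id="0" precision="I32"><dim>-1</dim></port>
        <port id="1" precision="I32"><dim>-1</dim></port>
        <port id="2" precision="U8"><dim>-1</dim></port>
      </input>
    </layer>
  </layers>
  <edges>
    <edge from-layer="0" from-port="0" to-layer="2" to-port="0" />
    <edge from-layer="2" from-port="1" to-layer="3" to-port="0" />
    <edge from-layer="2" from-port="2" to-layer="3" to-port="1" />
    <edge from-layer="2" from-port="3" to-layer="3" to-port="2" />
  </edges>
</net>
"""


def summarize_ir(xml_text):
    """Return (op type counts, named output ports, layer-to-layer connections)."""
    root = ET.fromstring(xml_text.strip())
    layers = list(root.iter("layer"))

    # Map layer ids to names so edges can be reported readably.
    names = {layer.get("id"): layer.get("name") for layer in layers}

    # Count how many layers of each operation type appear.
    op_counts = Counter(layer.get("type") for layer in layers)

    # Collect output ports that carry a tensor name (the `names` attribute).
    named_ports = {}
    for layer in layers:
        for port in layer.findall("output/port"):
            if port.get("names"):
                named_ports[port.get("names")] = port.get("precision")

    # Resolve each edge to a (producer name, consumer name) pair.
    connections = [
        (names[edge.get("from-layer")], names[edge.get("to-layer")])
        for edge in root.iter("edge")
    ]
    return op_counts, named_ports, connections


if __name__ == "__main__":
    op_counts, named_ports, connections = summarize_ir(IR_EXCERPT)
    print(dict(op_counts))
    print(named_ports)
    for producer, consumer in connections:
        print(f"{producer} -> {consumer}")
```

Run against the complete file, the same function would also pick up the `attention_mask` and `input_ids` port names that appear on the two final `Convert` layers.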