qaihm-bot's picture
v0.51.0
94dbb96 verified
metadata
library_name: pytorch
license: other
tags:
  - android
pipeline_tag: image-segmentation

DeepLabV3-Plus-MobileNet: Optimized for Qualcomm Devices

DeepLabV3 is designed for semantic segmentation at multiple scales, trained on the various datasets. It uses MobileNet as a backbone.

This is based on the implementation of DeepLabV3-Plus-MobileNet found here. This repository contains pre-exported model files optimized for Qualcomm® devices. You can use the Qualcomm® AI Hub Models library to export with custom configurations. More details on model performance across various devices, can be found here.

Qualcomm AI Hub Models uses Qualcomm AI Hub Workbench to compile, profile, and evaluate this model. Sign up to run these models on a hosted Qualcomm® device.

Getting Started

There are two ways to deploy this model on your device:

Option 1: Download Pre-Exported Models

Below are pre-exported model assets ready for deployment.

Runtime Precision Chipset SDK Versions Download
ONNX float Universal QAIRT 2.42, ONNX Runtime 1.24.3 Download
ONNX w8a16 Universal QAIRT 2.42, ONNX Runtime 1.24.3 Download
ONNX w8a8 Universal QAIRT 2.42, ONNX Runtime 1.24.3 Download
QNN_DLC float Universal QAIRT 2.45 Download
QNN_DLC w8a16 Universal QAIRT 2.45 Download
QNN_DLC w8a8 Universal QAIRT 2.45 Download
TFLITE float Universal QAIRT 2.45 Download
TFLITE w8a8 Universal QAIRT 2.45 Download

For more device-specific assets and performance metrics, visit DeepLabV3-Plus-MobileNet on Qualcomm® AI Hub.

Option 2: Export with Custom Configurations

Use the Qualcomm® AI Hub Models Python library to compile and export the model with your own:

  • Custom weights (e.g., fine-tuned checkpoints)
  • Custom input shapes
  • Target device and runtime configurations

This option is ideal if you need to customize the model beyond the default configuration provided here.

See our repository for DeepLabV3-Plus-MobileNet on GitHub for usage instructions.

Model Details

Model Type: Model_use_case.semantic_segmentation

Model Stats:

  • Model checkpoint: VOC2012
  • Input resolution: 513x513
  • Number of output classes: 21
  • Number of parameters: 5.80M
  • Model size (float): 22.2 MB
  • Model size (w8a16): 6.67 MB

Performance Summary

Model Runtime Precision Chipset Inference Time (ms) Peak Memory Range (MB) Primary Compute Unit
DeepLabV3-Plus-MobileNet ONNX float Snapdragon® 8 Elite Gen 5 Mobile 4.302 ms 2 - 176 MB NPU
DeepLabV3-Plus-MobileNet ONNX float Snapdragon® X2 Elite 5.394 ms 10 - 10 MB NPU
DeepLabV3-Plus-MobileNet ONNX float Snapdragon® X Elite 11.053 ms 10 - 10 MB NPU
DeepLabV3-Plus-MobileNet ONNX float Snapdragon® 8 Gen 3 Mobile 7.19 ms 4 - 211 MB NPU
DeepLabV3-Plus-MobileNet ONNX float Qualcomm® QCS8550 (Proxy) 10.361 ms 3 - 131 MB NPU
DeepLabV3-Plus-MobileNet ONNX float Qualcomm® QCS9075 18.058 ms 3 - 6 MB NPU
DeepLabV3-Plus-MobileNet ONNX float Snapdragon® 8 Elite For Galaxy Mobile 5.893 ms 2 - 170 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Snapdragon® 8 Elite Gen 5 Mobile 2.962 ms 0 - 207 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Snapdragon® X2 Elite 3.711 ms 7 - 7 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Snapdragon® X Elite 7.659 ms 5 - 5 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Snapdragon® 8 Gen 3 Mobile 5.092 ms 2 - 220 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Qualcomm® QCS6490 1223.297 ms 89 - 92 MB CPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Qualcomm® QCS8550 (Proxy) 7.103 ms 2 - 158 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Qualcomm® QCS9075 8.613 ms 2 - 4 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Qualcomm® QCM6690 588.21 ms 115 - 123 MB CPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Snapdragon® 8 Elite For Galaxy Mobile 3.934 ms 0 - 185 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a16 Snapdragon® 7 Gen 4 Mobile 586.218 ms 93 - 101 MB CPU
DeepLabV3-Plus-MobileNet ONNX w8a8 Snapdragon® 8 Elite Gen 5 Mobile 1.463 ms 0 - 179 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a8 Snapdragon® X2 Elite 1.667 ms 5 - 5 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a8 Snapdragon® X Elite 3.921 ms 5 - 5 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a8 Snapdragon® 8 Gen 3 Mobile 2.375 ms 0 - 202 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a8 Qualcomm® QCS8550 (Proxy) 3.524 ms 0 - 9 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a8 Qualcomm® QCS9075 4.464 ms 1 - 4 MB NPU
DeepLabV3-Plus-MobileNet ONNX w8a8 Snapdragon® 8 Elite For Galaxy Mobile 1.907 ms 0 - 176 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Snapdragon® 8 Elite Gen 5 Mobile 4.31 ms 3 - 184 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Snapdragon® X2 Elite 5.59 ms 3 - 3 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Snapdragon® X Elite 12.297 ms 3 - 3 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Snapdragon® 8 Gen 3 Mobile 8.07 ms 3 - 210 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Qualcomm® QCS8275 (Proxy) 58.607 ms 1 - 166 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Qualcomm® QCS8550 (Proxy) 11.534 ms 3 - 142 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Qualcomm® SA8775P 17.524 ms 1 - 168 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Qualcomm® QCS9075 20.402 ms 3 - 8 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Qualcomm® QCS8450 (Proxy) 19.817 ms 3 - 210 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Qualcomm® SA7255P 58.607 ms 1 - 166 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Qualcomm® SA8295P 19.594 ms 0 - 170 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC float Snapdragon® 8 Elite For Galaxy Mobile 6.29 ms 3 - 178 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Snapdragon® 8 Elite Gen 5 Mobile 3.074 ms 2 - 195 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Snapdragon® X2 Elite 4.296 ms 2 - 2 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Snapdragon® X Elite 9.137 ms 2 - 2 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Snapdragon® 8 Gen 3 Mobile 6.127 ms 0 - 213 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® QCS6490 33.306 ms 1 - 4 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® QCS8275 (Proxy) 21.99 ms 2 - 182 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® QCS8550 (Proxy) 8.439 ms 2 - 3 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® SA8775P 8.912 ms 2 - 184 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® QCS9075 9.741 ms 2 - 5 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® QCM6690 104.611 ms 2 - 233 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® QCS8450 (Proxy) 12.074 ms 2 - 215 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® SA7255P 21.99 ms 2 - 182 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Qualcomm® SA8295P 13.71 ms 2 - 198 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Snapdragon® 8 Elite For Galaxy Mobile 4.371 ms 2 - 180 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a16 Snapdragon® 7 Gen 4 Mobile 12.08 ms 2 - 181 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Snapdragon® 8 Elite Gen 5 Mobile 1.614 ms 1 - 176 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Snapdragon® X2 Elite 2.12 ms 1 - 1 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Snapdragon® X Elite 4.623 ms 1 - 1 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Snapdragon® 8 Gen 3 Mobile 2.902 ms 0 - 196 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® QCS6490 15.615 ms 3 - 5 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® QCS8275 (Proxy) 10.789 ms 1 - 169 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® QCS8550 (Proxy) 4.162 ms 1 - 78 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® SA8775P 4.709 ms 1 - 172 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® QCS9075 5.13 ms 3 - 5 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® QCM6690 47.589 ms 1 - 201 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® QCS8450 (Proxy) 6.644 ms 1 - 197 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® SA7255P 10.789 ms 1 - 169 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Qualcomm® SA8295P 6.394 ms 1 - 168 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Snapdragon® 8 Elite For Galaxy Mobile 2.187 ms 1 - 173 MB NPU
DeepLabV3-Plus-MobileNet QNN_DLC w8a8 Snapdragon® 7 Gen 4 Mobile 6.269 ms 1 - 176 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Snapdragon® 8 Elite Gen 5 Mobile 4.312 ms 0 - 183 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Snapdragon® 8 Gen 3 Mobile 8.09 ms 0 - 215 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Qualcomm® QCS8275 (Proxy) 58.638 ms 0 - 168 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Qualcomm® QCS8550 (Proxy) 11.479 ms 0 - 2 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Qualcomm® SA8775P 17.543 ms 0 - 168 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Qualcomm® QCS9075 19.566 ms 0 - 18 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Qualcomm® QCS8450 (Proxy) 19.887 ms 0 - 212 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Qualcomm® SA7255P 58.638 ms 0 - 168 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Qualcomm® SA8295P 19.618 ms 0 - 170 MB NPU
DeepLabV3-Plus-MobileNet TFLITE float Snapdragon® 8 Elite For Galaxy Mobile 6.281 ms 0 - 177 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Snapdragon® 8 Elite Gen 5 Mobile 1.585 ms 0 - 183 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Snapdragon® 8 Gen 3 Mobile 2.942 ms 0 - 200 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® QCS6490 15.911 ms 0 - 11 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® QCS8275 (Proxy) 10.839 ms 0 - 174 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® QCS8550 (Proxy) 4.355 ms 0 - 102 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® SA8775P 4.622 ms 0 - 176 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® QCS9075 5.114 ms 0 - 9 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® QCM6690 47.33 ms 0 - 196 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® QCS8450 (Proxy) 5.745 ms 0 - 203 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® SA7255P 10.839 ms 0 - 174 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Qualcomm® SA8295P 6.255 ms 0 - 172 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Snapdragon® 8 Elite For Galaxy Mobile 2.081 ms 0 - 174 MB NPU
DeepLabV3-Plus-MobileNet TFLITE w8a8 Snapdragon® 7 Gen 4 Mobile 6.363 ms 0 - 178 MB NPU

License

  • The license for the original implementation of DeepLabV3-Plus-MobileNet can be found here.

References

Community