Difference between revisions of "Edge AI SDK/AI Framework/OpenVINO"

From ESS-WIKI
Jump to: navigation, search
 
(14 intermediate revisions by 2 users not shown)
Line 14: Line 14:
 
== OpenVINO Runtime SDK ==
 
== OpenVINO Runtime SDK ==
  
<span style="font-size:large;">[https://www.intel.com/content/www/us/en/developer/articles/release-notes/openvino/2024-3.html 2024.3]</span>
+
<span style="font-size:large;">[https://docs.openvino.ai/2023.3/home.html 2023.3]</span>
  
 
<span style="font-size:large;">Overall updates</span>
 
<span style="font-size:large;">Overall updates</span>
Line 21: Line 21:
 
*<span style="font-size:larger;">Symbolic shape inference preview is now available, leading to improved performance for LLMs.&nbsp;</span>  
 
*<span style="font-size:larger;">Symbolic shape inference preview is now available, leading to improved performance for LLMs.&nbsp;</span>  
 
*<span style="font-size:larger;">OpenVINO's graph representation has been upgraded to opset12, introducing a new set of operations that offer enhanced functionality and optimizations.</span>  
 
*<span style="font-size:larger;">OpenVINO's graph representation has been upgraded to opset12, introducing a new set of operations that offer enhanced functionality and optimizations.</span>  
 +
 +
&nbsp;
 +
 +
<span style="font-size:large;">[https://docs.openvino.ai/2024/index.html 2024.3]</span>
 +
 +
<span style="font-size:large;">Overall updates</span>
 +
 
*<span style="font-size:larger;">NPU Device Plugin support.</span>  
 
*<span style="font-size:larger;">NPU Device Plugin support.</span>  
  
Line 29: Line 36:
 
= Applications =
 
= Applications =
  
== Edge AI SDK&nbsp;/ Vision Application ==
+
 
 +
== '''<span style="color:#e74c3c;">OpenVINO <u>2023.3&nbsp;</u>for&nbsp;Edge AI SDK <u>3.0.0</u></span>''' ==
 +
 
 +
 
 +
=== <span style="font-size:larger;">Vision Application</span> ===
 +
 
 +
{| border="1" cellpadding="1" cellspacing="1" style="width: 500px;"
 +
|-
 +
| style="width: 137px;" | Application
 +
| style="width: 215px;" | Model
 +
| style="width: 134px;" | AI Comput Units
 +
|-
 +
| style="width: 137px;" | Object Detection
 +
| style="width: 215px;" | yolov3 (tf)
 +
| style="width: 134px;" | CPU, iGPU, dGPU
 +
|-
 +
| style="width: 137px;" | Person Detection
 +
| style="width: 215px;" | person-detection-retail-0013
 +
| style="width: 134px;" | CPU, iGPU, dGPU
 +
|-
 +
| style="width: 137px;" | Face Detection
 +
| style="width: 215px;" | faceboxes-pytorch
 +
| style="width: 134px;" | CPU, iGPU, dGPU
 +
|-
 +
| style="width: 137px;" | Pose Estimation
 +
| style="width: 215px;" | human-pose-estimation-0001
 +
| style="width: 134px;" | CPU, iGPU, dGPU
 +
|}
 +
 
 +
=== <span style="font-size:larger;">GenAI Application</span> ===
 +
 
 +
{| border="1" cellpadding="1" cellspacing="1" style="width: 500px;"
 +
|-
 +
| style="width: 136px;" | Application
 +
| style="width: 268px;" | SLM Model ( Small Language Model )
 +
| style="width: 116px;" | AI Comput Units
 +
|-
 +
| style="width: 136px;" | SLM Chatbot
 +
| style="width: 268px;" | Llama-2-7b
 +
| style="width: 116px;" | CPU, iGPU
 +
|}
 +
 
 +
=== &nbsp; ===
 +
 
 +
== '''<span style="color:#e74c3c;">OpenVINO <u>2024.3&nbsp;</u>for&nbsp;Edge AI SDK <u>3.1.0</u></span>''' ==
 +
 
 +
 
 +
=== <span style="font-size:larger;">Vision Application</span> ===
  
 
{| border="1" cellpadding="1" cellspacing="1" style="width: 500px;"
 
{| border="1" cellpadding="1" cellspacing="1" style="width: 500px;"
 
|-
 
|-
 
| style="width: 154px;" | Application
 
| style="width: 154px;" | Application
| style="width: 179px;" | Model
+
| style="width: 204px;" | Model
 +
| style="width: 133px;" | AI Comput Units
 
|-
 
|-
 
| style="width: 154px;" | Object Detection
 
| style="width: 154px;" | Object Detection
| style="width: 154px;" | yolox
+
| style="width: 204px;" | yolox
 +
| style="width: 133px;" | CPU, iGPU, NPU
 
|-
 
|-
 
| style="width: 154px;" | Person Detection
 
| style="width: 154px;" | Person Detection
| style="width: 154px;" | person-detection-retail-0013
+
| style="width: 204px;" | person-detection-retail-0013
 +
| style="width: 133px;" | CPU, iGPU, NPU
 
|-
 
|-
 
| style="width: 154px;" | Face Detection
 
| style="width: 154px;" | Face Detection
| style="width: 154px;" | faceboxes-pytorch
+
| style="width: 204px;" | faceboxes-pytorch
 +
| style="width: 133px;" | CPU, iGPU, NPU
 
|-
 
|-
 
| style="width: 154px;" | Pose Estimation
 
| style="width: 154px;" | Pose Estimation
| style="width: 154px;" | human-pose-estimation-0001
+
| style="width: 204px;" | human-pose-estimation-0001
 +
| style="width: 133px;" | CPU, iGPU, NPU
 
|}
 
|}
  
== Edge AI SDK&nbsp;/ GenAI Application ==
+
 
 +
 
 +
=== <span style="font-size:larger;">GenAI Application</span> ===
  
 
{| border="1" cellpadding="1" cellspacing="1" style="width: 500px;"
 
{| border="1" cellpadding="1" cellspacing="1" style="width: 500px;"
 
|-
 
|-
| style="width: 154px;" | Application
+
| style="width: 105px;" | Application
| style="width: 179px;" | Model
+
| style="width: 257px;" | SLM Model ( Small Language Model )
 +
| style="width: 122px;" | AI Comput Units
 
|-
 
|-
| style="width: 154px;" | Chatbot
+
| style="width: 105px;" | SLM Chatbot&nbsp;
| style="width: 154px;" | Llama-3.1-8b-instruct
+
| style="width: 257px;" | Llama-3.1-8b-instruct
 +
| style="width: 122px;" | CPU, iGPU
 
|}
 
|}
  
Line 100: Line 163:
 
<span style="font-size:large;">You can refer more info [https://github.com/intel/xpumanager/blob/master/SMI_README.md xpu-smi]</span>
 
<span style="font-size:large;">You can refer more info [https://github.com/intel/xpumanager/blob/master/SMI_README.md xpu-smi]</span>
  
== SoC Watch ==
+
== intel-gpu-tools ==
 +
 
 +
<span style="font-size:larger;">Intel GPU Tools is a collection of tools for development and testing of the Intel DRM driver. There are many macro-level test suites that get used against the driver, including xtest, rendercheck, piglit, and oglconform, but failures from those can be difficult to track down to kernel changes, and many require complicated build procedures or specific testing environments to get useful results. Therefore, Intel GPU Tools includes low-level tools and tests specifically for development and testing of the Intel DRM Driver.</span>
 +
 
 +
&nbsp;
 +
 
 +
<span style="font-size:large;">Show GPU basic information , sample below</span>
 +
<syntaxhighlight lang="bash">
 +
$ sudo intel_gpu_top
 +
</syntaxhighlight>
 +
 
 +
[[File:AFE360 intel gpu top.png|600x500px|AFE360 intel gpu top.png]]
 +
 
 +
<span style="font-size:large;">You can refer more info [https://github.com/tiagovignatti/intel-gpu-tools intel-gpu-tools]</span>
 +
 
 +
&nbsp;
 +
 
 +
== SoC-Watch ==
  
 
<span style="font-size:larger;">Use Intel® SoC Watch to perform energy analysis on a Linux*, Windows*, or Android* system running on Intel® architecture. Study power consumption in the system and identify behaviors that waste energy. Intel SoC Watch generates a summary text report or you can import results into Intel® VTune™ Profiler.</span>
 
<span style="font-size:larger;">Use Intel® SoC Watch to perform energy analysis on a Linux*, Windows*, or Android* system running on Intel® architecture. Study power consumption in the system and identify behaviors that waste energy. Intel SoC Watch generates a summary text report or you can import results into Intel® VTune™ Profiler.</span>
Line 107: Line 187:
  
 
<span style="font-size:large;">Show NPU basic information , sample below</span>
 
<span style="font-size:large;">Show NPU basic information , sample below</span>
 
&nbsp;
 
 
<syntaxhighlight lang="bash">
 
<syntaxhighlight lang="bash">
 
$ sudo ./socwatch -t 1 -f npu
 
$ sudo ./socwatch -t 1 -f npu
Line 115: Line 193:
 
[[File:AFE360 socwatch.png|600x250px|AFE360 socwatch.png]]
 
[[File:AFE360 socwatch.png|600x250px|AFE360 socwatch.png]]
  
&nbsp;
+
<span style="font-size:large;">You can refer more info [https://www.intel.com/content/www/us/en/docs/socwatch/get-started-guide/2023-1/overview.html Soc-Watch]</span>
 
 
<span style="font-size:large;">You can refer more info [https://github.com/intel/xpumanager/blob/master/SMI_README.md Soc-Watch]</span>
 

Latest revision as of 05:22, 26 December 2024

OpenVINO

OpenVINO™ toolkit: An open-source solution for optimizing and deploying AI inference, in domains such as computer vision, automatic speech recognition, natural language processing, recommendation systems, and more. With its plug-in architecture, OpenVINO allows developers to write once and deploy anywhere.  We are proud to announce the release of OpenVINO 2023.0 introducing a range of new features, improvements, and deprecations aimed at enhancing the developer experience.

 

  • Enables the use of models trained with popular frameworks, such as TensorFlow and PyTorch.  
  • Optimizes inference of deep learning models by applying model retraining or fine-tuning, like post-training quantization.  
  • Supports heterogeneous execution across Intel hardware, using a common API for the Intel CPU, Intel Integrated Graphics, Intel Discrete Graphics, and other commonly used accelerators. 

 

OpenVINO Runtime SDK

2023.3

Overall updates

  • Proxy & hetero plugins have been migrated to API 2.0, providing enhanced compatibility and stability. 
  • Symbolic shape inference preview is now available, leading to improved performance for LLMs. 
  • OpenVINO's graph representation has been upgraded to opset12, introducing a new set of operations that offer enhanced functionality and optimizations.

 

2024.3

Overall updates

  • NPU Device Plugin support.

 

 

Applications

OpenVINO 2023.3 for Edge AI SDK 3.0.0

Vision Application

Application Model AI Comput Units
Object Detection yolov3 (tf) CPU, iGPU, dGPU
Person Detection person-detection-retail-0013 CPU, iGPU, dGPU
Face Detection faceboxes-pytorch CPU, iGPU, dGPU
Pose Estimation human-pose-estimation-0001 CPU, iGPU, dGPU

GenAI Application

Application SLM Model ( Small Language Model ) AI Comput Units
SLM Chatbot Llama-2-7b CPU, iGPU

 

OpenVINO 2024.3 for Edge AI SDK 3.1.0

Vision Application

Application Model AI Comput Units
Object Detection yolox CPU, iGPU, NPU
Person Detection person-detection-retail-0013 CPU, iGPU, NPU
Face Detection faceboxes-pytorch CPU, iGPU, NPU
Pose Estimation human-pose-estimation-0001 CPU, iGPU, NPU


GenAI Application

Application SLM Model ( Small Language Model ) AI Comput Units
SLM Chatbot  Llama-3.1-8b-instruct CPU, iGPU

Benchmark

You can refer the link to test the performance with the benchmark_app

benchmark_app

The OpenVINO benchmark setup includes a single system with OpenVINO™, as well as the benchmark application installed. It measures the time spent on actual inference (excluding any pre or post processing) and then reports on the inferences per second (or Frames Per Second).

You can refer : link

 

Examples

cd /opt/Advantech/EdgeAISuite/Intel_Standard/benchmark  

<CPU>
$ ./benchmark_app -m ../model/mobilenet-ssd/FP16/mobilenet-ssd.xml -i car.png -t 8 -d CPU  

<iGPU>
$ ./benchmark_app -m ../model/mobilenet-ssd/FP16/mobilenet-ssd.xml -i car.png -t 8 -d GPU

<NPU>
$ ./benchmark_app -m ../model/mobilenet-ssd/FP16/mobilenet-ssd.xml -i car.png -t 8 -d NPU  


Utility

XPU-SIM

Intel® XPU Manager ( offical link ) is a free and open-source solution for local and remote monitoring and managing Intel® Data Center GPUs. It is designed to simplify administration, maximize reliability and uptime, and improve utilization.

Intel XPU System Management Interface (SMI) A command line utility for local XPU management.

Key features

Monitoring GPU utilization and health, getting job-level statistics, running comprehensive diagnostics, controlling power, policy management, firmware updating, and more.

Show GPU basic information , sample below

Xpu-smi.png

You can refer more info xpu-smi

intel-gpu-tools

Intel GPU Tools is a collection of tools for development and testing of the Intel DRM driver. There are many macro-level test suites that get used against the driver, including xtest, rendercheck, piglit, and oglconform, but failures from those can be difficult to track down to kernel changes, and many require complicated build procedures or specific testing environments to get useful results. Therefore, Intel GPU Tools includes low-level tools and tests specifically for development and testing of the Intel DRM Driver.

 

Show GPU basic information , sample below

$ sudo intel_gpu_top

AFE360 intel gpu top.png

You can refer more info intel-gpu-tools

 

SoC-Watch

Use Intel® SoC Watch to perform energy analysis on a Linux*, Windows*, or Android* system running on Intel® architecture. Study power consumption in the system and identify behaviors that waste energy. Intel SoC Watch generates a summary text report or you can import results into Intel® VTune™ Profiler.

 

Show NPU basic information , sample below

$ sudo ./socwatch -t 1 -f npu

AFE360 socwatch.png

You can refer more info Soc-Watch