OpenVINO™ Execution Provider + Model Caching = Better First Inference Latency for your ONNX Models

Developers, like yourself, can now leverage model caching through the OpenVINO Execution Provider for ONNX Runtime, a product that accelerates inferencing of ONNX models using ONNX Runtime API’s while using OpenVINO™ toolkit as a backend. With the OpenVINO Execution Provider, ONNX Runtime delivers better inferencing performance on the same hardware compared to generic acceleration on Intel® CPU, GPU, and VPU.

OpenVINO™ Execution Provider + Model Caching = Better First Inference Latency for your ONNX Models

Laisser un commentaire Annuler la réponse

Articles récents

Neural networks news

Intel NN News

Archives

Catégories