slyalin / openvino_devtools
Tools for easier OpenVINO development/debugging
☆10 Updated 6 months ago
Alternatives and similar repositories for openvino_devtools
Users who are interested in openvino_devtools compare it to the libraries listed below.
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime ☆414 Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools ☆528 Updated this week
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆1,115 Updated last week
- OpenVINO Tokenizers extension ☆46 Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform ☆2,002 Updated this week
- OpenAI Triton backend for Intel® GPUs ☆224 Updated this week
- Intel® NPU Acceleration Library ☆701 Updated 8 months ago
- Repository for OpenVINO's extra modules ☆161 Updated last week
- A scalable inference server for models optimized with OpenVINO™ ☆816 Updated this week
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, … ☆2,570 Updated this week
- Generative AI extensions for onnxruntime ☆930 Updated last week
- Intel® Tensor Processing Primitives extension for Pytorch* ☆17 Updated this week
- ☆21 Updated last year
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs ☆63 Updated this week
- OpenVINO™ Explainable AI (XAI) Toolkit: Visual Explanation for OpenVINO Models ☆36 Updated 4 months ago
- ☆148 Updated last month
- OpenVINO Intel NPU Compiler ☆77 Updated last week
- ☆61 Updated last year
- With OpenVINO Test Drive, users can run large language models (LLMs) and models trained by Intel Geti on their devices, including AI PCs … ☆35 Updated last month
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse… ☆1,848 Updated this week
- ☆28 Updated 2 years ago
- Common utilities for ONNX converters ☆291 Updated last month
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it ☆673 Updated last month
- A parser, editor and profiler tool for ONNX models. ☆475 Updated 2 months ago
- Universal cross-platform tokenizers binding to HF and sentencepiece ☆444 Updated 5 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆508 Updated last week
- 🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza… ☆815 Updated this week
- Software Development Kit (SDK) for the Geti™ platform for Computer Vision AI model training. ☆123 Updated this week
- ☆436 Updated 4 months ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime ☆434 Updated last month