openvinotoolkit / openvinoLinks
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
☆8,531Updated last week
Alternatives and similar repositories for openvino
Users that are interested in openvino are comparing it to the libraries listed below
Sorting:
- Pre-trained Deep Learning models and demos (high quality and extremely fast)☆4,255Updated 2 weeks ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆11,828Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,830Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆17,136Updated this week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,795Updated this week
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…☆2,449Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,435Updated this week
- Transformer related optimization, including BERT, GPT☆6,231Updated last year
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,055Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,060Updated this week
- Simplify your onnx model☆4,114Updated 10 months ago
- Development repository for the Triton language and compiler☆16,114Updated this week
- Open standard for machine learning interoperability☆19,202Updated this week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,364Updated this week
- ONNX-TensorRT: TensorRT backend for ONNX☆3,103Updated 3 weeks ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆9,443Updated this week
- Tutorials for creating and using ONNX models☆3,565Updated 11 months ago
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,327Updated this week
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,441Updated last week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,900Updated this week
- An easy to use PyTorch to TensorRT converter☆4,773Updated 10 months ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,454Updated last week
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,533Updated last month
- CUDA Templates for Linear Algebra Subroutines☆7,808Updated this week
- Serve, optimize and scale PyTorch models in production☆4,339Updated last week
- Compiler for Neural Network hardware accelerators☆3,312Updated last year
- Build and run containers leveraging NVIDIA GPUs☆3,410Updated this week
- DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for comm…☆2,484Updated 3 weeks ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆32,717Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,655Updated 3 months ago