openvinotoolkit / openvinoLinks
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
☆8,424Updated this week
Alternatives and similar repositories for openvino
Users that are interested in openvino are comparing it to the libraries listed below
Sorting:
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆9,332Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆16,917Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,810Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆11,698Updated 3 weeks ago
- 📚 Jupyter notebook tutorials for OpenVINO™☆2,826Updated this week
- Open standard for machine learning interoperability☆19,098Updated this week
- Pre-trained Deep Learning models and demos (high quality and extremely fast)☆4,233Updated last week
- Transformer related optimization, including BERT, GPT☆6,200Updated last year
- Simplify your onnx model☆4,096Updated 9 months ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,781Updated this week
- Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any…☆13,877Updated this week
- ONNX-TensorRT: TensorRT backend for ONNX☆3,091Updated last month
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…☆2,426Updated this week
- Fast and memory-efficient exact attention☆17,846Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,046Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,871Updated this week
- Development repository for the Triton language and compiler☆15,844Updated this week
- Serve, optimize and scale PyTorch models in production☆4,334Updated this week
- An easy to use PyTorch to TensorRT converter☆4,755Updated 9 months ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,429Updated this week
- CUDA Templates for Linear Algebra Subroutines☆7,688Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,370Updated this week
- A scalable inference server for models optimized with OpenVINO™☆739Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,181Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆35,530Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆14,800Updated this week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆2,938Updated this week
- Compiler for Neural Network hardware accelerators☆3,301Updated last year
- Build and run Docker containers leveraging NVIDIA GPUs☆17,388Updated last year
- Visualizer for neural network, deep learning and machine learning models☆30,450Updated this week