intel / ai-reference-models
Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUs
☆708 · Updated this week
Alternatives and similar repositories for ai-reference-models:
Users who are interested in ai-reference-models compare it to the libraries listed below.
- A scalable inference server for models optimized with OpenVINO™ ☆722 · Updated this week
- Inference Model Manager for Kubernetes ☆46 · Updated 6 years ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools ☆460 · Updated this week
- Computation using data flow graphs for scalable machine learning ☆67 · Updated this week
- Intel® Extension for TensorFlow* ☆336 · Updated last month
- A multi-user, distributed computing environment for running DL model training experiments on Intel® Xeon® Scalable processor-based system… ☆392 · Updated 11 months ago
- Reference implementations of MLPerf™ inference benchmarks ☆1,358 · Updated this week
- ONNX Optimizer ☆696 · Updated 3 weeks ago
- oneAPI Collective Communications Library (oneCCL) ☆232 · Updated 3 weeks ago
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R… ☆2,380 · Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform ☆1,836 · Updated this week
- A performant and modular runtime for TensorFlow ☆759 · Updated last week
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime ☆260 · Updated this week
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆1,002 · Updated this week
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆981 · Updated 7 months ago
- To make it easy to benchmark AI accelerators ☆182 · Updated 2 years ago
- TensorFlow/TensorRT integration ☆741 · Updated last year
- Reference models for Intel(R) Gaudi(R) AI Accelerator ☆162 · Updated 2 weeks ago
- oneAPI Deep Neural Network Library (oneDNN) ☆3,774 · Updated this week
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆843 · Updated last week
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆472 · Updated this week
- nGraph has moved to OpenVINO ☆1,349 · Updated 4 years ago
- ☆57 · Updated 4 years ago
- Benchmark Suite for Deep Learning ☆264 · Updated 2 months ago
- ☆410 · Updated last week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆383 · Updated this week
- ☆105 · Updated 2 weeks ago
- oneCCL Bindings for Pytorch* ☆94 · Updated 2 weeks ago
- AMD's graph optimization engine. ☆215 · Updated this week
- Examples for using ONNX Runtime for model training. ☆332 · Updated 6 months ago