microsoft / DirectML
DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers, including all DirectX 12-capable GPUs from vendors such as AMD, Intel, NVIDIA, and Qualcomm.
☆2,382Updated 3 months ago
Alternatives and similar repositories for DirectML:
Users that are interested in DirectML are comparing it to the libraries listed below
- Fork of TensorFlow accelerated by DirectML☆464Updated 5 months ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,778Updated this week
- DirectML PluggableDevice plugin for TensorFlow 2☆193Updated 2 weeks ago
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.☆1,815Updated this week
- Intel® NPU Acceleration Library☆642Updated 2 months ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,619Updated 3 months ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆447Updated this week
- Generative AI extensions for onnxruntime☆645Updated this week
- Intel® Extension for TensorFlow*☆332Updated last week
- Simple, safe way to store and distribute tensors☆3,161Updated last week
- AMD ROCm™ Software - GitHub Home☆5,058Updated this week
- DLPrimitives/OpenCL out of tree backend for pytorch☆327Updated 6 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆2,125Updated last week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆15,947Updated this week
- CUDA Python: Performance meets Productivity☆1,120Updated this week
- ONNXMLTools enables conversion of models to ONNX☆1,055Updated 2 months ago
- Examples for using ONNX Runtime for machine learning inferencing.☆1,324Updated last month
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,381Updated last month
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…☆2,349Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,017Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,746Updated this week
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference☆7,952Updated this week
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆914Updated this week
- ☆486Updated last week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆1,979Updated this week
- AMD's Machine Intelligence Library☆1,125Updated this week
- OpenCL SDK☆624Updated last month
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs…☆2,260Updated this week
- PlaidML is a framework for making deep learning work everywhere.☆4,588Updated last year
- SHARK Studio -- Web UI for SHARK+IREE High Performance Machine Learning Distribution☆1,436Updated 4 months ago