microsoft / DirectML
⚠️ DirectML is in maintenance mode ⚠️ DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers, including all DirectX 12-capable GPUs from vendors such as AMD, Intel, NVIDIA, and Qualcomm…
☆2,543 Updated this week
Alternatives and similar repositories for DirectML
Users who are interested in DirectML are comparing it to the libraries listed below.
- Fork of TensorFlow accelerated by DirectML ☆469 Updated last year
- DirectML PluggableDevice plugin for TensorFlow 2 ☆198 Updated 10 months ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform ☆2,002 Updated this week
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs. ☆2,224 Updated this week
- Intel® NPU Acceleration Library ☆700 Updated 8 months ago
- AMD ROCm™ Software - GitHub Home ☆6,058 Updated this week
- AMD Ryzen™ AI Software includes the tools and runtime libraries for optimizing and deploying AI inference on AMD Ryzen™ AI powered PCs. ☆727 Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆1,187 Updated this week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). … ☆2,418 Updated last week
- DLPrimitives/OpenCL out of tree backend for pytorch ☆383 Updated last month
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆247 Updated this week
- Samples and Tools for Windows ML. ☆1,115 Updated 5 months ago
- ONNXMLTools enables conversion of models to ONNX ☆1,133 Updated this week
- Intel® Extension for TensorFlow* ☆350 Updated 2 months ago
- An Open Source Machine Learning Framework for Everyone ☆1,149 Updated 5 months ago
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it ☆674 Updated 3 weeks ago
- Build and run containers leveraging NVIDIA GPUs ☆3,968 Updated this week
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, … ☆2,565 Updated this week
- Examples for using ONNX Runtime for machine learning inferencing. ☆1,578 Updated this week
- CUDA Python: Performance meets Productivity ☆3,126 Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform ☆507 Updated last month
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,694 Updated 3 weeks ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,917 Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code ☆645 Updated this week
- TensorFlow ROCm port ☆699 Updated 2 weeks ago
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision. ☆2,630 Updated last month
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX ☆2,507 Updated 4 months ago
- OpenCL SDK ☆736 Updated 4 months ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H… ☆3,055 Updated last week
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm ☆704 Updated this week