plaidml / onnx-plaidml
An ONNX backend using PlaidML
☆28Updated 6 years ago
Alternatives and similar repositories for onnx-plaidml:
Users that are interested in onnx-plaidml are comparing it to the libraries listed below
- nGraph™ Backend for ONNX☆42Updated 2 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- ☆14Updated 5 years ago
- ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files.☆18Updated 6 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Bugfixing fork of Python bindings for the NVIDIA GPU Management Library☆51Updated 7 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- Scoreboard for ONNX Backend Compatibility☆28Updated this week
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 4 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆40Updated 6 years ago
- This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.☆42Updated last year
- CNNs in Halide☆23Updated 9 years ago
- ☆102Updated 5 years ago
- MXNet - nGraph integration☆34Updated 3 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- A prototype implementation of AllReduce collective communication routine.☆19Updated 6 years ago
- DLPack for Tensorflow☆36Updated 4 years ago
- Distributed Learning by Pair-Wise Averaging☆53Updated 7 years ago
- Python Binding to NVRTC☆79Updated 5 months ago
- ☆26Updated 2 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 2 years ago
- MLModelScope is an open source, extensible, and customizable platform to facilitate evaluation and measurement of ML models within AI pip…☆50Updated 6 months ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- Fast binary matrix product on CPU☆10Updated 9 years ago
- Python bindings for libNVVM☆37Updated 10 years ago
- Torch is a scientific computing framework with wide support for machine learning algorithms. It is easy to use and efficient, thanks to a…☆37Updated 2 years ago
- Tools and extensions for CUDA profiling☆65Updated 5 years ago
- ☆19Updated last year