plaidml / onnx-plaidmlLinks
An ONNX backend using PlaidML
☆28Updated 7 years ago
Alternatives and similar repositories for onnx-plaidml
Users that are interested in onnx-plaidml are comparing it to the libraries listed below
Sorting:
- nGraph™ Backend for ONNX☆42Updated 2 years ago
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆173Updated 2 weeks ago
- Scoreboard for ONNX Backend Compatibility☆29Updated this week
- Bridge to connect nGraph with TensorFlow☆52Updated 2 years ago
- The NNEF Tools repository contains tools to generate and consume NNEF documents☆226Updated this week
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- Bugfixing fork of Python bindings for the NVIDIA GPU Management Library☆51Updated 8 years ago
- A portable high-level API with CUDA or OpenCL back-end☆55Updated 7 years ago
- ☆102Updated 5 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 5 years ago
- Benchmarking Keras application network performance☆52Updated 6 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 5 years ago
- Accelerating DNN Convolutional Layers with Micro-batches☆63Updated 5 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated 4 months ago
- Deep Learning Benchmarking Suite☆130Updated 2 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- ☆14Updated 6 years ago
- This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.☆42Updated 2 years ago
- Python Binding to NVRTC☆79Updated 11 months ago
- Test data for DALI project☆43Updated 3 weeks ago
- Notes and artifacts from the ONNX steering committee☆26Updated last week
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.☆394Updated 2 years ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated last year
- ArrayFire's Machine Learning Library.☆105Updated 6 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 8 years ago
- CNNs in Halide☆23Updated 9 years ago
- A Raspberry Pi GPU-accelerated implementation of the GEMM matrix-multiply function☆88Updated 11 years ago