plaidml / onnx-plaidml
An ONNX backend using PlaidML
☆28Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for onnx-plaidml
- nGraph™ Backend for ONNX☆42Updated last year
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago
- ONNX Parser is a tool that automatically generates openvx inference code (CNN) from onnx binary model files.☆17Updated 5 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute …☆12Updated 3 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 2 years ago
- DLPack for Tensorflow☆36Updated 4 years ago
- XLA integration of Open Neural Network Exchange (ONNX)☆19Updated 6 years ago
- ☆14Updated 5 years ago
- Scoreboard for ONNX Backend Compatibility☆27Updated this week
- PyProf2: PyTorch Profiling tool☆83Updated 4 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆40Updated 6 years ago
- Accelerating DNN Convolutional Layers with Micro-batches☆64Updated 4 years ago
- ☆11Updated 3 years ago
- ☆102Updated 5 years ago
- Python bindings for libNVVM☆37Updated 10 years ago
- CNNs in Halide☆23Updated 9 years ago
- An experimental ahead of time compiler for Relay.☆51Updated 4 years ago
- Neural Network Exchange Format registry☆12Updated last year
- ☆12Updated 3 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆63Updated 6 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- Python Binding to NVRTC☆79Updated last month
- A tracing JIT compiler for PyTorch☆12Updated 2 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- Notes and artifacts from the ONNX steering committee☆25Updated last week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆28Updated 4 years ago
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- A prototype implementation of AllReduce collective communication routine.☆20Updated 6 years ago