ROCm / mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
☆28 Updated 5 years ago
Alternatives and similar repositories for mxnet:
Users who are interested in mxnet are comparing it to the libraries listed below.
- (Deprecated) hipCaffe: the HIP port of Caffe ☆124 Updated 9 months ago
- MIOpenGEMM is now deprecated ☆62 Updated last year
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 Updated 7 years ago
- ONNX model format support for Apache MXNet ☆96 Updated 5 years ago
- ArrayFire's Machine Learning Library. ☆103 Updated 6 years ago
- ☆14 Updated 5 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL ☆50 Updated 7 years ago
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN. ☆167 Updated this week
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark. ☆35 Updated last year
- The repo is obsolete. Use at your own risk. ☆12 Updated 6 years ago
- nGraph™ Backend for ONNX ☆42 Updated 2 years ago
- A visualization tool to show a TensorFlow's graph like TensorBoard ☆45 Updated 3 years ago
- Python wrappers for the NVIDIA cuDNN libraries ☆140 Updated 7 years ago
- Library to manipulate tensors on the GPU. ☆190 Updated last year
- Python Binding to NVRTC ☆79 Updated 4 months ago
- An ONNX backend using PlaidML ☆28 Updated 6 years ago
- Symbolic Expression and Statement Module for new DSLs ☆205 Updated 4 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆54 Updated 7 years ago
- MXNet Model Serving ☆25 Updated 7 years ago
- Python bindings for libNVVM ☆37 Updated 10 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision ☆63 Updated 5 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture ☆134 Updated 7 years ago
- OpenCL Torch ☆147 Updated 6 years ago
- Documentation for StreamExecutor open source proposal ☆83 Updated 8 years ago
- Stretching GPU performance for GEMMs and tensor contractions. ☆233 Updated this week
- Kernel Fusion and Runtime Compilation Based on NNVM ☆70 Updated 8 years ago
- Benchmarking State-of-the-Art Deep Learning Software Tools ☆170 Updated 7 years ago
- BLAS OpenCL implementation. ☆15 Updated 9 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆296 Updated 6 years ago
- The repo to host all the web data including images for documents in dmlc projects. ☆84 Updated 2 years ago