ROCm / mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
☆28Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for mxnet
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 6 months ago
- MIOpenGEMM is now deprecated☆61Updated last year
- nGraph™ Backend for ONNX☆42Updated last year
- ☆30Updated 7 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆49Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- ☆14Updated 5 years ago
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆163Updated this week
- Fork of magma to include more BLAS☆28Updated 7 years ago
- ☆9Updated 5 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- Automatic Differentiation for Tensor Algebras☆28Updated 6 years ago
- This fork of Theano/Theano is dedicated to improve its performance on CPU device, in particular Intel® Xeon® processors and Intel® Xeon P…☆59Updated 2 years ago
- A thin wrapper around miOpen and cuDNN☆38Updated last year
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 6 years ago
- Python Binding to NVRTC☆79Updated last month
- Python bindings for libNVVM☆37Updated 10 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 7 years ago
- The repo is obsolete. Use at your own risk.☆12Updated 6 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 7 years ago
- Documentation for StreamExecutor open source proposal☆83Updated 8 years ago
- Torch is a scientific computing framework with wide support for machine learning algorithms. It is easy to use and efficient, thanks to a…☆38Updated 2 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆109Updated last year