ROCm / mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
☆28 Updated 5 years ago
Alternatives and similar repositories for mxnet:
Users who are interested in mxnet are comparing it to the libraries listed below.
- (Deprecated) hipCaffe: the HIP port of Caffe ☆124 Updated 11 months ago
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN. ☆170 Updated 3 weeks ago
- MIOpenGEMM is now deprecated ☆62 Updated last year
- nGraph™ Backend for ONNX ☆42 Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark. ☆35 Updated last year
- An ONNX backend using PlaidML ☆28 Updated 6 years ago
- ☆14 Updated 6 years ago
- CL Offline Compiler: Compile OpenCL kernels to HSAIL ☆50 Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 Updated 7 years ago
- Fork of magma to include more BLAS ☆28 Updated 8 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM ☆70 Updated 8 years ago
- Python Binding to NVRTC ☆79 Updated 6 months ago
- Python bindings for libNVVM ☆37 Updated 11 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 5 years ago
- The repo to host all the web data including images for documents in dmlc projects. ☆84 Updated 2 years ago
- Torch is a scientific computing framework with wide support for machine learning algorithms. It is easy to use and efficient, thanks to a… ☆37 Updated 2 years ago
- Reference workloads for modern deep learning methods. ☆73 Updated 2 years ago
- This fork of Theano/Theano is dedicated to improve its performance on CPU device, in particular Intel® Xeon® processors and Intel® Xeon P… ☆58 Updated 2 years ago
- ArrayFire's Machine Learning Library. ☆104 Updated 6 years ago
- A visualization tool to show a TensorFlow's graph like TensorBoard ☆44 Updated 3 years ago
- Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning ☆33 Updated 8 years ago
- BLAS OpenCL implementation. ☆15 Updated 10 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 Updated 7 years ago
- Distributed Learning by Pair-Wise Averaging ☆52 Updated 7 years ago
- ☆30 Updated 7 years ago
- Python wrappers for the NVIDIA cuDNN libraries ☆140 Updated 7 years ago
- Automatic Differentiation for Tensor Algebras ☆28 Updated 6 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆297 Updated 6 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn… ☆109 Updated 2 years ago
- CMake modules used within the ROCm libraries ☆65 Updated this week