ROCm / mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
☆28Updated 5 years ago
Alternatives and similar repositories for mxnet:
Users that are interested in mxnet are comparing it to the libraries listed below
- MIOpenGEMM is now deprecated☆62Updated last year
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 10 months ago
- Python Binding to NVRTC☆79Updated 5 months ago
- Python bindings for libNVVM☆37Updated 10 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆32Updated 8 years ago
- ☆14Updated 6 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- ArrayFire's Machine Learning Library.☆103Updated 6 years ago
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆168Updated this week
- An ONNX backend using PlaidML☆28Updated 6 years ago
- nGraph™ Backend for ONNX☆42Updated 2 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆135Updated 7 years ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- OpenCL compilation with clang compiler.☆27Updated last week
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support☆91Updated 9 years ago
- Documentation for StreamExecutor open source proposal☆83Updated 8 years ago
- Torch is a scientific computing framework with wide support for machine learning algorithms. It is easy to use and efficient, thanks to a…☆37Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 7 years ago
- Boda: A C++ Framework for Efficient Experiments in Computer Vision☆64Updated 5 years ago
- ONNX model format support for Apache MXNet☆96Updated 6 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated last year
- Symbolic differentiation engine for optimization-based machine learning models.☆42Updated 7 years ago
- Bridge to connect nGraph with TensorFlow☆52Updated 2 years ago
- ROCm Device Libraries☆97Updated 10 months ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆297Updated 6 years ago
- This fork of Theano/Theano is dedicated to improve its performance on CPU device, in particular Intel® Xeon® processors and Intel® Xeon P…☆58Updated 2 years ago