ROCm / mxnetLinks
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
☆29Updated 6 years ago
Alternatives and similar repositories for mxnet
Users that are interested in mxnet are comparing it to the libraries listed below
Sorting:
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated last year
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆174Updated 2 weeks ago
- MIOpenGEMM is now deprecated☆61Updated 2 years ago
- Easy to run kernels using OpenCL☆187Updated 9 months ago
- OpenCL Torch☆146Updated 7 years ago
- Python bindings for libNVVM☆38Updated 11 years ago
- ArrayFire's Machine Learning Library.☆105Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL☆137Updated 8 years ago
- Symbolic Expression and Statement Module for new DSLs☆205Updated 5 years ago
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Fork of magma to include more BLAS☆28Updated 9 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆299Updated 7 years ago
- Original Python version of Intel® Nervana™ Graph☆214Updated 3 years ago
- clang with OpenMP 3.1 and some elements of OpenMP 4.0 support☆90Updated 10 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated 8 months ago
- Bridge to connect nGraph with TensorFlow☆52Updated 3 years ago
- Intel(R) Machine Learning Scaling Library is a library providing an efficient implementation of communication patterns used in deep learn…☆108Updated 3 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark☆34Updated 9 years ago
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Updated 8 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute …☆14Updated 4 years ago
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆438Updated 5 years ago
- ☆14Updated 6 years ago
- HIP back-end for Thrust that has been replaced by rocThrust☆28Updated 2 years ago
- CL Offline Compiler : Compile OpenCL kernels to HSAIL☆50Updated 8 years ago
- nGraph™ Backend for ONNX☆42Updated 3 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆247Updated last week
- CLTune: An automatic OpenCL & CUDA kernel tuner☆184Updated 3 years ago
- The NNEF Tools repository contains tools to generate and consume NNEF documents☆232Updated last month
- Python wrappers for the NVIDIA cuDNN libraries☆142Updated 8 years ago