Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
☆1,121Aug 4, 2019Updated 6 years ago
Alternatives and similar repositories for mshadow
Users that are interested in mshadow are comparing it to the libraries listed below
Sorting:
- move forward to https://github.com/dmlc/mxnet☆1,026Sep 29, 2015Updated 10 years ago
- A common bricks library for building scalable and portable distributed machine learning.☆878Mar 9, 2026Updated last week
- ☆1,653Sep 11, 2018Updated 7 years ago
- Reliable Allreduce and Broadcast Interface for distributed machine learning☆512Nov 5, 2020Updated 5 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind…☆706Nov 1, 2018Updated 7 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,823Oct 25, 2023Updated 2 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,014Oct 4, 2018Updated 7 years ago
- A lightweight parameter server interface☆1,561Mar 2, 2026Updated 2 weeks ago
- Deprecated☆337Mar 14, 2017Updated 9 years ago
- moved to https://github.com/dmlc/ps-lite☆648Sep 8, 2015Updated 10 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,702Jun 11, 2024Updated last year
- Intel® Nervana™ reference deep learning framework committed to best performance on all hardware☆3,870Dec 23, 2020Updated 5 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆307Aug 8, 2017Updated 8 years ago
- NumPy interface with mixed backend execution☆1,101Feb 19, 2018Updated 8 years ago
- Standalone TensorBoard for visualizing in deep learning☆371Mar 24, 2020Updated 5 years ago
- header only, dependency-free deep learning framework in C++14☆6,020Apr 17, 2022Updated 3 years ago
- Computation Graph Toolkit☆634Apr 10, 2018Updated 7 years ago
- Easy benchmarking of all publicly accessible implementations of convnets☆2,688Jun 9, 2017Updated 8 years ago
- MultiGPU enabled image generative models (GAN and DCGAN)☆207Oct 11, 2020Updated 5 years ago
- DyNet: The Dynamic Neural Network Toolkit☆3,435Dec 1, 2023Updated 2 years ago
- Purified Purine.☆256May 26, 2015Updated 10 years ago
- ☆127Jun 23, 2016Updated 9 years ago
- Low-precision matrix multiplication☆1,832Jan 29, 2024Updated 2 years ago
- Caffe2 is a lightweight, modular, and scalable deep learning framework.☆8,396Feb 7, 2023Updated 3 years ago
- LSTM implementation on Caffe☆494Aug 30, 2016Updated 9 years ago
- Parameter server framework for distributed machine learning☆779Jan 20, 2019Updated 7 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Nov 1, 2016Updated 9 years ago
- C++ interface for mxnet☆115Mar 22, 2017Updated 9 years ago
- Multi-core implementation of Regularized Greedy Forest☆466Jul 14, 2018Updated 7 years ago
- common in-memory tensor structure☆1,177Jan 26, 2026Updated last month
- Intel® Deep Learning Framework☆313Jun 16, 2016Updated 9 years ago
- ATen: A TENsor library for C++11☆717Nov 20, 2019Updated 6 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,821Oct 9, 2023Updated 2 years ago
- Collective communications library with various primitives for multi-machine training.☆1,407Updated this week
- Open Machine Learning Compiler Framework☆13,197Updated this week
- Stochastic Gradient Boosted Decision Trees as Standalone, TMVAPlugin and Python-Interface☆248Jul 19, 2020Updated 5 years ago
- Notebooks for MXNet☆608Jan 28, 2018Updated 8 years ago
- A domain specific language to express machine learning workloads.☆1,764Apr 28, 2023Updated 2 years ago
- Marvin: A Minimalist GPU-only N-Dimensional ConvNets Framework☆429Mar 21, 2018Updated 8 years ago