Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning
☆1,121 · Aug 4, 2019 · Updated 6 years ago
Alternatives and similar repositories for mshadow
Users who are interested in mshadow are comparing it to the libraries listed below.
- move forward to https://github.com/dmlc/mxnet ☆1,026 · Sep 29, 2015 · Updated 10 years ago
- A common bricks library for building scalable and portable distributed machine learning. ☆878 · Updated this week
- ☆1,655 · Sep 11, 2018 · Updated 7 years ago
- Reliable Allreduce and Broadcast Interface for distributed machine learning ☆514 · Nov 5, 2020 · Updated 5 years ago
- Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bind… ☆705 · Nov 1, 2018 · Updated 7 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines ☆2,016 · Oct 4, 2018 · Updated 7 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli… ☆20,829 · Oct 25, 2023 · Updated 2 years ago
- A lightweight parameter server interface ☆1,560 · Jan 11, 2023 · Updated 3 years ago
- Acceleration package for neural networks on multi-core CPUs ☆1,701 · Jun 11, 2024 · Updated last year
- Deprecated ☆337 · Mar 14, 2017 · Updated 8 years ago
- Intel® Nervana™ reference deep learning framework committed to best performance on all hardware ☆3,869 · Dec 23, 2020 · Updated 5 years ago
- moved to https://github.com/dmlc/ps-lite ☆648 · Sep 8, 2015 · Updated 10 years ago
- NumPy interface with mixed backend execution ☆1,101 · Feb 19, 2018 · Updated 8 years ago
- header only, dependency-free deep learning framework in C++14 ☆6,017 · Apr 17, 2022 · Updated 3 years ago
- Easy benchmarking of all publicly accessible implementations of convnets ☆2,689 · Jun 9, 2017 · Updated 8 years ago
- DyNet: The Dynamic Neural Network Toolkit ☆3,435 · Dec 1, 2023 · Updated 2 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets ☆307 · Aug 8, 2017 · Updated 8 years ago
- MultiGPU enabled image generative models (GAN and DCGAN) ☆207 · Oct 11, 2020 · Updated 5 years ago
- Standalone TensorBoard for visualizing in deep learning ☆371 · Mar 24, 2020 · Updated 5 years ago
- Computation Graph Toolkit ☆634 · Apr 10, 2018 · Updated 7 years ago
- Low-precision matrix multiplication ☆1,831 · Jan 29, 2024 · Updated 2 years ago
- LSTM implementation on Caffe ☆494 · Aug 30, 2016 · Updated 9 years ago
- Parameter server framework for distributed machine learning ☆779 · Jan 20, 2019 · Updated 7 years ago
- Purified Purine. ☆256 · May 26, 2015 · Updated 10 years ago
- Caffe2 is a lightweight, modular, and scalable deep learning framework. ☆8,397 · Feb 7, 2023 · Updated 3 years ago
- ATen: A TENsor library for C++11 ☆717 · Nov 20, 2019 · Updated 6 years ago
- Intel® Deep Learning Framework ☆313 · Jun 16, 2016 · Updated 9 years ago
- Fast parallel CTC. ☆4,075 · Mar 4, 2024 · Updated last year
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,820 · Oct 9, 2023 · Updated 2 years ago
- C++ interface for mxnet ☆115 · Mar 22, 2017 · Updated 8 years ago
- Multi-core implementation of Regularized Greedy Forest ☆466 · Jul 14, 2018 · Updated 7 years ago
- common in-memory tensor structure ☆1,169 · Jan 26, 2026 · Updated last month
- ☆127 · Jun 23, 2016 · Updated 9 years ago
- Open Machine Learning Compiler Framework ☆13,142 · Updated this week
- THE Deep Learning Benchmarks ☆351 · Nov 2, 2016 · Updated 9 years ago
- A domain specific language to express machine learning workloads. ☆1,765 · Apr 28, 2023 · Updated 2 years ago
- Stochastic Gradient Boosted Decision Trees as Standalone, TMVAPlugin and Python-Interface ☆248 · Jul 19, 2020 · Updated 5 years ago
- oneAPI Deep Neural Network Library (oneDNN) ☆3,956 · Updated this week
- http://torch.ch ☆9,105 · Mar 31, 2025 · Updated 11 months ago