tqchen / tinyflow
Tutorial code on how to build your own Deep Learning System in 2k Lines
☆2,002Updated 5 years ago
Related projects: ⓘ
- ☆1,655Updated 6 years ago
- DyNet: The Dynamic Neural Network Toolkit☆3,418Updated 9 months ago
- A lightweight parameter server interface☆1,531Updated last year
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,107Updated 5 years ago
- Easy benchmarking of all publicly accessible implementations of convnets☆2,673Updated 7 years ago
- Acceleration package for neural networks on multi-core CPUs☆1,671Updated 3 months ago
- Assignment 1: automatic differentiation☆472Updated 5 years ago
- A domain specific language to express machine learning workloads.☆1,758Updated last year
- Low-precision matrix multiplication☆1,772Updated 7 months ago
- A common bricks library for building scalable and portable distributed machine learning.☆861Updated 3 months ago
- Benchmarking Deep Learning operations on different hardware☆1,065Updated 3 years ago
- Collective communications library with various primitives for multi-machine training.☆1,192Updated 2 months ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,783Updated last year
- Deep learning with dynamic computation graphs in TensorFlow☆1,825Updated 3 years ago
- Open-source implementation of Google Vizier for hyper parameters tuning☆1,542Updated 4 years ago
- FeatherCNN is a high performance inference engine for convolutional neural networks.☆1,208Updated 4 years ago
- A curated list of MXNet examples, tutorials and blogs.☆837Updated 2 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,579Updated this week
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,518Updated 5 years ago
- A high performance and generic framework for distributed DNN training☆3,616Updated 11 months ago
- ☆562Updated 6 years ago
- NumPy interface with mixed backend execution☆1,109Updated 6 years ago
- Dive into Deep Learning Compiler☆640Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆2,313Updated last year
- Tutorials and implementations for "Self-normalizing networks"☆1,580Updated 2 years ago
- Reliable Allreduce and Broadcast Interface for distributed machine learning☆505Updated 3 years ago
- Minimal numerical computation library with TensorFlow APIs☆301Updated 5 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit☆3,744Updated 3 years ago
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)☆2,104Updated 2 years ago
- It is open source ebook about TensorFlow kernel and implementation mechanism.☆2,893Updated last year