stanford-futuredata / dawn-bench-models
☆36Updated 5 years ago
Related projects: ⓘ
- ☆47Updated 4 years ago
- implement distributed machine learning with Pytorch + OpenMPI☆52Updated 5 years ago
- Path-Level Network Transformation for Efficient Architecture Search, in ICML 2018.☆113Updated 6 years ago
- Efficient Architecture Search by Network Transformation, in AAAI 2018☆170Updated 5 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- An analytical performance modeling tool for deep neural networks.☆85Updated 3 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- ☆51Updated 6 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆38Updated 5 years ago
- ☆72Updated 5 years ago
- ☆13Updated this week
- Deep learning with a multiplication budget☆47Updated 6 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆180Updated 5 years ago
- GPU-specialized parameter server for GPU machine learning.☆100Updated 6 years ago
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆85Updated 3 years ago
- ☆29Updated 4 years ago
- Example of multi-process, multi-GPU training using Torch-parallel, nVidia-nccl, and nVidia-MPS☆14Updated 7 years ago
- Cyclades☆28Updated 6 years ago
- Training wide residual networks for deployment using a single bit for each weight☆36Updated 4 years ago
- Binarized Neural Network TF training code + C matrix / eval library.☆98Updated 6 years ago
- PyProf2: PyTorch Profiling tool☆83Updated 4 years ago
- Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934☆111Updated 4 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆148Updated 7 years ago
- This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.☆42Updated last year
- ☆35Updated 5 years ago
- Spatially Adaptive Computation Time for Residual Networks☆246Updated last year
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 7 years ago
- Code for ICML 2017 paper, SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization☆55Updated 6 years ago
- ☆137Updated 7 years ago
- Implementing Google's DistBelief paper☆107Updated last year