utsaslab / MONeT
MONeT framework for reducing memory consumption of DNN training
☆173 Updated 3 years ago
Alternatives and similar repositories for MONeT:
Users who are interested in MONeT are comparing it to the libraries listed below.
- PyTorch implementation of L2L execution algorithm ☆107 Updated 2 years ago
- PyProf2: PyTorch Profiling tool ☆82 Updated 4 years ago
- Slicing a PyTorch Tensor Into Parallel Shards ☆298 Updated 3 years ago
- Lightweight and Parallel Deep Learning Framework ☆262 Updated 2 years ago
- Example code showing how to use Nvidia DALI in pytorch, with fallback to torchvision. Contains a few differences to the official Nvidia … ☆197 Updated 5 years ago
- Train ImageNet in 18 minutes on AWS ☆129 Updated last year
- "Layer-wise Adaptive Rate Scaling" in PyTorch ☆86 Updated 4 years ago
- Distributed, mixed-precision training with PyTorch ☆90 Updated 4 years ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors. ☆251 Updated 2 years ago
- Labels and other data for the paper "Are we done with ImageNet?" ☆190 Updated 3 years ago
- Block-sparse primitives for PyTorch ☆154 Updated 4 years ago
- Programmable Neural Network Compression ☆148 Updated 2 years ago
- ☆62 Updated 4 years ago
- PyTorch layer-by-layer model profiler ☆606 Updated 3 years ago
- On Network Design Spaces for Visual Recognition ☆94 Updated 4 years ago
- A GPU performance profiling tool for PyTorch models ☆506 Updated 3 years ago
- A Re-implementation of Fixed-update Initialization ☆153 Updated 5 years ago
- Using ideas from product quantization for state-of-the-art neural network compression. ☆146 Updated 3 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆130 Updated 3 years ago
- Estimate/count FLOPS for a given neural network using pytorch ☆303 Updated 2 years ago
- Research and development for optimizing transformers ☆125 Updated 4 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing ☆331 Updated 9 months ago
- Fast Block Sparse Matrices for Pytorch ☆545 Updated 4 years ago
- ☆165 Updated 6 years ago
- ☆109 Updated 4 years ago
- How and why you want to make your pytorch CUDA/CPP extension with a Makefile ☆172 Updated 5 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large … ☆65 Updated 3 years ago
- Accelerate training by storing parameters in one contiguous chunk of memory. ☆291 Updated 4 years ago
- Experimental ground for optimizing memory of pytorch models ☆365 Updated 6 years ago
- Implementation of a Transformer, but completely in Triton ☆263 Updated 3 years ago