cybertronai / gradient-checkpointingLinks
Make huge neural nets fit in memory
☆2,797Updated 5 years ago
Alternatives and similar repositories for gradient-checkpointing
Users that are interested in gradient-checkpointing are comparing it to the libraries listed below
Sorting:
- Mesh TensorFlow: Model Parallelism Made Easier☆1,608Updated last year
- A GPipe implementation in PyTorch☆843Updated 10 months ago
- Profiling and inspecting memory in pytorch☆1,061Updated 10 months ago
- PyTorch extensions for high performance and large scale training.☆3,331Updated last month
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,041Updated 2 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,577Updated 4 years ago
- A lightweight library for PyTorch training tools and utilities☆1,699Updated last week
- ☆1,209Updated 5 years ago
- Experimental ground for optimizing memory of pytorch models☆366Updated 7 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,115Updated last year
- Pytorch library for fast transformer implementations☆1,714Updated 2 years ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,434Updated this week
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,668Updated this week
- On the Variance of the Adaptive Learning Rate and Beyond☆2,549Updated 3 years ago
- Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.☆1,832Updated last year
- PyTorch elastic training☆728Updated 3 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆685Updated 5 years ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆2,639Updated 3 weeks ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,686Updated this week
- FastFormers - highly efficient transformer models for NLU☆705Updated 3 months ago
- ☆536Updated 3 years ago
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)☆2,098Updated 3 years ago
- Reference implementations of MLPerf™ training benchmarks☆1,683Updated last month
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,623Updated this week
- An optimizer that trains as fast as Adam and as good as SGD.☆2,916Updated last year
- Gin provides a lightweight configuration framework for Python☆2,107Updated last month
- Reformer, the efficient Transformer, in Pytorch☆2,170Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,113Updated 3 years ago
- torchbearer: A model fitting library for PyTorch☆640Updated last year
- Demo of running NNs across different frameworks☆1,652Updated 2 years ago