cybertronai / gradient-checkpointing
Make huge neural nets fit in memory
☆2,745Updated 4 years ago
Alternatives and similar repositories for gradient-checkpointing:
Users that are interested in gradient-checkpointing are comparing it to the libraries listed below
- On the Variance of the Adaptive Learning Rate and Beyond☆2,539Updated 3 years ago
- Train AI models efficiently on medical images using any framework☆1,870Updated 7 months ago
- Profiling and inspecting memory in pytorch☆1,038Updated 5 months ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,539Updated 4 years ago
- PyTorch extensions for high performance and large scale training.☆3,232Updated this week
- Python library to easily log experiments and parallelize hyperparameter search for neural networks☆734Updated 2 years ago
- A lightweight library for PyTorch training tools and utilities☆1,678Updated last week
- Mesh TensorFlow: Model Parallelism Made Easier☆1,598Updated last year
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,244Updated this week
- Experimental ground for optimizing memory of pytorch models☆361Updated 6 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,569Updated last week
- 🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code☆2,798Updated last year
- Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.☆1,803Updated 11 months ago
- A GPipe implementation in PyTorch☆821Updated 5 months ago
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)☆2,103Updated 3 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆684Updated 4 years ago
- PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"☆2,709Updated last year
- A small package to create visualizations of PyTorch execution graphs☆3,271Updated 2 weeks ago
- An optimizer that trains as fast as Adam and as good as SGD.☆2,909Updated last year
- PyTorch elastic training☆730Updated 2 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,030Updated last year
- ☆1,207Updated 4 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,067Updated 9 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,337Updated last month
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…☆4,364Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,502Updated last month
- Fast and Easy Infinite Neural Networks in Python☆2,306Updated 10 months ago
- Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.☆4,275Updated last month
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons☆1,693Updated 4 months ago
- MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…☆5,809Updated 7 months ago