cybertronai / gradient-checkpointing
Make huge neural nets fit in memory
☆2,789 Updated 5 years ago
Alternatives and similar repositories for gradient-checkpointing
Users who are interested in gradient-checkpointing compare it to the libraries listed below.
- PyTorch extensions for high performance and large scale training. ☆3,317 Updated 3 weeks ago
- A lightweight library for PyTorch training tools and utilities ☆1,695 Updated this week
- A GPipe implementation in PyTorch ☆840 Updated 9 months ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,571 Updated 4 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently. ☆4,660 Updated last week
- PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing" ☆2,719 Updated last year
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,040 Updated last year
- On the Variance of the Adaptive Learning Rate and Beyond ☆2,548 Updated 3 years ago
- Library for faster pinned CPU <-> GPU transfer in PyTorch ☆685 Updated 5 years ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,604 Updated this week
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille… ☆4,390 Updated 2 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆8,661 Updated this week
- Python library to easily log experiments and parallelize hyperparameter search for neural networks ☆735 Updated 2 years ago
- torch-optimizer -- collection of optimizers for PyTorch ☆3,112 Updated last year
- ☆1,209 Updated 4 years ago
- Experimental ground for optimizing memory of PyTorch models ☆365 Updated 7 years ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear… ☆5,401 Updated this week
- Collective communications library with various primitives for multi-machine training. ☆1,302 Updated this week
- PyTorch elastic training ☆729 Updated 2 years ago
- Fast & Simple Resource-Constrained Learning of Deep Network Structure ☆1,030 Updated last week
- Profiling and inspecting memory in PyTorch ☆1,057 Updated 9 months ago
- Neural network graphs and training metrics for PyTorch, TensorFlow, and Keras. ☆1,827 Updated last year
- Train AI models efficiently on medical images using any framework ☆1,874 Updated 11 months ago
- Model summary in PyTorch similar to `model.summary()` in Keras ☆4,040 Updated last year
- Differentiable architecture search for convolutional and recurrent networks ☆3,958 Updated 4 years ago
- A Python toolbox for performing gradient-free optimization ☆4,060 Updated 3 weeks ago
- tensorboard for pytorch (and chainer, mxnet, numpy, ...) ☆7,939 Updated 2 weeks ago
- The convertor/conversion of deep learning models for different deep learning frameworks/softwares. ☆3,248 Updated last year
- TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing" ☆1,580 Updated 5 years ago
- A scikit-learn compatible neural network library that wraps PyTorch ☆6,028 Updated 3 weeks ago