cybertronai / gradient-checkpointing
Make huge neural nets fit in memory
☆2,784Updated 4 years ago
Alternatives and similar repositories for gradient-checkpointing:
Users that are interested in gradient-checkpointing are comparing it to the libraries listed below
- PyTorch extensions for high performance and large scale training.☆3,306Updated 2 weeks ago
- PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"☆2,716Updated last year
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,650Updated last week
- On the Variance of the Adaptive Learning Rate and Beyond☆2,548Updated 3 years ago
- ☆1,208Updated 4 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆685Updated 5 years ago
- A lightweight library for PyTorch training tools and utilities☆1,691Updated 2 weeks ago
- An optimizer that trains as fast as Adam and as good as SGD.☆2,914Updated last year
- torch-optimizer -- collection of optimizers for Pytorch☆3,103Updated last year
- Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.☆1,825Updated last year
- TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing"☆1,580Updated 5 years ago
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…☆4,389Updated 2 years ago
- Profiling and inspecting memory in pytorch☆1,057Updated 8 months ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,569Updated 4 years ago
- Fast & Simple Resource-Constrained Learning of Deep Network Structure☆1,029Updated 2 months ago
- A GPipe implementation in PyTorch☆836Updated 9 months ago
- Python library to easily log experiments and parallelize hyperparameter search for neural networks☆735Updated 2 years ago
- The convertor/conversion of deep learning models for different deep learning frameworks/softwares.☆3,249Updated last year
- Mesh TensorFlow: Model Parallelism Made Easier☆1,604Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,630Updated 2 weeks ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆2,564Updated 2 months ago
- Train AI models efficiently on medical images using any framework☆1,874Updated 10 months ago
- Accelerated deep learning R&D☆3,347Updated last year
- Experimental ground for optimizing memory of pytorch models☆365Updated 7 years ago
- Tutorial for building a custom CUDA function for Pytorch☆511Updated 6 years ago
- MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…☆5,811Updated 10 months ago
- Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.☆4,295Updated 4 months ago
- Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyda…☆1,362Updated last year
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,112Updated 3 years ago
- Differentiable architecture search for convolutional and recurrent networks☆3,950Updated 4 years ago