cybertronai / gradient-checkpointing
Make huge neural nets fit in memory
☆2,830 Updated 5 years ago
Alternatives and similar repositories for gradient-checkpointing
Users who are interested in gradient-checkpointing are comparing it to the libraries listed below.
- A lightweight library for PyTorch training tools and utilities ☆1,720 Updated this week
- PyTorch extensions for high performance and large scale training. ☆3,397 Updated 9 months ago
- Profiling and inspecting memory in PyTorch ☆1,077 Updated 5 months ago
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,625 Updated 2 years ago
- ☆1,210 Updated 5 years ago
- On the Variance of the Adaptive Learning Rate and Beyond ☆2,549 Updated 4 years ago
- Python library to easily log experiments and parallelize hyperparameter search for neural networks ☆735 Updated 3 years ago
- Fast & Simple Resource-Constrained Learning of Deep Network Structure ☆1,032 Updated 2 weeks ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,607 Updated 5 years ago
- Neural network graphs and training metrics for PyTorch, TensorFlow, and Keras. ☆1,859 Updated last year
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,063 Updated 2 years ago
- Providing reproducibility in deep learning frameworks ☆434 Updated last year
- Train AI models efficiently on medical images using any framework ☆1,877 Updated last year
- Experimental ground for optimizing memory of PyTorch models ☆366 Updated 7 years ago
- Library for faster pinned CPU <-> GPU transfer in PyTorch ☆683 Updated 5 years ago
- A Python module for getting the GPU status from NVIDIA GPUs using nvidia-smi programmatically in Python ☆1,210 Updated last year
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. ☆2,981 Updated this week
- A GPipe implementation in PyTorch ☆863 Updated last year
- A Docker image for PyTorch ☆993 Updated 2 years ago
- PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing" ☆2,729 Updated 2 years ago
- torch-optimizer -- collection of optimizers for PyTorch ☆3,161 Updated last year
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,748 Updated last month
- TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing" ☆1,580 Updated 6 years ago
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance. ☆1,012 Updated this week
- higher is a PyTorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr… ☆1,629 Updated 3 years ago
- Gin provides a lightweight configuration framework for Python ☆2,149 Updated 3 weeks ago
- Toolbox of models, callbacks, and datasets for AI/ML researchers. ☆1,754 Updated 3 weeks ago
- A small package to create visualizations of PyTorch execution graphs ☆3,485 Updated last year
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755) ☆2,113 Updated 4 years ago
- Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper. ☆1,538 Updated last year