cybertronai / gradient-checkpointingLinks
Make huge neural nets fit in memory
☆2,826Updated 5 years ago
Alternatives and similar repositories for gradient-checkpointing
Users that are interested in gradient-checkpointing are comparing it to the libraries listed below
Sorting:
- A lightweight library for PyTorch training tools and utilities☆1,719Updated 3 weeks ago
- Mesh TensorFlow: Model Parallelism Made Easier☆1,624Updated 2 years ago
- PyTorch extensions for high performance and large scale training.☆3,393Updated 8 months ago
- Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.☆1,859Updated last year
- Profiling and inspecting memory in pytorch☆1,076Updated 4 months ago
- Fast & Simple Resource-Constrained Learning of Deep Network Structure☆1,033Updated 3 weeks ago
- A Python module for getting the GPU status from NVIDA GPUs using nvidia-smi programmically in Python☆1,210Updated last year
- Python library to easily log experiments and parallelize hyperparameter search for neural networks☆735Updated 3 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,606Updated 5 years ago
- Train AI models efficiently on medical images using any framework☆1,876Updated last year
- Experimental ground for optimizing memory of pytorch models☆366Updated 7 years ago
- On the Variance of the Adaptive Learning Rate and Beyond☆2,549Updated 4 years ago
- PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"☆2,730Updated 2 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆683Updated 5 years ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,739Updated 3 weeks ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,062Updated 2 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,723Updated this week
- ☆1,211Updated 5 years ago
- Reference implementations of MLPerf® training benchmarks☆1,736Updated 3 weeks ago
- Providing reproducibility in deep learning frameworks☆434Updated last year
- A GPipe implementation in PyTorch☆861Updated last year
- Standalone TFRecord reader/writer with PyTorch data loaders☆899Updated 7 months ago
- TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing"☆1,580Updated 6 years ago
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,754Updated last week
- A benchmark framework for Tensorflow☆1,145Updated 2 years ago
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆1,002Updated this week
- Model summary in PyTorch similar to `model.summary()` in Keras☆4,066Updated last year
- Differentiable architecture search for convolutional and recurrent networks☆3,989Updated 5 years ago
- torchbearer: A model fitting library for PyTorch☆641Updated 2 years ago
- PyTorch elastic training☆728Updated 3 years ago