cybertronai / gradient-checkpointingLinks

Make huge neural nets fit in memory

☆2,803

Alternatives and similar repositories for gradient-checkpointing

Users that are interested in gradient-checkpointing are comparing it to the libraries listed below

Sorting:

tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,613Updated last year
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,350Updated 3 months ago
pytorch / tnt
A lightweight library for PyTorch training tools and utilities
☆1,700Updated last week
bckenstler / CLR
☆1,208Updated 5 years ago
openai / blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
☆1,044Updated 2 years ago
anderskm / gputil
A Python module for getting the GPU status from NVIDA GPUs using nvidia-smi programmically in Python
☆1,196Updated last year
Stonesjtu / pytorch_memlab
Profiling and inspecting memory in pytorch
☆1,065Updated 11 months ago
google-research / morph-net
Fast & Simple Resource-Constrained Learning of Deep Network Structure
☆1,031Updated last month
mlcommons / training
Reference implementations of MLPerf™ training benchmarks
☆1,696Updated last week
LiyuanLucasLiu / RAdam
On the Variance of the Adaptive Learning Rate and Beyond
☆2,552Updated 4 years ago
williamFalcon / test-tube
Python library to easily log experiments and parallelize hyperparameter search for neural networks
☆736Updated 3 years ago
prigoyal / pytorch_memonger
Experimental ground for optimizing memory of pytorch models
☆366Updated 7 years ago
kakaobrain / torchgpipe
A GPipe implementation in PyTorch
☆846Updated last year
openai / sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
☆1,583Updated 4 years ago
Santosh-Gupta / SpeedTorch
Library for faster pinned CPU <-> GPU transfer in Pytorch
☆685Updated 5 years ago
carpedm20 / ENAS-pytorch
PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"
☆2,722Updated 2 years ago
pytorch / benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆966Updated this week
waleedka / hiddenlayer
Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.
☆1,840Updated last year
pytorch / xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
☆2,647Updated this week
pytorch / elastic
PyTorch elastic training
☆729Updated 3 years ago
IntelLabs / distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…
☆4,400Updated 2 years ago
webdataset / webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
☆2,736Updated last month
vahidk / tfrecord
Standalone TFRecord reader/writer with PyTorch data loaders
☆889Updated 2 months ago
nitrain / nitrain
Train AI models efficiently on medical images using any framework
☆1,874Updated last year
NVIDIA / framework-reproducibility
Providing reproducibility in deep learning frameworks
☆428Updated last year
facebookresearch / TensorComprehensions
A domain specific language to express machine learning workloads.
☆1,760Updated 2 years ago
Luolc / AdaBound
An optimizer that trains as fast as Adam and as good as SGD.
☆2,916Updated 2 years ago
facebookresearch / bitsandbytes
Library for 8-bit optimizers and quantization routines.
☆769Updated 2 years ago
chrischoy / pytorch-custom-cuda-tutorial
Tutorial for building a custom CUDA function for Pytorch
☆519Updated 6 years ago
szagoruyko / pytorchviz
A small package to create visualizations of PyTorch execution graphs
☆3,403Updated 7 months ago