cybertronai / gradient-checkpointing
Make huge neural nets fit in memory
☆2,730Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for gradient-checkpointing
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,489Updated this week
- Mesh TensorFlow: Model Parallelism Made Easier☆1,591Updated last year
- PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"☆2,707Updated last year
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,524Updated 4 years ago
- Profiling and inspecting memory in pytorch☆1,020Updated 3 months ago
- A lightweight library for PyTorch training tools and utilities☆1,665Updated this week
- Train AI models efficiently on medical images using any framework☆1,866Updated 5 months ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,027Updated last year
- PyTorch extensions for high performance and large scale training.☆3,195Updated last week
- Python library to easily log experiments and parallelize hyperparameter search for neural networks☆736Updated 2 years ago
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,528Updated last week
- An optimizer that trains as fast as Adam and as good as SGD.☆2,907Updated last year
- A domain specific language to express machine learning workloads.☆1,761Updated last year
- On the Variance of the Adaptive Learning Rate and Beyond☆2,535Updated 3 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,415Updated 2 weeks ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,043Updated 7 months ago
- Collective communications library with various primitives for multi-machine training.☆1,227Updated this week
- A GPipe implementation in PyTorch☆818Updated 3 months ago
- Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…☆4,351Updated last year
- TensorFlow Code for paper "Efficient Neural Architecture Search via Parameter Sharing"☆1,582Updated 5 years ago
- Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.☆1,800Updated 9 months ago
- Model interpretability and understanding for PyTorch☆4,935Updated this week
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,695Updated 2 weeks ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.☆2,328Updated last month
- Lingvo☆2,816Updated this week
- Neural network visualization toolkit for keras☆2,982Updated 2 years ago
- PyTorch elastic training☆730Updated 2 years ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆8,524Updated this week
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆683Updated 4 years ago
- A curated list of awesome resources related to capsule networks☆974Updated 4 years ago