dbaranchuk / memory-efficient-maml
Memory-efficient MAML using gradient checkpointing
☆83 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for memory-efficient-maml
- Meta-Learning with Warped Gradient Descent ☆92 · Updated 3 years ago
- The original code for the paper "How to train your MAML" along with a replication of the original "Model Agnostic Meta Learning" (MAML) p… ☆40 · Updated 4 years ago
- Implementation of the paper Recurrent Independent Mechanisms (https://arxiv.org/pdf/1909.10893.pdf) ☆98 · Updated 2 years ago
- Reparameterize your PyTorch modules ☆72 · Updated 3 years ago
- Library to manage machine learning problems as 'Tasks' and to sample from Task distributions. Includes TensorFlow implementation of impli… ☆48 · Updated 2 years ago
- Code for "Recurrent Independent Mechanisms" ☆118 · Updated 2 years ago
- Measuring compositionality in representation learning ☆71 · Updated 5 years ago
- A collection of gradient-based meta-learning algorithms in PyTorch ☆61 · Updated 4 years ago
- Official code for ICLR 2020 paper "A Neural Dirichlet Process Mixture Model for Task-Free Continual Learning." ☆98 · Updated 4 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning ☆75 · Updated last year
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement ☆39 · Updated 4 years ago
- A library for evaluating representations. ☆76 · Updated 3 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088 ☆53 · Updated 5 years ago
- Benchmark for Lifelong learning research ☆118 · Updated 3 years ago
- Hypergradient descent ☆138 · Updated 5 months ago
- Recurrent Back Propagation, Back Propagation Through Optimization, ICML 2018 ☆39 · Updated 5 years ago
- An official PyTorch implementation of “Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation” (NeurIPS 2019) by Risto Vuorio*… ☆137 · Updated 4 years ago
- Code for "Supermasks in Superposition" ☆117 · Updated last year
- This repository is no longer maintained. Check ☆82 · Updated 4 years ago
- ☆119 · Updated 5 months ago
- Code from the article: "The Role of Disentanglement in Generalisation" (ICLR, 2021). ☆22 · Updated 2 years ago
- Code for "Online Learned Continual Compression with Adaptive Quantization Modules" ☆27 · Updated 4 years ago
- A Neuromodulated Meta-Learning algorithm ☆113 · Updated 4 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks ☆60 · Updated 3 years ago
- PyTorch Implementations of Dropout Variants ☆87 · Updated 6 years ago
- Implementation of iterative inference in deep latent variable models ☆43 · Updated 5 years ago
- A PyTorch Implementation of Attentive Neural Process ☆73 · Updated 5 years ago
- PyTorch Implementation of Neural Statistician ☆59 · Updated 2 years ago
- Implementation of the paper "Direct Optimization through argmax for discrete Variational Auto-Encoder" ☆14 · Updated 4 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution ☆36 · Updated 3 years ago