facebookresearch / ppuda
Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)
☆489 Updated last year
Alternatives and similar repositories for ppuda
Users who are interested in ppuda are comparing it to the libraries listed below.
- Code for Neural Architecture Search without Training (ICML 2021) ☆466 Updated 3 years ago
- A library to inspect and extract intermediate layers of PyTorch models. ☆473 Updated 3 years ago
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks ☆479 Updated 2 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers" ☆180 Updated 3 years ago
- Naszilla is a Python library for neural architecture search (NAS) ☆310 Updated 2 years ago
- Fast Block Sparse Matrices for Pytorch ☆546 Updated 4 years ago
- End-to-end training of sparse deep neural networks with little-to-no performance loss. ☆322 Updated 2 years ago
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization ☆339 Updated 11 months ago
- ☆376 Updated last year
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch ☆252 Updated 2 years ago
- Gradient based Hyperparameter Tuning library in PyTorch ☆290 Updated 4 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities ☆209 Updated last year
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable ☆215 Updated 4 years ago
- Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral) ☆349 Updated last month
- Named tensors with first-class dimensions for PyTorch ☆331 Updated last year
- Neural Architecture Search (NAS) papers with code ☆160 Updated 3 years ago
- ☆471 Updated last month
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V… ☆488 Updated 2 years ago
- Implementation for the paper "Adversarial Continual Learning" in PyTorch. ☆254 Updated 2 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021). ☆225 Updated 3 years ago
- Understanding Training Dynamics of Deep ReLU Networks ☆293 Updated 3 weeks ago
- ☆226 Updated 10 months ago
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory ☆438 Updated 9 months ago
- Implementation for the Lookahead Optimizer. ☆240 Updated 3 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions ☆258 Updated last year
- Official code for the Stochastic Polyak step-size optimizer ☆139 Updated 11 months ago
- ☆598 Updated 7 months ago
- Library for 8-bit optimizers and quantization routines. ☆716 Updated 2 years ago
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723) ☆485 Updated 4 years ago
- Official codebase for Pretrained Transformers as Universal Computation Engines. ☆247 Updated 3 years ago