facebookresearch / ppuda
Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)
☆492 Updated 2 years ago
Alternatives and similar repositories for ppuda
Users that are interested in ppuda are comparing it to the libraries listed below
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks ☆482 Updated 3 years ago
- Code for Neural Architecture Search without Training (ICML 2021) ☆471 Updated 4 years ago
- ☆383 Updated 2 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers" ☆184 Updated 4 years ago
- End-to-end training of sparse deep neural networks with little-to-no performance loss. ☆324 Updated 2 years ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning ☆281 Updated 2 years ago
- Gradient based Hyperparameter Tuning library in PyTorch ☆290 Updated 5 years ago
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization ☆344 Updated last year
- Official code for the Stochastic Polyak step-size optimizer ☆139 Updated last year
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities ☆208 Updated last year
- ☆193 Updated 4 years ago
- An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up proc… ☆194 Updated 4 years ago
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723) ☆485 Updated 4 years ago
- ☆605 Updated 2 months ago
- Official codebase for Pretrained Transformers as Universal Computation Engines. ☆247 Updated 3 years ago
- Pytorch Lightning Distributed Accelerators using Ray ☆215 Updated last year
- A library to inspect and extract intermediate layers of PyTorch models. ☆475 Updated 3 years ago
- 🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet ☆186 Updated 2 years ago
- An alternative to convolution in neural networks ☆257 Updated last year
- Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries" ☆493 Updated 2 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable ☆217 Updated 4 years ago
- ☆133 Updated 4 years ago
- Train ImageNet *fast* in 500 lines of code with FFCV ☆149 Updated last year
- Understanding Training Dynamics of Deep ReLU Networks ☆300 Updated last week
- Unofficial JAX implementations of deep learning research papers ☆158 Updated 3 years ago
- Lightweight Hyperparameter Optimization 🚂 ☆150 Updated last year
- Implementation of Estimating Training Data Influence by Tracing Gradient Descent (NeurIPS 2020) ☆237 Updated 3 years ago
- ☆227 Updated last year
- Memory Efficient Attention (O(sqrt(n))) for Jax and PyTorch ☆182 Updated 2 years ago
- ☆471 Updated 2 weeks ago