facebookresearch / ppudaLinks
Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)
☆490Updated 2 years ago
Alternatives and similar repositories for ppuda
Users that are interested in ppuda are comparing it to the libraries listed below
Sorting:
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks☆487Updated 3 years ago
- Code for Neural Architecture Search without Training (ICML 2021)☆474Updated 4 years ago
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization☆344Updated last year
- End-to-end training of sparse deep neural networks with little-to-no performance loss.☆334Updated 3 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Updated 4 years ago
- Official codebase for Pretrained Transformers as Universal Computation Engines.☆247Updated 4 years ago
- ☆619Updated 3 weeks ago
- Official code for the Stochastic Polyak step-size optimizer☆139Updated last year
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆492Updated 2 years ago
- ☆388Updated 2 years ago
- Understanding Training Dynamics of Deep ReLU Networks☆306Updated 3 months ago
- Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using co…☆345Updated 2 years ago
- Gradient based Hyperparameter Tuning library in PyTorch☆291Updated 5 years ago
- An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up proc…☆196Updated 4 years ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆484Updated 2 months ago
- Code for the Proceedings of the National Academy of Sciences 2020 article, "Understanding the Role of Individual Units in a Deep Neural N…☆306Updated 5 years ago
- ☆472Updated this week
- Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"☆502Updated 2 years ago
- Train ImageNet *fast* in 500 lines of code with FFCV☆149Updated last year
- Naszilla is a Python library for neural architecture search (NAS)☆316Updated 3 years ago
- ☆211Updated 3 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆129Updated 2 years ago
- A library to inspect and extract intermediate layers of PyTorch models.☆476Updated 3 years ago
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)☆485Updated 4 years ago
- Implementation of Estimating Training Data Influence by Tracing Gradient Descent (NeurIPS 2020)☆237Updated 3 years ago
- ☆133Updated 4 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆131Updated 3 years ago
- ☆194Updated 3 weeks ago
- Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"☆415Updated last year
- An alternative to convolution in neural networks☆259Updated last year