facebookresearch / ppuda
Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)
☆487Updated last year
Alternatives and similar repositories for ppuda
Users that are interested in ppuda are comparing it to the libraries listed below
Sorting:
- A library to inspect and extract intermediate layers of PyTorch models.☆473Updated 3 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆180Updated 3 years ago
- Code for Neural Architecture Search without Training (ICML 2021)☆465Updated 3 years ago
- Naszilla is a Python library for neural architecture search (NAS)☆309Updated 2 years ago
- ☆376Updated last year
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks☆477Updated 2 years ago
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization☆339Updated 10 months ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆252Updated 2 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆214Updated 4 years ago
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)☆484Updated 4 years ago
- Lightweight Hyperparameter Optimization 🚂☆147Updated 8 months ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆258Updated last year
- Gradient based Hyperparameter Tuning library in PyTorch☆289Updated 4 years ago
- Unofficial JAX implementations of deep learning research papers☆156Updated 2 years ago
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆488Updated 2 years ago
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory☆437Updated 8 months ago
- End-to-end training of sparse deep neural networks with little-to-no performance loss.☆321Updated 2 years ago
- Named tensors with first-class dimensions for PyTorch☆320Updated last year
- ☆597Updated 6 months ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.☆251Updated 2 years ago
- Fast Block Sparse Matrices for Pytorch☆545Updated 4 years ago
- Official codebase for Pretrained Transformers as Universal Computation Engines.☆247Updated 3 years ago
- Implementation for the Lookahead Optimizer.☆241Updated 3 years ago
- Fully featured implementation of Routing Transformer☆292Updated 3 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆115Updated last year
- Train ImageNet *fast* in 500 lines of code with FFCV☆142Updated last year
- An alternative to convolution in neural networks☆254Updated last year
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆276Updated 2 years ago
- Code for our NeurIPS 2022 paper☆368Updated 2 years ago
- Pytorch Lightning Distributed Accelerators using Ray☆210Updated last year