OATML / non-parametric-transformersLinks
Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"
☆415Updated last year
Alternatives and similar repositories for non-parametric-transformers
Users that are interested in non-parametric-transformers are comparing it to the libraries listed below
Sorting:
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks☆485Updated 3 years ago
- ☆471Updated 2 months ago
- ☆387Updated 2 years ago
- ☆312Updated 9 months ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆259Updated 2 years ago
- Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)☆492Updated 2 years ago
- Official codebase for Pretrained Transformers as Universal Computation Engines.☆247Updated 3 years ago
- Fast Differentiable Sorting and Ranking☆612Updated last year
- A library to inspect and extract intermediate layers of PyTorch models.☆475Updated 3 years ago
- An alternative to convolution in neural networks☆258Updated last year
- My implementation of DeepMind's Perceiver☆63Updated 4 years ago
- Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization☆344Updated last year
- Deep Learning project template best practices with Pytorch Lightning, Hydra, Tensorboard.☆160Updated 4 years ago
- Fast, differentiable sorting and ranking in PyTorch☆846Updated 6 months ago
- Hopular: Modern Hopfield Networks for Tabular Data☆313Updated 3 years ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"☆184Updated 4 years ago
- ☆251Updated 2 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.☆601Updated 2 weeks ago
- This library would form a permanent home for reusable components for deep probabilistic programming. The library would form and harness a…☆311Updated 5 months ago
- MADGRAD Optimization Method☆804Updated 10 months ago
- Enabling easy statistical significance testing for deep neural networks.☆338Updated last year
- Gradient based Hyperparameter Tuning library in PyTorch☆291Updated 5 years ago
- ☆100Updated 4 years ago
- Pytorch Lightning Distributed Accelerators using Ray☆215Updated 2 years ago
- VICReg official code base☆550Updated 2 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆769Updated last year
- Project site for "Your Classifier is Secretly an Energy-Based Model and You Should Treat it Like One"☆425Updated 3 years ago
- Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using co…☆343Updated 2 years ago
- A repository for explaining feature attributions and feature interactions in deep neural networks.☆192Updated 3 years ago
- Differentiable Sorting Networks☆125Updated 2 years ago