☆388Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for Nystromformer
Users that are interested in Nystromformer are comparing it to the libraries listed below
Sorting:
- Code for the CVPR 2020 [ORAL] paper "SAM: The Sensitivity of Attribution Methods to Hyperparameters"☆27Dec 8, 2022Updated 3 years ago
- A library for evaluating representations.☆77Nov 21, 2021Updated 4 years ago
- Estimating Example Difficulty using Variance of Gradients☆64Jan 10, 2023Updated 3 years ago
- Course notes and notebooks to teach the fundamentals of how deep learning works; uses PyTorch.☆80Feb 16, 2021Updated 5 years ago
- ☆14Jul 2, 2019Updated 6 years ago
- Pytorch library for fast transformer implementations☆1,763Mar 23, 2023Updated 2 years ago
- There and Back Again: Revisiting Backpropagation Saliency Methods (CVPR 2020)☆53Apr 7, 2020Updated 5 years ago
- A lightweight experimental logging library☆52Dec 23, 2025Updated 2 months ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆42Apr 14, 2021Updated 4 years ago
- [ICML 2021] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training (official implementation)☆107Dec 19, 2022Updated 3 years ago
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)☆485May 7, 2021Updated 4 years ago
- ☆112Aug 6, 2024Updated last year
- ☆21Jul 1, 2021Updated 4 years ago
- Official code Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- ☆42May 20, 2020Updated 5 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Apr 18, 2022Updated 3 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆114Jun 10, 2021Updated 4 years ago
- Official DeiT repository☆4,325Mar 15, 2024Updated last year
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,123Apr 20, 2022Updated 3 years ago
- ☆38Mar 9, 2021Updated 4 years ago
- Trains Transformer model variants. Data isn't shuffled between batches.☆143Oct 5, 2022Updated 3 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,163Mar 22, 2024Updated last year
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"☆1,068Aug 9, 2024Updated last year
- Hopfield Networks is All You Need☆1,900Apr 23, 2023Updated 2 years ago
- FairSeq repo with Apollo optimizer☆114Dec 20, 2023Updated 2 years ago
- My implementation of DeepMind's Perceiver☆63Apr 23, 2021Updated 4 years ago
- PyTorch Implementation of OpenAI GPT☆128Jun 28, 2023Updated 2 years ago
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".☆158Mar 13, 2021Updated 4 years ago
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆415Jan 13, 2021Updated 5 years ago
- Long Range Arena for Benchmarking Efficient Transformers☆781Dec 16, 2023Updated 2 years ago
- Structured state space sequence models☆2,854Jul 17, 2024Updated last year
- ☆91Nov 15, 2019Updated 6 years ago
- Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"☆416Mar 21, 2024Updated last year
- JAvascript Design Environment☆88Apr 14, 2023Updated 2 years ago
- [ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.☆171Mar 8, 2021Updated 4 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,295Mar 3, 2024Updated 2 years ago
- A pytorch implementation of Progressive-GAN that is actually works, readable and simple to customize☆85Mar 12, 2022Updated 3 years ago
- Longformer: The Long-Document Transformer☆2,188Feb 8, 2023Updated 3 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago