☆391Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for Nystromformer
Users that are interested in Nystromformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆145Mar 24, 2025Updated last year
- Code for the CVPR 2020 [ORAL] paper "SAM: The Sensitivity of Attribution Methods to Hyperparameters"☆29Dec 8, 2022Updated 3 years ago
- Collection of machine learning research paper references☆25Feb 23, 2025Updated last year
- A library for evaluating representations.☆80Nov 21, 2021Updated 4 years ago
- Estimating Example Difficulty using Variance of Gradients☆66Jan 10, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆21Jul 1, 2021Updated 4 years ago
- A lightweight experimental logging library☆54Dec 23, 2025Updated 4 months ago
- ☆14Jul 2, 2019Updated 6 years ago
- Course notes and notebooks to teach the fundamentals of how deep learning works; uses PyTorch.☆82Feb 16, 2021Updated 5 years ago
- Run fully connected artificial neural networks with dropout applied (mini)batchwise, rather than samplewise. Given two hidden layers each…☆15May 18, 2015Updated 10 years ago
- There and Back Again: Revisiting Backpropagation Saliency Methods (CVPR 2020)☆53Apr 7, 2020Updated 6 years ago
- ☆115Aug 6, 2024Updated last year
- Pytorch library for fast transformer implementations☆1,769Mar 23, 2023Updated 3 years ago
- ☆42May 20, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆44Apr 14, 2021Updated 5 years ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,177Feb 2, 2022Updated 4 years ago
- [ICML 2021] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training (official implementation)☆106Dec 19, 2022Updated 3 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Apr 18, 2022Updated 4 years ago
- Official code Cross-Covariance Image Transformer (XCiT)☆679Sep 28, 2021Updated 4 years ago
- My implementation of DeepMind's Perceiver☆65Apr 23, 2021Updated 5 years ago
- ☆93Nov 15, 2019Updated 6 years ago
- A Python library for mathematical optimization☆143Sep 27, 2024Updated last year
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Sep 27, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JAvascript Design Environment☆91Apr 14, 2023Updated 3 years ago
- Neural Ensemble Search for Uncertainty Estimation and Dataset Shift☆35Jan 10, 2026Updated 3 months ago
- Trains Transformer model variants. Data isn't shuffled between batches.☆145Oct 5, 2022Updated 3 years ago
- PyTorch Implementation of OpenAI GPT☆130Jun 28, 2023Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,128Apr 20, 2022Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆114Jun 10, 2021Updated 4 years ago
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".☆158Mar 13, 2021Updated 5 years ago
- ☆256Dec 27, 2022Updated 3 years ago
- Code for the Proceedings of the National Academy of Sciences 2020 article, "Understanding the Role of Individual Units in a Deep Neural N…☆309Jan 9, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)☆485May 7, 2021Updated 4 years ago
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆417Jan 13, 2021Updated 5 years ago
- Official DeiT repository☆4,341Mar 15, 2024Updated 2 years ago
- A pytorch implementation of Progressive-GAN that is actually works, readable and simple to customize☆85Mar 12, 2022Updated 4 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,168Mar 22, 2024Updated 2 years ago
- Expressive Power of Invariant and Equivariant Graph Neural Networks (ICLR 2021)☆42Aug 25, 2023Updated 2 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Aug 4, 2021Updated 4 years ago