☆390Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for Nystromformer
Users that are interested in Nystromformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆145Mar 24, 2025Updated last year
- Code for the CVPR 2020 [ORAL] paper "SAM: The Sensitivity of Attribution Methods to Hyperparameters"☆29Dec 8, 2022Updated 3 years ago
- Collection of machine learning research paper references☆28Feb 23, 2025Updated last year
- A library for evaluating representations.☆80Nov 21, 2021Updated 4 years ago
- Estimating Example Difficulty using Variance of Gradients☆66Jan 10, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21Jul 1, 2021Updated 4 years ago
- A lightweight experimental logging library☆54Dec 23, 2025Updated 3 months ago
- ☆14Jul 2, 2019Updated 6 years ago
- Course notes and notebooks to teach the fundamentals of how deep learning works; uses PyTorch.☆82Feb 16, 2021Updated 5 years ago
- Run fully connected artificial neural networks with dropout applied (mini)batchwise, rather than samplewise. Given two hidden layers each…☆15May 18, 2015Updated 10 years ago
- There and Back Again: Revisiting Backpropagation Saliency Methods (CVPR 2020)☆53Apr 7, 2020Updated 6 years ago
- ☆114Aug 6, 2024Updated last year
- Pytorch library for fast transformer implementations☆1,767Mar 23, 2023Updated 3 years ago
- ☆42May 20, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆44Apr 14, 2021Updated 5 years ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch☆1,177Feb 2, 2022Updated 4 years ago
- [ICML 2021] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training (official implementation)☆106Dec 19, 2022Updated 3 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Apr 18, 2022Updated 3 years ago
- Official code Cross-Covariance Image Transformer (XCiT)☆676Sep 28, 2021Updated 4 years ago
- My implementation of DeepMind's Perceiver☆65Apr 23, 2021Updated 4 years ago
- ☆93Nov 15, 2019Updated 6 years ago
- A Python library for mathematical optimization☆141Sep 27, 2024Updated last year
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Sep 27, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- JAvascript Design Environment☆90Apr 14, 2023Updated 3 years ago
- Neural Ensemble Search for Uncertainty Estimation and Dataset Shift☆35Jan 10, 2026Updated 3 months ago
- Trains Transformer model variants. Data isn't shuffled between batches.☆143Oct 5, 2022Updated 3 years ago
- PyTorch Implementation of OpenAI GPT☆130Jun 28, 2023Updated 2 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆113Jun 10, 2021Updated 4 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,128Apr 20, 2022Updated 3 years ago
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".☆159Mar 13, 2021Updated 5 years ago
- ☆255Dec 27, 2022Updated 3 years ago
- Code for the Proceedings of the National Academy of Sciences 2020 article, "Understanding the Role of Individual Units in a Deep Neural N…☆309Jan 9, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)☆486May 7, 2021Updated 4 years ago
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆416Jan 13, 2021Updated 5 years ago
- Official DeiT repository☆4,334Mar 15, 2024Updated 2 years ago
- A pytorch implementation of Progressive-GAN that is actually works, readable and simple to customize☆85Mar 12, 2022Updated 4 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,170Mar 22, 2024Updated 2 years ago
- Expressive Power of Invariant and Equivariant Graph Neural Networks (ICLR 2021)☆43Aug 25, 2023Updated 2 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Aug 4, 2021Updated 4 years ago