mlpen/Nystromformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mlpen/Nystromformer)

mlpen / Nystromformer

☆392

Alternatives and similar repositories for Nystromformer

Users that are interested in Nystromformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / nystrom-attention
View on GitHub
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆145Mar 24, 2025Updated last year
anguyen8 / sam
View on GitHub
Code for the CVPR 2020 [ORAL] paper "SAM: The Sensitivity of Attribution Methods to Hyperparameters"
☆29Dec 8, 2022Updated 3 years ago
donutloop / machine-learning-research-papers
View on GitHub
Collection of machine learning research paper references
☆26Jul 16, 2026Updated last week
willwhitney / reprieve
View on GitHub
A library for evaluating representations.
☆81Nov 21, 2021Updated 4 years ago
chirag-agarwall / VOG
View on GitHub
Estimating Example Difficulty using Variance of Gradients
☆66Jan 10, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
MadryLab / cox
View on GitHub
A lightweight experimental logging library
☆53Dec 23, 2025Updated 7 months ago
mlpen / YOSO
View on GitHub
☆21Jul 1, 2021Updated 5 years ago
prichemond / ds3
View on GitHub
☆14Jul 2, 2019Updated 7 years ago
parrt / fundamentals-of-deep-learning
View on GitHub
Course notes and notebooks to teach the fundamentals of how deep learning works; uses PyTorch.
☆84Feb 16, 2021Updated 5 years ago
btgraham / Batchwise-Dropout
View on GitHub
Run fully connected artificial neural networks with dropout applied (mini)batchwise, rather than samplewise. Given two hidden layers each…
☆15May 18, 2015Updated 11 years ago
google-research / dice_rl
View on GitHub
☆114Jul 3, 2026Updated 3 weeks ago
srebuffi / revisiting_saliency
View on GitHub
There and Back Again: Revisiting Backpropagation Saliency Methods (CVPR 2020)
☆53Apr 7, 2020Updated 6 years ago
idiap / fast-transformers
View on GitHub
Pytorch library for fast transformer implementations
☆1,775Mar 23, 2023Updated 3 years ago
hadarser / ProvablyPowerfulGraphNetworks_torch
View on GitHub
☆42May 20, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
timy90022 / DropLoss
View on GitHub
Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch
☆44Apr 14, 2021Updated 5 years ago
lsj2408 / GraphNorm
View on GitHub
[ICML 2021] GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training (official implementation)
☆106Dec 19, 2022Updated 3 years ago
lucidrains / performer-pytorch
View on GitHub
An implementation of Performer, a linear attention-based transformer, in Pytorch
☆1,179Feb 2, 2022Updated 4 years ago
NVIDIA / transformer-ls
View on GitHub
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
☆228Apr 18, 2022Updated 4 years ago
facebookresearch / xcit
View on GitHub
Official code Cross-Covariance Image Transformer (XCiT)
☆681Sep 28, 2021Updated 4 years ago
louislva / deepmind-perceiver
View on GitHub
My implementation of DeepMind's Perceiver
☆65Apr 23, 2021Updated 5 years ago
openopt / copt
View on GitHub
A Python library for mathematical optimization
☆143Sep 27, 2024Updated last year
oxwhirl / treeqn
View on GitHub
☆93Nov 15, 2019Updated 6 years ago
lucidrains / remixer-pytorch
View on GitHub
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Sep 27, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
automl / nes
View on GitHub
Neural Ensemble Search for Uncertainty Estimation and Dataset Shift
☆35Jan 10, 2026Updated 6 months ago
lyeoni / gpt-pytorch
View on GitHub
PyTorch Implementation of OpenAI GPT
☆131Jun 28, 2023Updated 3 years ago
facebookresearch / transformer-sequential
View on GitHub
Trains Transformer model variants. Data isn't shuffled between batches.
☆147Oct 5, 2022Updated 3 years ago
benjs / nfnets_pytorch
View on GitHub
Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".
☆157Mar 13, 2021Updated 5 years ago
harvardnlp / pytorch-struct
View on GitHub
Fast, general, and tested differentiable structured prediction in PyTorch
☆1,133Apr 20, 2022Updated 4 years ago
ischlag / fast-weight-transformers
View on GitHub
Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.
☆115Jun 10, 2021Updated 5 years ago
izmailovpavel / understandingbdl
View on GitHub
☆257Dec 27, 2022Updated 3 years ago
davidbau / dissect
View on GitHub
Code for the Proceedings of the National Academy of Sciences 2020 article, "Understanding the Role of Individual Units in a Deep Neural N…
☆310Jan 9, 2021Updated 5 years ago
clovaai / AdamP
View on GitHub
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)
☆412Jan 13, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,359Mar 15, 2024Updated 2 years ago
bingchenlll / Progressive-GAN-pytorch
View on GitHub
A pytorch implementation of Progressive-GAN that is actually works, readable and simple to customize
☆85Mar 12, 2022Updated 4 years ago
mlelarge / graph_neural_net
View on GitHub
Expressive Power of Invariant and Equivariant Graph Neural Networks (ICLR 2021)
☆43Aug 25, 2023Updated 2 years ago
lukemelas / do-you-even-need-attention
View on GitHub
Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)
☆485May 7, 2021Updated 5 years ago
jettify / pytorch-optimizer
View on GitHub
torch-optimizer -- collection of optimizers for Pytorch
☆3,170Mar 22, 2024Updated 2 years ago
basiclab / CPCStoryVisualization-Pytorch
View on GitHub
Character-Preserving Coherent Story Visualization, ECCV 2020
☆42Mar 12, 2021Updated 5 years ago
lucidrains / long-short-transformer
View on GitHub
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
☆120Aug 4, 2021Updated 4 years ago