yingyichen-cyy / PrimalAttentionLinks
(NeurIPS 2023) PyTorch implementation of "Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation"
☆19Updated 8 months ago
Alternatives and similar repositories for PrimalAttention
Users that are interested in PrimalAttention are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆72Updated 2 years ago
- [CVPR 2024] Friendly Sharpness-Aware Minimization☆33Updated 7 months ago
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆91Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆53Updated 8 months ago
- ☆63Updated 4 months ago
- Mixture of Attention Heads☆47Updated 2 years ago
- ☆27Updated 2 years ago
- Transformers w/o Attention, based fully on MLPs☆93Updated last year
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆45Updated last year
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆54Updated 2 years ago
- ☆31Updated 2 years ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 2 years ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆34Updated last year
- A repository for DenseSSMs☆87Updated last year
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆88Updated last year
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248☆55Updated last year
- ☆66Updated 8 months ago
- Variance Covariance Regularization☆14Updated 2 years ago
- ☆90Updated 2 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training☆43Updated 4 years ago
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆46Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated last year
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆80Updated last year
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆16Updated 7 months ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆80Updated 3 weeks ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆97Updated 2 months ago
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆101Updated last year
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆57Updated last year