An implementation of the efficient attention module.
☆328Nov 30, 2020Updated 5 years ago
Alternatives and similar repositories for efficient-attention
Users that are interested in efficient-attention are comparing it to the libraries listed below
Sorting:
- Transformer are RNNs: Fast Autoregressive Transformer with Linear Attention☆24Jan 7, 2021Updated 5 years ago
- Attention mechanism☆52Sep 13, 2021Updated 4 years ago
- Pytorch library for fast transformer implementations☆1,763Mar 23, 2023Updated 2 years ago
- list of efficient attention modules☆1,022Aug 23, 2021Updated 4 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆292Apr 25, 2022Updated 3 years ago
- ☆13Nov 7, 2021Updated 4 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆67Oct 11, 2022Updated 3 years ago
- ☆196Feb 14, 2023Updated 3 years ago
- ☆10Dec 13, 2022Updated 3 years ago
- ☆110Sep 15, 2021Updated 4 years ago
- FairSeq repo with Apollo optimizer☆114Dec 20, 2023Updated 2 years ago
- The code for Joint Neural Architecture Search and Quantization☆14Apr 10, 2019Updated 6 years ago
- MLP-Like Vision Permutator for Visual Recognition (PyTorch)☆192Mar 31, 2022Updated 3 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- ☆249Mar 16, 2022Updated 4 years ago
- My take on a practical implementation of Linformer for Pytorch.☆423Jul 27, 2022Updated 3 years ago
- Official code Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆27Oct 27, 2023Updated 2 years ago
- Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.☆20Oct 27, 2021Updated 4 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆199Dec 2, 2022Updated 3 years ago
- Transformers without Tears: Improving the Normalization of Self-Attention☆134May 29, 2024Updated last year
- Representative Graph Neural Network☆35Aug 12, 2020Updated 5 years ago
- ☆14Nov 20, 2022Updated 3 years ago
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆155Aug 19, 2023Updated 2 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Feb 26, 2025Updated last year
- Directed masked autoencoders☆14Updated this week
- 음성인식과 신호처리☆14Sep 12, 2021Updated 4 years ago
- ☆74Dec 8, 2022Updated 3 years ago
- Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484)☆14Jan 8, 2023Updated 3 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- ☆22Aug 1, 2018Updated 7 years ago
- PyTorch implementation of Non-Local Neural Networks (https://arxiv.org/pdf/1711.07971.pdf)☆253Feb 13, 2023Updated 3 years ago
- Pytorch implementation of Performer from the paper "Rethinking Attention with Performers".☆25Oct 5, 2020Updated 5 years ago
- CoaT: Co-Scale Conv-Attentional Image Transformers☆15Apr 20, 2021Updated 4 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago
- ☆92Jan 22, 2021Updated 5 years ago
- Megatron LM 11B on Huggingface Transformers☆27Jul 11, 2021Updated 4 years ago
- This is project for korean auto spacing☆12Aug 3, 2020Updated 5 years ago
- DisCo Transformer for Non-autoregressive MT☆77Jul 28, 2022Updated 3 years ago