ICCV2021 / AutoformerLinks
☆19Updated 4 years ago
Alternatives and similar repositories for Autoformer
Users that are interested in Autoformer are comparing it to the libraries listed below
Sorting:
- About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆326Updated last year
- ☆47Updated last year
- Implementation of the paper "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting", https://arxi…☆19Updated 4 years ago
- State Space Models☆70Updated last year
- The codebase for paper "PPT: Token Pruning and Pooling for Efficient Vision Transformer"☆26Updated 10 months ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- The official GitHub page for the survey paper "A Survey of RWKV".☆30Updated 8 months ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆43Updated 9 months ago
- BM-NAS: Bilevel Multimodal Neural Architecture Search (AAAI 2022 Oral)☆19Updated 2 years ago
- A repository for DenseSSMs☆88Updated last year
- ☆47Updated 2 years ago
- The repo for reproducing the main results in TSMixer: An all-MLP Architecture for Time Series Forecasting.☆10Updated 2 years ago
- code for Explicit Sparse Transformer☆61Updated 2 years ago
- Transformer are RNNs: Fast Autoregressive Transformer with Linear Attention☆23Updated 4 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated 2 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022☆36Updated 2 years ago
- An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition"☆125Updated 2 years ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆54Updated last year
- ☆17Updated 3 years ago
- ☆22Updated 3 years ago
- Simba☆212Updated last year
- [CVPR '23] PA&DA: Jointly Sampling PAth and DAta for Consistent NAS☆36Updated 2 years ago
- A Tight-fisted Optimizer☆50Updated 2 years ago
- ☆12Updated last year
- Non-official implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…☆27Updated last year
- ☆27Updated 3 years ago
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien…☆122Updated 2 weeks ago
- Official implementation of paper:Towards Deeper Level Decomposition of Linear and Nonlinear Patterns in Time Series.☆17Updated 2 weeks ago
- ☆19Updated 3 years ago