ICCV2021 / Autoformer
☆20 Updated 4 years ago
Alternatives and similar repositories for Autoformer
Users who are interested in Autoformer compare it to the repositories listed below.
- ☆48 Updated 2 years ago
- The codebase for paper "PPT: Token Pruning and Pooling for Efficient Vision Transformer" ☆28 Updated last year
- BM-NAS: Bilevel Multimodal Neural Architecture Search (AAAI 2022 Oral) ☆19 Updated 3 years ago
- ☆48 Updated last year
- ☆13 Updated 4 years ago
- To appear in the 11th International Conference on Learning Representations (ICLR 2023). ☆18 Updated 2 years ago
- ☆13 Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity" ☆74 Updated 3 years ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o… ☆44 Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆39 Updated last year
- ☆20 Updated 3 years ago
- [ICLR 2024] Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks ☆43 Updated last year
- A repository for DenseSSMs ☆88 Updated last year
- ☆22 Updated 4 years ago
- ☆17 Updated 3 years ago
- ☆53 Updated last year
- [CVPR '23] PA&DA: Jointly Sampling PAth and DAta for Consistent NAS ☆36 Updated 2 years ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022. ☆33 Updated 3 years ago
- Code for Explicit Sparse Transformer ☆61 Updated 2 years ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li… ☆55 Updated 2 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022 ☆37 Updated 3 years ago
- State Space Models ☆71 Updated last year
- Sparse Attention with Linear Units ☆19 Updated 4 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers ☆29 Updated 3 years ago
- The repo for reproducing the main results in TSMixer: An all-MLP Architecture for Time Series Forecasting. ☆10 Updated 2 years ago
- This project contains code for the paper titled "SpikingBERT: Distilling BERT to Train Spiking Language Models Using Implicit Differentia… ☆28 Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆30 Updated last year
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers ☆14 Updated last year
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021) ☆65 Updated 4 years ago
- Implementation of AAAI 2022 Paper: Go wider instead of deeper ☆32 Updated 3 years ago