ICCV2021 / Autoformer
☆16 Updated 3 years ago
Related projects:
- BM-NAS: Bilevel Multimodal Neural Architecture Search (AAAI 2022 Oral) ☆18 Updated last year
- code for Explicit Sparse Transformer ☆57 Updated last year
- Implementation of AAAI 2022 Paper: Go wider instead of deeper ☆32 Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022. ☆26 Updated last year
- PyTorch implementation of RealFormer: Transformer Likes Residual Attention ☆11 Updated 3 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022 ☆37 Updated last year
- Sparse Attention with Linear Units ☆17 Updated 3 years ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li… ☆45 Updated 9 months ago
- ☆13 Updated 3 years ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning ☆25 Updated last month
- A simple program scheduler for your code on different devices. ☆11 Updated last month
- ☆12 Updated 7 months ago
- Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention ☆17 Updated 3 years ago
- ☆16 Updated 2 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆33 Updated 6 months ago
- To appear in the 11th International Conference on Learning Representations (ICLR 2023). ☆16 Updated last year
- custom pytorch implementation of MoCo v3 ☆43 Updated 3 years ago
- AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning. AAAI, 2023. ☆22 Updated 11 months ago
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021) ☆63 Updated 3 years ago
- ☆21 Updated 2 years ago
- ☆17 Updated last year
- [CVPR '23] PA&DA: Jointly Sampling PAth and DAta for Consistent NAS ☆32 Updated last year
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.… ☆38 Updated last year
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC… ☆19 Updated last year
- ☆23 Updated last year
- [TPAMI-2023] Official implementations of L-MCL: Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition ☆21 Updated last year
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation ☆24 Updated 8 months ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model ☆80 Updated 9 months ago
- Data-Free Neural Architecture Search via Recursive Label Calibration. ECCV 2022. ☆32 Updated 2 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation ☆40 Updated last year