ziplab / Mesa
This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".
☆120Updated 3 years ago
Alternatives and similar repositories for Mesa
Users who are interested in Mesa are comparing it to the repositories listed below.
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)☆99Updated 3 years ago
- ☆73Updated 2 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆89Updated 3 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Updated 2 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆111Updated 4 months ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 4 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆78Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Updated 4 months ago
- Official codes for ConMIM (ICLR 2023)☆60Updated 2 years ago
- Official Code Release for Container: Context Aggregation Network☆46Updated 3 years ago
- UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning☆55Updated 3 years ago
- ☆109Updated 3 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆72Updated 2 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆69Updated 3 years ago
- Batch Normalization with Enhanced Linear Transformation☆53Updated last year
- ECCV 2022, Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Updated 2 years ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆80Updated 2 years ago
- ReSSL: Relational Self-Supervised Learning with Weak Augmentation☆58Updated 3 years ago
- The implementation of our paper: Towards Robust Vision Transformer (CVPR2022)☆142Updated 2 years ago
- Implementation of momentum^2 teacher☆121Updated 4 years ago
- ☆44Updated 2 years ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆90Updated last year
- ☆59Updated 3 years ago
- Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better perfo…☆90Updated 2 years ago
- This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022☆175Updated 2 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆18Updated 2 years ago
- Bag of Instances Aggregation Boosts Self-supervised Distillation (ICLR 2022)☆33Updated 3 years ago