LeapLabTHU / EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
☆221Updated 8 months ago
Alternatives and similar repositories for EfficientTrain
Users that are interested in EfficientTrain are comparing it to the libraries listed below
Sorting:
- Open source implementation of "Vision Transformers Need Registers"☆176Updated last month
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆77Updated last month
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆156Updated 7 months ago
- [CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"☆205Updated 11 months ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆106Updated last year
- [NeurIPS 2023] Rank-DETR for High Quality Object Detection☆91Updated last year
- ☆66Updated 2 months ago
- [CVPR 2023] Official repository of Generative Semantic Segmentation☆213Updated last year
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆211Updated last year
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆89Updated this week
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design☆197Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆109Updated last month
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆62Updated last year
- ☆65Updated 2 years ago
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆192Updated 9 months ago
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆113Updated 2 months ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆179Updated last month
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated last year
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆261Updated last week
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆187Updated last year
- ☆105Updated 11 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆77Updated 10 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆93Updated last month
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling☆93Updated 3 weeks ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆72Updated 4 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆121Updated last month
- [ICML 2025] Official Implementation for SimDINO/SimDINOv2☆127Updated last month
- [ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"☆346Updated 4 months ago
- ☆257Updated 2 years ago