ChuanyangZheng / iFormer
Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]
☆42Updated last month
Alternatives and similar repositories for iFormer:
Users that are interested in iFormer are comparing it to the libraries listed below
- ☆26Updated last month
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆39Updated 3 months ago
- Open-Vocabulary Panoptic Segmentation☆23Updated 8 months ago
- ☆28Updated 3 months ago
- ☆30Updated last year
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated 11 months ago
- The official implementation of AnySR.☆42Updated 9 months ago
- Official Implementation of OneNet☆16Updated 5 months ago
- ☆28Updated last year
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆24Updated last year
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆40Updated 6 months ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆15Updated last month
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆31Updated 5 months ago
- [ECCV 2022] EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers☆13Updated 2 years ago
- ☆32Updated 2 years ago
- [ICLR 2025] Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"☆51Updated 3 months ago
- PyTorch re-implementation of Hierarchical Normalization for Robust Monocular Depth Estimation☆18Updated 2 years ago
- ☆11Updated 5 months ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆33Updated last year
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023☆48Updated 10 months ago
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated 10 months ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆19Updated 5 months ago
- ☆43Updated 4 months ago
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆98Updated last year
- Code for Learning to Zoom and Unzoom (CVPR 2023)☆47Updated last year
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation☆51Updated 2 months ago
- ☆67Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆77Updated last month