THU-MIG / RepViT
RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything
☆916Updated 9 months ago
Alternatives and similar repositories for RepViT:
Users that are interested in RepViT are comparing it to the libraries listed below
- [CVPR 2024] Deformable Convolution v4☆606Updated 10 months ago
- [CVPR 2023] Code for PConv and FasterNet☆743Updated last year
- [CVPR 2024] Code release for TransNeXt model☆495Updated 9 months ago
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention☆824Updated last week
- [ICLR 2023 & IJCV 2025] SeaFormer: Squeeze-enhanced Axial Transformer☆317Updated last month
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]☆1,028Updated last year
- [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"☆535Updated last year
- [CVPR 2024] Rewrite the Stars☆360Updated 10 months ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆455Updated 9 months ago
- ☆822Updated last year
- [ICCV 2023] DETRs with Collaborative Hybrid Assignments Training☆1,150Updated 2 months ago
- Code release for ConvNeXt V2 model☆1,666Updated 7 months ago
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone☆1,154Updated this week
- the official pytorch implementation of “Mamba-YOLO:SSMs-based for Object Detection”☆369Updated 3 months ago
- [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme…☆1,287Updated last year
- (CVPR2024)RMT: Retentive Networks Meet Vision Transformer☆342Updated 7 months ago
- Official repository of Agent Attention (ECCV2024)☆602Updated 4 months ago
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions☆1,342Updated last year
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)☆285Updated 3 months ago
- Official repository of FLatten Transformer (ICCV2023)☆420Updated 4 months ago
- Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".☆563Updated last year
- Efficient vision foundation models for high-resolution generation and perception.☆2,748Updated 2 months ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆274Updated 2 months ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆834Updated 11 months ago
- This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".☆294Updated last month
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio…☆274Updated last year
- [ICCV 2023] Official PyTorch implementation of "Rethinking Mobile Block for Efficient Attention-based Models"☆239Updated last year
- [Official Repo] Visual Mamba: A Survey and New Outlooks☆631Updated last month
- [CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition☆976Updated 5 months ago
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆898Updated 11 months ago