[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
☆906 · Jul 22, 2025 · Updated 7 months ago
Alternatives and similar repositories for FasterViT
Users who are interested in FasterViT are comparing it to the libraries listed below.
- Efficient vision foundation models for high-resolution generation and perception. ☆3,243 · Sep 5, 2025 · Updated 5 months ago
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers ☆446 · Dec 22, 2023 · Updated 2 years ago
- RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything ☆1,065 · Jun 14, 2024 · Updated last year
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural R… ☆1,989 · Nov 30, 2023 · Updated 2 years ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer. ☆1,055 · Mar 2, 2024 · Updated last year
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022 ☆1,174 · May 15, 2024 · Updated last year
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022] ☆1,109 · Aug 13, 2023 · Updated 2 years ago
- Code release for ConvNeXt V2 model ☆1,975 · Aug 14, 2024 · Updated last year
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone ☆2,034 · Feb 9, 2026 · Updated 2 weeks ago
- This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone" CVPR 2023. ☆817 · Jul 25, 2022 · Updated 3 years ago
- Official repository for "AM-RADIO: Reduce All Domains Into One" ☆1,665 · Feb 11, 2026 · Updated 2 weeks ago
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆12,427 · Updated this week
- This is a collection of our NAS and Vision Transformer work. ☆1,823 · Jul 25, 2024 · Updated last year
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral) ☆1,367 · Jun 1, 2024 · Updated last year
- [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation ☆1,701 · Oct 3, 2024 · Updated last year
- [NeurIPS 2022] Official code for "Focal Modulation Networks" ☆751 · Nov 7, 2023 · Updated 2 years ago
- Official PyTorch implementation of Fully Attentional Networks ☆482 · Mar 31, 2023 · Updated 2 years ago
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model ☆3,799 · Feb 13, 2025 · Updated last year
- CVNets: A library for training computer vision networks ☆1,967 · Oct 30, 2023 · Updated 2 years ago
- [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection" ☆2,754 · Jul 31, 2024 · Updated last year
- Code release for ConvNeXt model ☆6,300 · Jan 8, 2023 · Updated 3 years ago
- This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond! ☆5,631 · Dec 19, 2025 · Updated 2 months ago
- Fast Segment Anything ☆8,271 · Jul 30, 2024 · Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024) ☆495 · Jun 1, 2024 · Updated last year
- [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥… ☆4,892 · Dec 3, 2025 · Updated 2 months ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language ☆1,343 · Oct 5, 2023 · Updated 2 years ago
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio… ☆311 · Jul 18, 2025 · Updated 7 months ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆3,368 · May 19, 2025 · Updated 9 months ago
- A method to increase the speed and lower the memory footprint of existing vision transformers. ☆1,170 · Jun 17, 2024 · Updated last year
- EVA Series: Visual Representation Fantasies from BAAI ☆2,647 · Aug 1, 2024 · Updated last year
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen… ☆488 · Jun 2, 2023 · Updated 2 years ago
- Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024 ☆1,631 · Jun 28, 2024 · Updated last year
- [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions ☆2,793 · Mar 25, 2025 · Updated 11 months ago
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks. ☆2,276 · Sep 11, 2025 · Updated 5 months ago
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions ☆1,475 · Jun 3, 2025 · Updated 8 months ago
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆8,382 · May 31, 2024 · Updated last year
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites ☆5,013 · Jul 30, 2024 · Updated last year
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ☆36,397 · Updated this week
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR" ☆208 · Jun 1, 2023 · Updated 2 years ago