[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
☆916Jul 22, 2025Updated 10 months ago
Alternatives and similar repositories for FasterViT
Users that are interested in FasterViT are comparing it to the libraries listed below.
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆450Dec 22, 2023Updated 2 years ago
- Efficient vision foundation models for high-resolution generation and perception.☆3,320Sep 5, 2025Updated 9 months ago
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural R…☆2,007Nov 30, 2023Updated 2 years ago
- RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything☆1,091Jun 14, 2024Updated last year
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆1,065Mar 2, 2024Updated 2 years ago
- [CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone☆2,172Mar 11, 2026Updated 2 months ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,179May 15, 2024Updated 2 years ago
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]☆1,111Aug 13, 2023Updated 2 years ago
- Code release for ConvNeXt V2 model☆2,038Aug 14, 2024Updated last year
- This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone" CVPR 2023.☆824Jul 25, 2022Updated 3 years ago
- Official repository for "AM-RADIO: Reduce All Domains Into One"☆1,857May 29, 2026Updated last week
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,944Jun 3, 2026Updated last week
- This is a collection of our NAS and Vision Transformer work.☆1,836Jul 25, 2024Updated last year
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,364Jun 1, 2024Updated 2 years ago
- [NeurIPS 2022] Official code for "Focal Modulation Networks"☆751Nov 7, 2023Updated 2 years ago
- [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation☆1,722Oct 3, 2024Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024)☆497Jun 1, 2024Updated 2 years ago
- [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆3,879Feb 13, 2025Updated last year
- CVNets: A library for training computer vision networks☆1,974Oct 30, 2023Updated 2 years ago
- Fast Segment Anything☆8,360Jul 30, 2024Updated last year
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen…☆499Jun 2, 2023Updated 3 years ago
- This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!☆5,785May 5, 2026Updated last month
- Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024☆1,645Jun 28, 2024Updated last year
- Code release for ConvNeXt model☆6,384Jan 8, 2023Updated 3 years ago
- Official PyTorch implementation of Fully Attentional Networks☆483Mar 31, 2023Updated 3 years ago
- ☆589Jul 23, 2023Updated 2 years ago
- [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"☆2,807Jul 31, 2024Updated last year
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,868Jun 3, 2026Updated last week
- [CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥…☆5,267May 20, 2026Updated 3 weeks ago
- [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions☆2,832Mar 25, 2025Updated last year
- A method to increase the speed and lower the memory footprint of existing vision transformers.☆1,200Jun 17, 2024Updated last year
- [ICCV - 2023] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applic…☆318Jul 18, 2025Updated 10 months ago
- EVA Series: Visual Representation Fantasies from BAAI☆2,683Aug 1, 2024Updated last year
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"☆209Jun 1, 2023Updated 3 years ago
- [CVPR 2024] Deformable Convolution v4☆735May 17, 2024Updated 2 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,344Oct 5, 2023Updated 2 years ago
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions☆1,491Jun 3, 2025Updated last year
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆5,040Jul 30, 2024Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,607May 31, 2024Updated 2 years ago