apple / ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
☆1,892Updated last year
Alternatives and similar repositories for ml-fastvit:
Users that are interested in ml-fastvit are comparing it to the libraries listed below
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆971Updated last year
- CVNets: A library for training computer vision networks☆1,856Updated last year
- Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised…☆2,961Updated 11 months ago
- Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).☆2,230Updated last year
- This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!☆5,130Updated 5 months ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,581Updated 9 months ago
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention☆844Updated last month
- EVA Series: Visual Representation Fantasies from BAAI☆2,468Updated 8 months ago
- Segment Anything in High Quality [NeurIPS 2023]☆3,880Updated 4 months ago
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"☆4,559Updated 8 months ago
- Efficient vision foundation models for high-resolution generation and perception.☆2,806Updated 2 weeks ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,315Updated 3 months ago
- This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinf…☆887Updated 4 months ago
- Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds☆1,574Updated 8 months ago
- This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone".☆763Updated 2 years ago
- 4M: Massively Multimodal Masked Modeling☆1,713Updated last month
- Segment Anything Labelling Tool☆1,038Updated last year
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert…☆1,420Updated last month
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,308Updated last year
- PyTorch code and models for the DINOv2 self-supervised learning method.☆10,294Updated 8 months ago
- Tracking Anything in High Quality☆748Updated last year
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆7,878Updated 8 months ago
- Painter & SegGPT Series: Vision Foundation Models from BAAI☆2,564Updated 4 months ago
- [ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)☆2,161Updated 2 years ago
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,267Updated 4 months ago
- Fast Segment Anything☆7,824Updated 8 months ago
- [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions☆2,632Updated 3 weeks ago
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]☆1,036Updated last year
- An Open-source Toolkit for LLM Development☆2,768Updated 3 months ago
- A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT☆741Updated last year