apple / ml-autofocusformerLinks
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
☆123Updated 2 years ago
Alternatives and similar repositories for ml-autofocusformer
Users that are interested in ml-autofocusformer are comparing it to the libraries listed below
Sorting:
- [ICLR 2023 & IJCV 2025] SeaFormer: Squeeze-enhanced Axial Transformer☆351Updated 11 months ago
- Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]☆286Updated 2 years ago
- This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".☆69Updated 2 years ago
- [CVPR2023] FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation☆214Updated last year
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆403Updated 3 years ago
- Code and models for mobile-former☆131Updated 3 years ago
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio…☆310Updated 6 months ago
- [NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation☆254Updated 2 years ago
- ☆133Updated 3 years ago
- using clip and sam to segment any instance you specify with text prompt of any instance names☆184Updated 2 years ago
- [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap…☆409Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"☆208Updated 2 years ago
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆114Updated last month
- yolov8 model with SAM meta☆142Updated 2 years ago
- [CVPR 2024 Workshops] SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusi…☆71Updated last year
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆268Updated 9 months ago
- FreeSOLO for unsupervised instance segmentation, CVPR 2022☆318Updated 3 years ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆497Updated last year
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers…☆284Updated 2 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)☆205Updated 3 years ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆338Updated last year
- The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"☆89Updated 2 years ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆61Updated last year
- [CVPR 2024] Deformable Convolution v4☆702Updated last year
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆48Updated last year
- SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)☆257Updated 5 months ago
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆265Updated 2 years ago
- Lite Vision Transformer (CVPR 2022)☆144Updated 3 years ago
- Training and testing of DINOv2 for segmentation downstream☆44Updated 11 months ago
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention☆902Updated 6 months ago