apple / ml-autofocusformerLinks
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
☆123Updated 2 years ago
Alternatives and similar repositories for ml-autofocusformer
Users that are interested in ml-autofocusformer are comparing it to the libraries listed below
Sorting:
- Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]☆284Updated 2 years ago
- [ICLR 2023 & IJCV 2025] SeaFormer: Squeeze-enhanced Axial Transformer☆346Updated 9 months ago
- This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".☆69Updated 2 years ago
- ☆132Updated 2 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)☆205Updated 3 years ago
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆403Updated 3 years ago
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio…☆298Updated 4 months ago
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆113Updated 5 months ago
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"☆204Updated 2 years ago
- [CVPR2023] FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation☆212Updated last year
- [NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation☆254Updated 2 years ago
- Code and models for mobile-former☆131Updated 3 years ago
- ☆52Updated 2 years ago
- Unofficial edge detection implementation using the Automatic Mask Generation (AMG) of the Segment Anything Model (SAM).☆75Updated 5 months ago
- using clip and sam to segment any instance you specify with text prompt of any instance names☆182Updated 2 years ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆265Updated 7 months ago
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆441Updated last year
- ☆67Updated last year
- SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)☆249Updated 3 months ago
- Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such…☆263Updated 2 years ago
- The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"☆89Updated last year
- [CVPR 2024] Deformable Convolution v4☆686Updated last year
- [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap…☆401Updated 2 years ago
- Lite Vision Transformer (CVPR 2022)☆145Updated 3 years ago
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention☆887Updated 4 months ago
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)☆335Updated 11 months ago
- [NeurIPS 2022] Official code for "Focal Modulation Networks"☆742Updated 2 years ago
- ☆83Updated 2 years ago
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen…☆489Updated 2 years ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers…☆281Updated 2 years ago