apple / ml-autofocusformerLinks
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
☆125Updated 2 years ago
Alternatives and similar repositories for ml-autofocusformer
Users that are interested in ml-autofocusformer are comparing it to the libraries listed below
Sorting:
- Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]☆281Updated 2 years ago
- [ICLR 2023 & IJCV 2025] SeaFormer: Squeeze-enhanced Axial Transformer☆339Updated 7 months ago
- This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".☆69Updated 2 years ago
- [CVPR2023] FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation☆207Updated last year
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆402Updated 2 years ago
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio…☆294Updated 2 months ago
- Code and models for mobile-former☆130Updated 3 years ago
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆106Updated 3 months ago
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"☆202Updated 2 years ago
- [NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation☆254Updated last year
- FreeSOLO for unsupervised instance segmentation, CVPR 2022☆317Updated 2 years ago
- ☆168Updated last month
- ☆133Updated 2 years ago
- using clip and sam to segment any instance you specify with text prompt of any instance names☆177Updated 2 years ago
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆263Updated 2 years ago
- [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap…☆392Updated 2 years ago
- Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such…☆255Updated 2 years ago
- Collect some resource about Segment Anything (SAM), including the latest papers and demo☆125Updated 2 years ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆58Updated last year
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆262Updated 5 months ago
- ☆94Updated last year
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆325Updated 8 months ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers…☆278Updated 2 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)☆200Updated 3 years ago
- ☆52Updated 2 years ago
- [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"☆232Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024)☆489Updated last year
- This is a warehouse for MobileNetV4-Pytorch-model, can be used to train your image-datasets for vision tasks.☆212Updated 8 months ago
- SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)☆244Updated last month
- Detection Transformers with Assignment☆260Updated 2 years ago