apple / ml-autofocusformerLinks
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
☆125Updated 2 years ago
Alternatives and similar repositories for ml-autofocusformer
Users that are interested in ml-autofocusformer are comparing it to the libraries listed below
Sorting:
- Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]☆278Updated 2 years ago
- [ICLR 2023 & IJCV 2025] SeaFormer: Squeeze-enhanced Axial Transformer☆338Updated 6 months ago
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio…☆292Updated last month
- This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".☆69Updated 2 years ago
- [CVPR2023] FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation☆207Updated last year
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆103Updated 2 months ago
- using clip and sam to segment any instance you specify with text prompt of any instance names☆177Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR"☆202Updated 2 years ago
- ☆132Updated 2 years ago
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆401Updated 2 years ago
- Code and models for mobile-former☆130Updated 3 years ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆259Updated 4 months ago
- ☆121Updated 2 years ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆321Updated 7 months ago
- Implementation of paper - DEYO: DETR with YOLO for End-to-End Object Detection☆96Updated last year
- [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap…☆388Updated 2 years ago
- Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such…☆249Updated 2 years ago
- Collect some resource about Segment Anything (SAM), including the latest papers and demo☆122Updated 2 years ago
- ☆170Updated last month
- yolov8 model with SAM meta☆140Updated last year
- [NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation☆253Updated last year
- Training and testing of DINOv2 for segmentation downstream☆41Updated 6 months ago
- [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"☆232Updated last year
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆263Updated last year
- SAM (Segment Anything Model) for generating rotated bounding boxes with MMRotate, which is a comparison method of H2RBox-v2.☆191Updated 2 years ago
- Zero-label image classification via OpenCLIP knowledge distillation☆134Updated last year
- [CVPR 2024] Deformable Convolution v4☆662Updated last year
- SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)☆241Updated 3 weeks ago
- FreeSOLO for unsupervised instance segmentation, CVPR 2022☆316Updated 2 years ago
- Lite Vision Transformer (CVPR 2022)☆144Updated 2 years ago