ChenhongyiYang / GPViT
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
☆101Updated 2 years ago
Alternatives and similar repositories for GPViT
Users who are interested in GPViT are comparing it to the repositories listed below
- PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411☆114Updated 2 years ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".☆60Updated 2 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆91Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- Official PyTorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆51Updated 8 months ago
- Winning solution to the semantic segmentation task on Robust Vision Challenge - ECCV 2022☆28Updated 2 years ago
- ☆72Updated 6 months ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 4 months ago
- Official Implementation of DE-DETR and DELA-DETR in "Towards Data-Efficient Detection Transformers"☆79Updated last year
- ☆59Updated 3 years ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆119Updated 3 years ago
- [ECCV2022] This is an official implementation of paper "RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentati…☆75Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689☆52Updated 3 years ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆131Updated last year
- Code release for "Language-conditioned Detection Transformer"☆87Updated last year
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆111Updated 2 weeks ago
- [CVPR2023] This is an official mmdet implementation of paper "DETRs with Hybrid Matching".☆51Updated 2 years ago
- Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"☆72Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆65Updated 5 months ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆94Updated 3 years ago
- ☆60Updated 2 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆144Updated 2 years ago
- [CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance Segmentation☆79Updated 2 years ago
- ☆16Updated 2 years ago
- a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.☆75Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆80Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆80Updated 5 months ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year