ChenhongyiYang / GPViTLinks
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
☆101Updated 2 years ago
Alternatives and similar repositories for GPViT
Users that are interested in GPViT are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆114Updated 2 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆91Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".☆60Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 4 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆51Updated 9 months ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆131Updated last year
- code base for vision transformers☆36Updated 3 years ago
- Winning solution to the semantic segmentation task on Robust Vision Challenge - ECCV 2022☆28Updated 2 years ago
- Official Implementation of DE-DETR and DELA-DETR in "Towards Data-Efficient Detection Transformers"☆79Updated last year
- ☆72Updated 6 months ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆118Updated 3 years ago
- ☆59Updated 3 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆111Updated last month
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆65Updated 6 months ago
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆51Updated 3 years ago
- Code release for "Language-conditioned Detection Transformer"☆87Updated last year
- [CVPR2023] This is an official mmdet implementation of paper "DETRs with Hybrid Matching".☆51Updated 2 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆70Updated 3 months ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆94Updated 3 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Updated 2 years ago
- [ECCV2022] This is an official implementation of paper "RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentati…☆76Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆82Updated last year
- ☆16Updated 2 years ago
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year
- [CVPR 2023] Exploring High-Quality Pseudo Masks for Weakly Supervised Instance Segmentation☆79Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆81Updated 6 months ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆89Updated 4 years ago