ViTAE-Transformer / ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"
☆533Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ViTDet
- The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vis…☆252Updated last year
- The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"☆157Updated last year
- The official repo for [TGRS'22] "Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model"☆420Updated 11 months ago
- [CVPR 2023] Referring Image Matting☆203Updated last year
- A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, …☆231Updated last year
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆795Updated 7 months ago
- [ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"☆519Updated last year
- A comprehensive list [SAMRS@NeurIPS'23, RVSA@TGRS'22, RSP@TGRS'22] of our research works related to remote sensing, including papers, cod…☆462Updated 5 months ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆927Updated 2 years ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)☆1,354Updated 2 years ago
- [NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation☆469Updated 2 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆547Updated last year
- [ICCV 2023] You Only Look at One Partial Sequence☆336Updated last year
- [ACM MM 2021] Privacy-Preserving Portrait Matting☆289Updated last year
- [CVPR 2022 Oral] Official implementation of DN-DETR☆543Updated 11 months ago
- [ICLR2022] official implementation of UniFormer☆827Updated 7 months ago
- Official PyTorch implementation of Fully Attentional Networks☆467Updated last year
- This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org…☆368Updated last year
- ☆244Updated last year
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆385Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆238Updated last year
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions☆1,262Updated 8 months ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆484Updated last year
- Semi-Supervised Learning, Object Detection, ICCV2021☆904Updated 5 months ago
- Official MegEngine implementation of RepLKNet☆268Updated 2 years ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆421Updated 5 months ago
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆870Updated 6 months ago
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆330Updated 9 months ago
- Official Pytorch implementations for "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)☆789Updated last year
- [ NeurIPS2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Prediction".☆492Updated 2 years ago