hustvl / MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
☆340Updated last year
Alternatives and similar repositories for MIMDet:
Users that are interested in MIMDet are comparing it to the libraries listed below
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design☆195Updated last year
- ☆250Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆243Updated 2 years ago
- This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org…☆372Updated last year
- [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"☆224Updated 10 months ago
- Detection Transformers with Assignment☆249Updated last year
- Dense Distinct Query for End-to-End Object Detection (CVPR2023)☆249Updated last year
- "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.☆198Updated 2 years ago
- [NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning☆176Updated 3 years ago
- [NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation☆471Updated 3 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆223Updated 2 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆263Updated last year
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training☆396Updated 4 months ago
- [ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".☆309Updated last year
- Unofficial implementation of Pix2SEQ☆165Updated 3 years ago
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning☆286Updated 2 years ago
- PyTorch Implementation of Sparse DETR☆166Updated last year
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆184Updated last year
- An official implementation of the Anchor DETR.☆347Updated 2 years ago
- A full-fledged version of Pix2Seq☆238Updated 3 years ago
- reproduction of semantic segmentation using masked autoencoder (mae)☆161Updated 3 years ago
- Open-vocabulary Semantic Segmentation☆169Updated last year
- [NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary …☆290Updated 2 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.☆155Updated 2 years ago
- Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)☆138Updated last year
- [CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector☆234Updated 2 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆427Updated last year
- [ICLR 2022] Official implementation of the paper "DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR"☆536Updated last year
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆184Updated 11 months ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆231Updated 3 years ago