[ICCV 2023] You Only Look at One Partial Sequence
☆343Oct 21, 2023Updated 2 years ago
Alternatives and similar repositories for MIMDet
Users that are interested in MIMDet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆523Mar 14, 2023Updated 3 years ago
- Featurized Query R-CNN☆45Jun 17, 2022Updated 3 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,029Sep 29, 2022Updated 3 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆34Apr 18, 2022Updated 3 years ago
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training☆400Oct 23, 2024Updated last year
- [NeurIPS 2021] You Only Look at One Sequence☆907May 4, 2022Updated 3 years ago
- (CVPR2023)Dense Distinct Query for End-to-End Object Detection☆264May 24, 2023Updated 2 years ago
- Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral☆239Mar 4, 2023Updated 3 years ago
- ☆79Jun 23, 2022Updated 3 years ago
- ☆318Oct 26, 2022Updated 3 years ago
- "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.☆200Apr 17, 2022Updated 3 years ago
- EVA Series: Visual Representation Fantasies from BAAI☆2,652Aug 1, 2024Updated last year
- ☆17Nov 17, 2023Updated 2 years ago
- [ICCV 2021] Instances as Queries☆414Oct 20, 2023Updated 2 years ago
- This repository is an official implementation of the ICCV 2021 paper "Conditional DETR for Fast Training Convergence". (https://arxiv.org…☆400May 22, 2023Updated 2 years ago
- Reading list for research topics in Masked Image Modeling☆335Dec 3, 2024Updated last year
- [CVPR 2022 Oral] Official implementation of DN-DETR☆603Dec 20, 2023Updated 2 years ago
- iBOT : Image BERT Pre-Training with Online Tokenizer (ICLR 2022)☆767Apr 14, 2022Updated 3 years ago
- Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"☆579Apr 24, 2022Updated 3 years ago
- ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)☆209Apr 18, 2024Updated last year
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆378Sep 16, 2022Updated 3 years ago
- Anytime Dense Prediction with Confidence Adaptivity (ICLR 2022)☆51Aug 23, 2024Updated last year
- Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.☆565Dec 26, 2023Updated 2 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆617Dec 13, 2022Updated 3 years ago
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions☆1,476Jun 3, 2025Updated 9 months ago
- MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)☆80Oct 20, 2022Updated 3 years ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆97Nov 2, 2022Updated 3 years ago
- [CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal☆1,347Apr 30, 2023Updated 2 years ago
- [CVPR 2021] Instance Localization for Self-supervised Detection Pretraining☆145Jun 8, 2021Updated 4 years ago
- FreeSOLO for unsupervised instance segmentation, CVPR 2022☆317Jan 16, 2023Updated 3 years ago
- [CVPR 2022 Oral] AdaMixer: A Fast-Converging Query-Based Object Detector☆237Aug 17, 2022Updated 3 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆114Mar 2, 2026Updated 3 weeks ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,996Mar 21, 2024Updated 2 years ago
- [CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation☆155Aug 19, 2023Updated 2 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆145Jul 2, 2023Updated 2 years ago
- Official implementation of the CVPR 2022 paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".☆338Jul 18, 2023Updated 2 years ago