gaopengcuhk / Stable-Pix2Seq
A full-fledged version of Pix2Seq
☆236Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Stable-Pix2Seq
- Replication of Pix2Seq with Pretrained Model☆60Updated 3 years ago
- Unofficial implementation of Pix2SEQ☆165Updated 3 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆211Updated 2 years ago
- PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022☆160Updated 2 years ago
- A new framework for open-vocabulary object detection, based on maskrcnn-benchmark☆227Updated last year
- SeqTR: A Simple yet Universal Network for Visual Grounding☆131Updated 3 weeks ago
- ☆174Updated 2 years ago
- [NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning☆175Updated 3 years ago
- ☆168Updated 3 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆336Updated last year
- Open-vocabulary Semantic Segmentation☆166Updated last year
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated last year
- ☆77Updated 2 years ago
- ☆244Updated last year
- PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529☆158Updated 2 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆184Updated 8 months ago
- ☆267Updated last year
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆84Updated 2 years ago
- ☆165Updated 8 months ago
- ☆185Updated last year
- [CVPR 2021] Instance Localization for Self-supervised Detection Pretraining☆144Updated 3 years ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆79Updated 2 years ago
- [CVPR 2022 Oral] Crafting Better Contrastive Views for Siamese Representation Learning☆284Updated 2 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆175Updated last year
- Video Contrastive Learning with Global Context, ICCVW 2021☆158Updated 2 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆106Updated 4 years ago
- This is an implementation of Deformable-DETR☆46Updated 3 years ago
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design☆191Updated last year
- ☆124Updated 2 years ago
- ☆187Updated 2 years ago