xvjiarui / GroupViT
GroupViT: Semantic Segmentation Emerges from Text Supervision
☆25Updated 2 years ago
Alternatives and similar repositories for GroupViT:
Users that are interested in GroupViT are comparing it to the libraries listed below
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆67Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim)☆89Updated 3 years ago
- ☆57Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆109Updated 2 months ago
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort…☆120Updated last year
- [CVPRW'23] The official PyTorch implementation of NamedMask☆23Updated last year
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆99Updated 2 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆84Updated 2 years ago
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆88Updated 3 years ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh…☆24Updated 3 years ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Updated 3 weeks ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆110Updated last year
- UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning☆54Updated 3 years ago
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer☆61Updated last year
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆62Updated 2 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆49Updated last month
- Bag of Instances Aggregation Boosts Self-supervised Distillation (ICLR 2022)☆33Updated 2 years ago
- Replication of Pix2Seq with Pretrained Model☆60Updated 3 years ago
- ☆64Updated last year
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆21Updated 2 weeks ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Updated 2 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)☆100Updated 2 years ago
- ☆16Updated last year
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago