BCV-Uniandes / PNG
☆62Updated 3 years ago
Alternatives and similar repositories for PNG:
Users that are interested in PNG are comparing it to the libraries listed below
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆110Updated 4 years ago
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer☆62Updated last year
- ☆64Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 3 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆84Updated 2 years ago
- ☆26Updated last year
- ☆83Updated 3 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆21Updated 2 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆50Updated 3 months ago
- ☆50Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Updated last year
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆108Updated last year
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆89Updated 2 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022)☆111Updated last month
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Updated last year
- ☆44Updated 3 years ago
- Localized Vision-Language Matching for Open-vocabulary Object Detection☆21Updated 2 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Updated 2 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62Updated 2 years ago
- ☆61Updated last year
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers☆178Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- A Unified Framework for Video-Language Understanding☆57Updated last year
- [CVPR 2022] Visual Abductive Reasoning☆122Updated 5 months ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p…☆130Updated 2 years ago