BCV-Uniandes / PNG
☆62Updated 3 years ago
Alternatives and similar repositories for PNG:
Users that are interested in PNG are comparing it to the libraries listed below
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- ☆64Updated last year
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 3 years ago
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer☆61Updated last year
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆109Updated 4 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆21Updated 3 weeks ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆84Updated 2 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆64Updated 2 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p…☆128Updated 2 years ago
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers☆177Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆66Updated 2 years ago
- ☆81Updated 2 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Updated last year
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort…☆120Updated last year
- OVAD: Open-vocabulary Attribute Detection code☆29Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated 2 years ago
- ☆58Updated last year
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆65Updated 2 years ago
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆150Updated last year
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆57Updated 2 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆49Updated last month
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62Updated 2 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Updated 2 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆107Updated last year
- Official repository for the General Robust Image Task (GRIT) Benchmark☆51Updated last year
- ☆44Updated 3 years ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…☆73Updated 8 months ago
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.☆60Updated 4 years ago
- A Unified Framework for Video-Language Understanding☆56Updated last year