nirat1606 / OADis
Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022
☆33Updated last year
Related projects ⓘ
Alternatives and complementary repositories for OADis
- ☆25Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated last year
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- ☆29Updated last year
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆40Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆22Updated 5 months ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆64Updated 2 years ago
- ☆58Updated 2 years ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆23Updated last year
- This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in th…☆63Updated 2 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Updated 2 years ago
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral)☆10Updated 2 years ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…☆71Updated 5 months ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13Updated last year
- Shared Attention for Multi-label Zero-shot Learning accepted @ CVPR20☆31Updated 2 years ago
- OVAD: Open-vocabulary Attribute Detection code☆28Updated last year
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)☆45Updated last year
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh…☆24Updated 3 years ago
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆42Updated 2 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Updated 10 months ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆24Updated 3 years ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆31Updated last year
- ☆58Updated last year
- ☆22Updated last year
- Temporal Alignment Representations with Contrastive Learning☆22Updated last year
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆55Updated 3 weeks ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization☆21Updated 3 months ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated last year