baopj / DenseEventsGroundingLinks
☆17Updated last year
Alternatives and similar repositories for DenseEventsGrounding
Users that are interested in DenseEventsGrounding are comparing it to the libraries listed below
Sorting:
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆76Updated last year
- ☆12Updated 2 years ago
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Updated 3 years ago
- Sapsucker Woods 60 Audiovisual Dataset☆17Updated 2 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆90Updated 3 years ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Updated 3 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Updated 2 years ago
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆26Updated 2 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆48Updated 2 years ago
- ☆36Updated 4 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Updated 4 years ago
- ☆31Updated 3 years ago
- [CVPR 2022] Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging☆49Updated last year
- ☆77Updated 2 years ago
- This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and …☆38Updated 2 years ago
- [CVPR 2022] Visual Abductive Reasoning☆123Updated 11 months ago
- Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)☆89Updated last year
- Cross Modal Retrieval with Querybank Normalisation☆55Updated last year
- Compress conventional Vision-Language Pre-training data☆52Updated 2 years ago
- CVPR2022☆21Updated 3 years ago
- ☆33Updated last year
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆57Updated last year
- ☆43Updated 4 years ago
- Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020.☆22Updated 2 years ago
- A curated list of research papers in Referring Expression Comprehension (REC)☆45Updated 4 years ago
- ☆73Updated 3 years ago
- Placeholder for code of BSP.☆11Updated 4 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 3 years ago
- ☆26Updated 2 years ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆52Updated last year