baopj / DenseEventsGroundingLinks
☆17Updated last year
Alternatives and similar repositories for DenseEventsGrounding
Users that are interested in DenseEventsGrounding are comparing it to the libraries listed below
Sorting:
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆76Updated last year
 - ☆12Updated 2 years ago
 - End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Updated 4 years ago
 - Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)☆90Updated last year
 - Official Code of ECCV 2022 paper MS-CLIP☆90Updated 3 years ago
 - Sapsucker Woods 60 Audiovisual Dataset☆17Updated 3 years ago
 - ☆33Updated last year
 - ☆36Updated 4 years ago
 - Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆26Updated 3 years ago
 - [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Updated 3 years ago
 - [CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation☆19Updated 4 years ago
 - PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Updated 4 years ago
 - [CVPR 2022] Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging☆49Updated 2 years ago
 - Compress conventional Vision-Language Pre-training data☆52Updated 2 years ago
 - Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆48Updated 2 years ago
 - ☆73Updated 3 years ago
 - ☆78Updated 3 years ago
 - This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and …☆39Updated 2 years ago
 - A Unified Framework for Video-Language Understanding☆60Updated 2 years ago
 - This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆43Updated 2 years ago
 - This repository provides the dataset introduced by the paper "Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentenc…☆67Updated 5 years ago
 - A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆28Updated 3 years ago
 - Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020.☆22Updated 3 years ago
 - [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Updated 3 years ago
 - This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"☆24Updated last month
 - This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆29Updated last year
 - ☆11Updated 10 months ago
 - [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆108Updated last year
 - Vision-Language Pretraining & Efficient Transformer Papers.☆15Updated 3 years ago
 - ☆31Updated 3 years ago