yj-yu / CiSIN
Character Grounding and Re-Identification in Story of Videos and Text Descriptions
☆10Updated 3 years ago
Related projects: ⓘ
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 3 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 2 years ago
- ☆19Updated last year
- ☆11Updated 4 years ago
- ☆9Updated last year
- ☆41Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 2 years ago
- ☆35Updated 11 months ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Updated 2 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆26Updated 2 years ago
- ☆31Updated 5 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Updated last year
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆22Updated 2 years ago
- sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-StorytellingThe implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling☆12Updated 3 years ago
- [ECCV'22 Poster] Explicit Image Caption Editing☆21Updated last year
- Learning Representational Invariances for Data-Efficient Action Recognition☆32Updated 2 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆39Updated last year
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆22Updated last year
- ☆29Updated 11 months ago
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆16Updated 2 years ago
- RG-UNIT, ACM MM 2020.☆11Updated 3 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆32Updated last year
- CoCon: Cooperative Contrastive Learning☆20Updated last year
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆33Updated 2 years ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆19Updated 2 months ago
- ☆31Updated 3 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Updated 2 years ago
- ☆21Updated 3 years ago
- ☆25Updated last year
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Updated last year