Hyu-Zhang / BiHGH
ACMMM 2022 Oral
☆10Updated 2 years ago
Alternatives and similar repositories for BiHGH:
Users that are interested in BiHGH are comparing it to the libraries listed below
- A Toolbox for MultiModal Recommendation. Integrating 10+ Models...☆460Updated 3 weeks ago
- Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)☆23Updated 9 months ago
- 前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。☆250Updated last year
- Code and dataset for CVPR 2021 paper "Personalized Outfit Recommendation with Learnable Anchors"☆15Updated 2 years ago
- ☆12Updated 3 years ago
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆134Updated last year
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval☆13Updated last year
- Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)☆27Updated 9 months ago
- Pytorch implementation of the paper 'Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Super…☆16Updated last year
- [CVPR 2022] A large-scale public benchmark dataset for video question-answering, especially about evidence and commonsense reasoning. The…☆67Updated last month
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)☆58Updated 3 years ago
- MMGCN: Multi-modal Graph Convolution Network forPersonalized Recommendation of Micro-video☆303Updated 3 years ago
- Towards Modality Generalization: A Benchmark and Prospective Analysis☆24Updated 2 months ago
- [ECCV 2024] Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning".☆28Updated 4 months ago
- Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval☆97Updated 3 years ago
- [CVPR25] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced evaluation mod…☆16Updated 3 weeks ago
- CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning☆63Updated last year
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Updated 2 years ago
- https://layer6ai-labs.github.io/xpool/☆124Updated last year
- Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…☆41Updated last year
- Learning Interactions and Relationships between Movie Characters (CVPR'20)☆21Updated 2 years ago
- paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), or Temporal Sentence Grounding in Videos (TSGV…☆31Updated 2 years ago
- Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".☆26Updated 4 months ago
- [arXiv22] Disentangled Representation Learning for Text-Video Retrieval☆94Updated 3 years ago
- ☆12Updated last year
- ☆16Updated 2 years ago
- ☆90Updated 2 years ago
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020)☆107Updated 3 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆14Updated last year
- ACM MULTIMEDIA CONFERENCE 2020☆11Updated 4 years ago