Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for DVDialogues
Users that are interested in DVDialogues are comparing it to the libraries listed below
Sorting:
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Oct 24, 2021Updated 4 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Aug 6, 2020Updated 5 years ago
- Implementation of the AAAI-21 Workshop on Scientific Document Understanding paper "A Paragraph-level Multi-task Learning Model for Scient…☆15Oct 9, 2023Updated 2 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Aug 25, 2020Updated 5 years ago
- Code for the paper "A Divide-and-Conquer Approach to the Summarization of Long Documents"☆18Jun 8, 2021Updated 4 years ago
- Multi-faceted Video Moment Localizer☆17Jun 19, 2020Updated 5 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Apr 15, 2022Updated 3 years ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems.☆27Aug 12, 2022Updated 3 years ago
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 2 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Oct 17, 2022Updated 3 years ago
- ☆27May 4, 2020Updated 5 years ago
- Code for "Compositional Video Synthesis with Action Graphs", Bar & Herzig et al., ICML 2021☆32Nov 22, 2022Updated 3 years ago
- ROCK model for Knowledge-Based VQA in Videos☆31Oct 19, 2020Updated 5 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Weakly Supervised Temporal Action Localization Using Deep Metric Learning☆27Mar 19, 2020Updated 5 years ago
- Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021☆43May 24, 2024Updated last year
- A PyTorch implementation of EmpiricalMVM☆41Dec 18, 2023Updated 2 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- ☆11May 24, 2024Updated last year
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding☆11May 23, 2024Updated last year
- ☆11Dec 20, 2023Updated 2 years ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- [NeurIPS'25 Spotlight] This is the official codebase for the paper: STAR: A Benchmark for Astronomical Star Fields Super-Resolution☆15Oct 9, 2025Updated 4 months ago
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Aug 5, 2022Updated 3 years ago
- MAC: Mining Activity Concepts for Language-based Temporal Localization☆36Nov 26, 2018Updated 7 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time☆46Jun 11, 2024Updated last year
- ☆10Sep 7, 2022Updated 3 years ago
- R functions and datasets related to the mapping of text to the United Nations 17 Sustainable Development Goals (SDGs).☆12May 12, 2022Updated 3 years ago
- ☆10Jun 19, 2024Updated last year
- ☆10Nov 2, 2023Updated 2 years ago
- a close enough approximation of the shadertoy framework☆12Jul 2, 2020Updated 5 years ago
- ☆14Jan 5, 2024Updated 2 years ago
- Cell2location paper - Comprehensive mapping of tissue cell architecture via integrated single cell and spatial transcriptomics☆15Nov 26, 2022Updated 3 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- Code for "Learning Harmonic Molecular Representations on Riemannian Manifold", ICLR, 2023☆10Mar 23, 2023Updated 2 years ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 8 months ago