The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16Apr 22, 2019Updated 7 years ago
Alternatives and similar repositories for CMHSE
Users that are interested in CMHSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos☆87Nov 22, 2020Updated 5 years ago
- Condensed Movies Challenge 2021☆20Sep 21, 2022Updated 3 years ago
- Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"☆198Oct 31, 2020Updated 5 years ago
- Official python implementation of R3-Transformer☆15Nov 30, 2020Updated 5 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Jul 20, 2020Updated 5 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- Implementation for the journal paper "DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering" (Jianyu et al., IEEE Tran…☆18Jun 22, 2021Updated 4 years ago
- Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"☆12Mar 26, 2026Updated last month
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Dec 22, 2022Updated 3 years ago
- Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)☆13Jul 25, 2024Updated last year
- ☆27Aug 16, 2022Updated 3 years ago
- ☆12Nov 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- ☆26Aug 4, 2020Updated 5 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- Implementation of Canonical Correlation Analysis Layer for Cross-Modality Retrieval.☆31Mar 8, 2018Updated 8 years ago
- A GCN based visual question generation model☆13Aug 21, 2019Updated 6 years ago
- Scripts of our CVPR'19 paper "Rethinking the Evaluation of Video Summaries"☆68Aug 24, 2021Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Aug 29, 2019Updated 6 years ago
- This is the implementation for the paper "Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval"☆14Dec 7, 2017Updated 8 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 6 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 7 years ago
- Graph Convolutional Networks for Temporal Action Localization (ICCV2019)☆323Jul 4, 2020Updated 5 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Jun 29, 2020Updated 5 years ago
- A curated list of grounding natural language in video and related area. :-)☆83Dec 16, 2019Updated 6 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Jun 19, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos☆71Sep 7, 2021Updated 4 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 6 years ago
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- ☆43Apr 25, 2019Updated 7 years ago
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- Moments Retrieval Project Webpage (temporal)☆31Jan 17, 2024Updated 2 years ago
- ☆12Mar 23, 2026Updated last month