The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16Apr 22, 2019Updated 6 years ago
Alternatives and similar repositories for CMHSE
Users that are interested in CMHSE are comparing it to the libraries listed below
Sorting:
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Apr 26, 2020Updated 5 years ago
- Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos☆87Nov 22, 2020Updated 5 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Jan 12, 2023Updated 3 years ago
- Official python implementation of R3-Transformer☆15Nov 30, 2020Updated 5 years ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Dec 22, 2022Updated 3 years ago
- Implementation for the journal paper "DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering" (Jianyu et al., IEEE Tran…☆18Jun 22, 2021Updated 4 years ago
- Condensed Movies Challenge 2021☆20Sep 21, 2022Updated 3 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"☆196Oct 31, 2020Updated 5 years ago
- Implementing ONNX runtime for android to run Segment Anything Model 2☆12Aug 1, 2025Updated 7 months ago
- ☆27Aug 16, 2022Updated 3 years ago
- ☆26Aug 4, 2020Updated 5 years ago
- Scripts of our CVPR'19 paper "Rethinking the Evaluation of Video Summaries"☆68Aug 24, 2021Updated 4 years ago
- Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching☆170Oct 12, 2020Updated 5 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code)☆29Apr 7, 2020Updated 5 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos☆71Sep 7, 2021Updated 4 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 5 years ago
- Moments Retrieval Project Webpage (temporal)☆31Jan 17, 2024Updated 2 years ago
- Implementation of Canonical Correlation Analysis Layer for Cross-Modality Retrieval.☆31Mar 8, 2018Updated 7 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Jun 29, 2020Updated 5 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Jun 19, 2019Updated 6 years ago
- This is the implementation of the paper Video Summarization by Learning from Unpaired Data(CVPR2019)☆37Sep 5, 2019Updated 6 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- ☆12May 26, 2022Updated 3 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Aug 29, 2019Updated 6 years ago
- a recommendation list of math courses for people with no math background.☆11Mar 2, 2021Updated 4 years ago
- Graph Convolutional Networks for Temporal Action Localization (ICCV2019)☆323Jul 4, 2020Updated 5 years ago
- MAC: Mining Activity Concepts for Language-based Temporal Localization☆36Nov 26, 2018Updated 7 years ago
- A curated list of grounding natural language in video and related area. :-)☆83Dec 16, 2019Updated 6 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆41Jun 29, 2022Updated 3 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- ☆10Nov 21, 2016Updated 9 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 3 years ago