Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15May 6, 2021Updated 4 years ago
Alternatives and similar repositories for CCN
Users that are interested in CCN are comparing it to the libraries listed below
Sorting:
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671☆18May 6, 2021Updated 4 years ago
- SILCO: Show a Few Images, Localize the Common Object(ICCV2019)☆17Oct 15, 2019Updated 6 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"☆45Mar 19, 2023Updated 2 years ago
- ☆15Aug 13, 2020Updated 5 years ago
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)☆469May 6, 2021Updated 4 years ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Nov 22, 2022Updated 3 years ago
- Code for the model "Heterogeneous Graph Learning for Visual Commonsense Reasoning (NeurlPS 2019)"☆47Jul 27, 2020Updated 5 years ago
- video captioning☆24Mar 14, 2019Updated 6 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆54Jul 9, 2021Updated 4 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 5 years ago
- Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA☆54Sep 13, 2021Updated 4 years ago
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Aug 14, 2020Updated 5 years ago
- Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)☆22Mar 25, 2019Updated 6 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- Code for ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long-oral]☆67Aug 3, 2023Updated 2 years ago
- ☆33Jul 8, 2018Updated 7 years ago
- Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"☆187Apr 15, 2021Updated 4 years ago
- Solving reinforcement learning tasks which require language and vision☆33Apr 4, 2023Updated 2 years ago
- Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"☆64Mar 24, 2023Updated 2 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Nov 24, 2021Updated 4 years ago
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos☆71Sep 7, 2021Updated 4 years ago
- ☆33Apr 20, 2018Updated 7 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Aug 29, 2019Updated 6 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- A TensorFlow implementation of the paper: HyperFace: A Deep Multi-task Learning Framework for Face Detection, Landmark Localization, Pos…☆10Dec 20, 2018Updated 7 years ago
- Deep Generative Model (Torch)☆11Apr 19, 2016Updated 9 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 7 months ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".☆38Oct 4, 2023Updated 2 years ago
- Modification of the original Mask/Faster R-CNN☆12Dec 13, 2020Updated 5 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- Contains code for Binary, Ternary, N-bit Quantized and Hybrid CNNs for low precision experiments.☆26Oct 30, 2018Updated 7 years ago
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆14Dec 4, 2025Updated 2 months ago
- ☆12Jan 31, 2024Updated 2 years ago
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020☆11Aug 28, 2020Updated 5 years ago
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- JEI 2015: Multifocus image fusion using phase congruency☆10Feb 24, 2017Updated 9 years ago
- Temporal and Causal Reasoning (dataset)☆10Apr 19, 2022Updated 3 years ago