kittenish / Frame-Transformer-Network
Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017
☆17Updated 7 years ago
Related projects:
- Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering☆24Updated 3 years ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019☆51Updated 4 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight)☆57Updated last year
- Dataset cleansing for Visual Genome☆30Updated 7 years ago
- Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contex…☆35Updated 5 years ago
- Rank-aware Attention Network from 'The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos'☆27Updated 3 years ago
- Source code for Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization☆22Updated 5 years ago
- Project page for "Visual Grounding in Video for Unsupervised Word Translation" CVPR 2020☆42Updated 4 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆60Updated last year
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆76Updated 5 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆21Updated 5 years ago
- ACM ICMR 2019 "Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention"☆36Updated 5 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Updated last year
- Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding☆4Updated 3 years ago
- Code for Learning to Learn Language from Narrated Video☆33Updated 11 months ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆32Updated 2 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆55Updated last year
- TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".☆40Updated 6 years ago
- [IJCAI 2018] Deep Reasoning with Knowledge Graph for Social Relationship Understanding.☆21Updated 2 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago
- Code for "Compositional Video Synthesis with Action Graphs", Bar & Herzig et al., ICML 2021☆30Updated last year
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆44Updated 2 months ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 3 years ago
- TVNet implemented in PyTorch☆45Updated 5 years ago