jwehrmann / lmtd
Labeled Movie Trailer Dataset
☆16 · Updated 6 years ago
Alternatives and similar repositories for lmtd:
Users who are interested in lmtd are comparing it to the repositories listed below
- ☆22 · Updated last year
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers. ☆50 · Updated 2 years ago
- Code implementation for our ICPR 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings" ☆21 · Updated 3 years ago
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20. ☆51 · Updated last year
- ☆15 · Updated 4 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images ☆20 · Updated last year
- Code of CVPR2020 Paper "Searching for actions on the hyperbole" ☆13 · Updated 3 years ago
- 🔆 📝 A reading list focused on Multimodal Emotion Recognition (MER) 👂👄 👀 💬 ☆120 · Updated 4 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020) ☆90 · Updated 2 years ago
- The Evoked Expressions in Video dataset contains videos paired with the expected facial expressions over time exhibited by people reactin… ☆35 · Updated 2 years ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23… ☆56 · Updated 3 months ago
- ☆31 · Updated 3 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019) ☆25 · Updated 4 years ago
- ☆26 · Updated 3 years ago
- Research on improving text sentiment analysis with facial features extracted from video, using machine learning. ☆33 · Updated 7 years ago
- A repository for extracting CNN features from videos using PyTorch ☆69 · Updated 2 years ago
- ☆17 · Updated 3 years ago
- Use CLIP to represent video for Retrieval Task ☆69 · Updated 3 years ago
- PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral) ☆121 · Updated last year
- Human Emotion Understanding using multimodal dataset. ☆91 · Updated 4 years ago
- Humor Knowledge Enriched Transformer ☆28 · Updated 3 years ago
- A length-controllable and non-autoregressive image captioning model. ☆68 · Updated 3 years ago
- PyTorch implementation of the paper [CVPR 2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning ☆85 · Updated 3 years ago
- PyTorch code for S2IGAN ☆41 · Updated 4 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020) ☆128 · Updated 3 years ago
- Code repo for the EMOTIC dataset ☆119 · Updated 2 years ago
- PyTorch implementation of the ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos" ☆39 · Updated 4 years ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition ☆26 · Updated last month
- Using deep recurrent networks to recognize horses' pain expressions in video. ☆27 · Updated 2 years ago