jwehrmann / lmtd
Labeled Movie Trailer Dataset
☆16Updated 7 years ago
Alternatives and similar repositories for lmtd:
Users who are interested in lmtd are comparing it to the repositories listed below
- Code implementation for our ICPR 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21Updated 3 years ago
- A repository for extracting CNN features from videos using PyTorch☆69Updated 2 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆21Updated 5 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆56Updated 4 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆50Updated 2 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆90Updated 2 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆129Updated 3 years ago
- Implementations of Transformers for Video☆23Updated 3 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆44Updated 9 months ago
- A Video Summarization framework for implementation and benchmark of Deep Learning models☆34Updated 6 months ago
- PyTorch implementation of the CVPR 2021 paper "Distilling Audio-Visual Knowledge by Compositional Contrastive Learning"☆85Updated 3 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆25Updated 3 years ago
- menovideo: pytorch library for video action recognition and video understanding☆28Updated 3 years ago
- Localized Narratives☆82Updated 3 years ago
- Audio Visual Instance Discrimination with Cross-Modal Agreement☆128Updated 3 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated last year
- Video classification tools using 3D ResNet☆23Updated 7 years ago
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- A non-JIT implementation / replication of OpenAI's CLIP in PyTorch☆34Updated 4 years ago
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆153Updated 3 years ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆113Updated 4 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Updated 4 years ago
- Rank-aware Attention Network from 'The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos'☆28Updated 3 years ago
- 🔆 📝 A reading list focused on Multimodal Emotion Recognition (MER) 👂👄 👀 💬☆121Updated 4 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆65Updated 6 years ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆27Updated 3 months ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆65Updated last year