jwehrmann / lmtdLinks
Labeled Movie Trailer Dataset
☆16Updated 7 years ago
Alternatives and similar repositories for lmtd
Users that are interested in lmtd are comparing it to the libraries listed below
Sorting:
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21Updated 4 years ago
- ☆22Updated last year
- Code for the paper 'Video Gesture Analysis for Autism Spectrum Disorder Detection', ICPR 2018☆20Updated 6 years ago
- Code for the AVLnet (Interspeech 2021) and Cascaded Multilingual (Interspeech 2021) papers.☆52Updated 3 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆130Updated 3 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆57Updated 4 years ago
- Pytorch Code for S2IGAN☆41Updated 4 years ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 3 years ago
- Use CLIP to represent video for Retrieval Task☆70Updated 4 years ago
- A repository for extract CNN features from videos using pytorch☆70Updated 2 years ago
- Implementations of Transformers for Video☆23Updated 4 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆27Updated 5 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Updated 2 years ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆113Updated 4 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated 2 years ago
- The Evoked Expressions in Video dataset contains videos paired with the expected facial expressions over time exhibited by people reactin…☆38Updated 3 years ago
- ☆16Updated 6 years ago
- Code repo for the EMOTIC dataset☆127Updated last month
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 4 years ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo☆35Updated 2 years ago
- Pytorch implementation of audio-visual fusion video captioning model☆27Updated 6 years ago
- ☆23Updated 3 years ago
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆153Updated 3 years ago
- Humor Knowledge Enriched Transformer☆30Updated 3 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)☆90Updated 2 years ago
- ☆14Updated last year
- Easy to use video deep features extractor☆319Updated 5 years ago
- This repo covers the implementation for Labelling unlabelled videos from scratch with multi-modal self-supervision, which learns clusters…☆116Updated 4 years ago
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018☆185Updated 4 years ago
- 5th Place Solution to 3rd YouTube-8M Video Understanding Challenge (Last Top GB Model)☆13Updated 5 years ago