prajwalkr / transpellerLinks
Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.
☆12Updated 2 years ago
Alternatives and similar repositories for transpeller
Users that are interested in transpeller are comparing it to the libraries listed below
Sorting:
- PATS Dataset. Aligned Pose-Audio-Transcripts and Style for co-speech gesture research☆62Updated 2 years ago
- A curated list of awesome work on Sign Language Production☆61Updated 2 years ago
- ☆17Updated 4 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆197Updated 2 years ago
- This repository contains scripts to build Youtube Gesture Dataset.☆131Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆46Updated 2 years ago
- Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity (SIGGRAPH Asia 2020)☆273Updated 4 years ago
- You Said That?: Synthesising Talking Faces from Audio☆70Updated 7 years ago
- The official implementation for ICMI 2020 Best Paper Award "Gesticulator: A framework for semantically-aware speech-driven gesture gener…☆128Updated 2 years ago
- Official pytorch implementation for Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (CVPR 2022)☆124Updated last year
- Official Repository for the paper "No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures", Findin…☆20Updated 4 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆39Updated 3 years ago
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21Updated 4 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆72Updated last year
- Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)☆121Updated last year
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Updated 5 years ago
- TFDS data loaders for sign language datasets.☆103Updated last month
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆93Updated 5 months ago
- This repository provides scripts that can be used to visualize BVH files. These scripts were developed for the GENEA Challenge 2020, and …☆40Updated 2 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]☆194Updated 3 years ago
- Collection of useful FFMPEG commands for processing audio and video files.☆44Updated 6 years ago
- Implementation for the paper "Can Language Models Learn to Listen?"☆69Updated 2 years ago
- ☆20Updated 3 years ago
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆234Updated 3 years ago
- This is the official implementation for IVA'20 Best Paper Award paper "Let's Face It: Probabilistic Multi-modal Interlocutor-aware Gener…☆16Updated 2 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆165Updated 5 years ago
- Official code for the paper "Understanding Co-speech Gestures in-the-wild"☆20Updated 2 months ago
- MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]☆287Updated last year
- This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video…☆12Updated 3 years ago