Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
☆108May 27, 2024Updated last year
Alternatives and similar repositories for pseudo-visual-speech-denoising
Users that are interested in pseudo-visual-speech-denoising are comparing it to the libraries listed below
Sorting:
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21May 21, 2021Updated 4 years ago
- Official implementation of Transpotter, published in BMVC 2021☆16Aug 6, 2022Updated 3 years ago
- Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval"☆15Jul 25, 2024Updated last year
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆613Jun 22, 2025Updated 8 months ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆47Sep 1, 2024Updated last year
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆15Aug 26, 2020Updated 5 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated 2 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆219Apr 16, 2023Updated 2 years ago
- This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech S…☆712Jul 6, 2023Updated 2 years ago
- Export yolov5 model to run on cpu using tflite☆14Aug 12, 2021Updated 4 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆74Apr 7, 2024Updated last year
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆12Jun 22, 2023Updated 2 years ago
- Keras framework for speech enhancement using relativistic GANs☆52Jun 24, 2020Updated 5 years ago
- ☆11May 7, 2022Updated 3 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- ObamaNet fork☆12Sep 16, 2019Updated 6 years ago
- ☆17Oct 15, 2021Updated 4 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆59Jul 26, 2022Updated 3 years ago
- MediaPipeを用いたハンドジェスチャーによる簡単なマウス操作を行うプログラムです。☆12Mar 17, 2021Updated 5 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker☆17Mar 11, 2022Updated 4 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Apr 16, 2024Updated last year
- 処理の検証や比較検討での用途を想定したノードエディターベースの画像処理アプリ☆11Mar 5, 2023Updated 3 years ago
- ☆38Jul 20, 2020Updated 5 years ago
- ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"☆100Feb 27, 2026Updated 2 weeks ago
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- 基于深度学习的语音增强、去混响☆100Jan 30, 2024Updated 2 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16May 14, 2022Updated 3 years ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆43Mar 23, 2022Updated 3 years ago
- ☆20Feb 27, 2018Updated 8 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- ☆10Apr 22, 2021Updated 4 years ago
- INCREASING COMPACTNESS OF DEEP LEARNING BASED SPEECH ENHANCEMENT MODELS WITH PARAMETER PRUNING AND QUANTIZATION TECHNIQUES☆15Oct 18, 2019Updated 6 years ago
- ☆208Mar 10, 2021Updated 5 years ago
- Learning Lip Sync of Obama from Speech Audio☆66Jul 29, 2020Updated 5 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Sep 22, 2020Updated 5 years ago