Sindhu-Hegde / pseudo-visual-speech-denoisingView external linksLinks
Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
☆108May 27, 2024Updated last year
Alternatives and similar repositories for pseudo-visual-speech-denoising
Users that are interested in pseudo-visual-speech-denoising are comparing it to the libraries listed below
Sorting:
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21May 21, 2021Updated 4 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Dec 8, 2022Updated 3 years ago
- Official implementation of Transpotter, published in BMVC 2021☆16Aug 6, 2022Updated 3 years ago
- ☆11May 7, 2022Updated 3 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated last year
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆14Jul 2, 2020Updated 5 years ago
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆613Jun 22, 2025Updated 7 months ago
- Export yolov5 model to run on cpu using tflite☆14Aug 12, 2021Updated 4 years ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆43Mar 23, 2022Updated 3 years ago
- Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval"☆15Jul 25, 2024Updated last year
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆219Apr 16, 2023Updated 2 years ago
- This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech S…☆713Jul 6, 2023Updated 2 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16May 14, 2022Updated 3 years ago
- The official implementation of OpenSR (ACL2023 Oral)☆16Nov 29, 2023Updated 2 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆73Apr 7, 2024Updated last year
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- Keras framework for speech enhancement using relativistic GANs☆52Jun 24, 2020Updated 5 years ago
- MediaPipeを用いたハンドジェスチャーによる簡単なマウス操作を行うプログラムです。☆12Mar 17, 2021Updated 4 years ago
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆59Jul 26, 2022Updated 3 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- 処理の検証や比較検討での用途を想定したノードエディターベースの画像処理アプリ☆11Mar 5, 2023Updated 2 years ago
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆12Jun 22, 2023Updated 2 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆15Aug 26, 2020Updated 5 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆10Apr 22, 2021Updated 4 years ago
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Apr 16, 2024Updated last year
- ☆24Mar 30, 2024Updated last year
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆46Sep 1, 2024Updated last year
- Conditional StyleGAN2 for virtual try on guided by densepose☆11Sep 20, 2021Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- Real-time version of sound_classification_demo in OpenVINO toolkit. Captures audio from microphone, do classification, and display result…☆11Jul 28, 2021Updated 4 years ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 8 years ago