hilab-open-source / WatchYourMouthLinks
Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024
☆15Updated 3 weeks ago
Alternatives and similar repositories for WatchYourMouth
Users that are interested in WatchYourMouth are comparing it to the libraries listed below
Sorting:
- ☆30Updated 4 months ago
- ☆69Updated last year
- ☆23Updated last year
- ☆59Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆86Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆52Updated 2 years ago
- Real-time binaural target sound extraction model.☆93Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- Implementation of Emo-StarGAN☆45Updated last year
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆57Updated 2 months ago
- ASLP Summer Inter@NPU☆12Updated last year
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆47Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆76Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 3 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆48Updated last year
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)☆70Updated last year
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆25Updated last year
- ☆68Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆50Updated 6 months ago
- ☆89Updated this week
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated 2 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆64Updated last year
- ☆122Updated 3 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- EMO-SUPERB submission☆47Updated 2 weeks ago
- ☆47Updated last year
- The Introduction of the OLKAVS Dataset☆33Updated last year
- ☆17Updated 11 months ago