JacobLinCool / MPSENetLinks
Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.
☆21Updated last year
Alternatives and similar repositories for MPSENet
Users that are interested in MPSENet are comparing it to the libraries listed below
Sorting:
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 8 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 10 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- Viterbi decoding in PyTorch☆40Updated 4 months ago
- Official repository of Wavehax vocoder☆66Updated last month
- Unofficial implementation of wavenext vocoder☆55Updated last year
- ☆14Updated 5 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- Implementation of vocoders empowered with pytorch lightning☆18Updated 2 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Updated 3 months ago
- Kanade is a speech tokenizer that encodes speech into compact content tokens and global embeddings and decodes them back to mel spectrogr…☆36Updated 3 weeks ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 10 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Updated 4 months ago
- ☆44Updated last year
- ☆36Updated 3 weeks ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆50Updated this week
- Training code and dataset cleasing with Sidon☆73Updated 2 weeks ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 11 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 7 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆65Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated last year
- Prosody and Pronunciation Modification Network☆60Updated 8 months ago
- Collection of scripts from mHuBERT-147.☆32Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆16Updated 9 months ago
- ☆28Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated 2 years ago