JacobLinCool / MPSENetLinks
Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.
☆21Updated last year
Alternatives and similar repositories for MPSENet
Users that are interested in MPSENet are comparing it to the libraries listed below
Sorting:
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Updated 4 months ago
- Official repository of Wavehax vocoder☆66Updated last month
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 10 months ago
- Viterbi decoding in PyTorch☆40Updated 4 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 8 months ago
- Implementation of vocoders empowered with pytorch lightning☆18Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- ☆44Updated last year
- source code of EfficientTTS 2☆20Updated last year
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- Sequence alignement methods with helpers for PyTorch.☆24Updated 3 years ago
- ☆28Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 10 months ago
- Training code and dataset cleasing with Sidon☆75Updated 3 weeks ago
- Unofficial implementation of wavenext vocoder☆56Updated last year
- ☆19Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆66Updated last year
- ☆14Updated 6 months ago
- ☆16Updated 9 months ago
- ☆26Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Updated 3 months ago
- ☆52Updated 7 months ago
- Balanced Error Rate for Speaker Diarization☆33Updated 2 years ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆51Updated this week
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated 2 years ago
- ☆70Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Updated 4 months ago