JacobLinCool / MPSENet
Python package of MP-SENet, from the paper "Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement".
☆12 · Updated 4 months ago
Alternatives and similar repositories for MPSENet:
Users who are interested in MPSENet are comparing it to the repositories listed below.
- ☆11 · Updated 4 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆20 · Updated 2 weeks ago
- ☆10 · Updated 3 months ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆12 · Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆11 · Updated 7 months ago
- ☆9 · Updated this week
- Production-ready vocoder using BigVSAN ☆11 · Updated last year
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆11 · Updated 7 months ago
- source code of EfficientTTS 2 ☆12 · Updated last year
- ☆24 · Updated last year
- ☆12 · Updated 6 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆19 · Updated 4 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … ☆10 · Updated 2 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆15 · Updated 4 months ago
- ☆11 · Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Updated last year
- Viterbi decoding in PyTorch ☆27 · Updated this week
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆18 · Updated last year
- ☆28 · Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆29 · Updated 7 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆18 · Updated 2 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 · Updated 6 months ago
- Unofficial implementation of wavenext vocoder ☆42 · Updated 6 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆14 · Updated 10 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆34 · Updated 8 months ago
- My vocoder experiments ☆26 · Updated 4 months ago