philgzl / brever
Speech enhancement in noisy and reverberant environments using deep neural networks
☆19Updated last week
Alternatives and similar repositories for brever:
Users that are interested in brever are comparing it to the libraries listed below
- ☆17Updated 8 months ago
- A neural speech codec based on discrete WavLM representations☆23Updated 7 months ago
- ☆24Updated last year
- Repository of published DNN speech separation recipes for a number of datasets☆12Updated last year
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa…☆20Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆33Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 8 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Official implementation of Self-Remixing☆13Updated last year
- ☆10Updated 5 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 8 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- ☆61Updated last year
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆12Updated 8 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- Viterbi decoding in PyTorch☆30Updated last week
- ☆13Updated 7 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆16Updated 2 weeks ago
- Streaming Vocos☆24Updated 3 months ago
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆13Updated 2 weeks ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆23Updated 3 weeks ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆48Updated 3 weeks ago
- GPT for FACodec☆13Updated last year
- ☆24Updated last year
- offical code for Dense-TSNet☆12Updated 6 months ago
- ☆26Updated last year
- ☆48Updated 2 years ago