philgzl / brever
Speech enhancement in noisy and reverberant environments using deep neural networks
☆20 Updated last month
Alternatives and similar repositories for brever:
Users who are interested in brever are comparing it to the libraries listed below:
- Repository of published DNN speech separation recipes for a number of datasets ☆12 Updated last year
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa… ☆20 Updated last year
- ☆24 Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆11 Updated 9 months ago
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv… ☆19 Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement ☆22 Updated 2 years ago
- A neural speech codec based on discrete WavLM representations ☆24 Updated 8 months ago
- Official implementation of Self-Remixing ☆13 Updated last year
- ☆17 Updated 9 months ago
- ☆61 Updated last year
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations" ☆14 Updated 2 years ago
- Official implementation for FlowSep ☆45 Updated 4 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 Updated last year
- Self-supervised Generative LM-based Voice Conversion ☆27 Updated last week
- ☆13 Updated 8 months ago
- A toolkit for researchers in multimodal sound separation. ☆16 Updated last year
- Official code for Dense-TSNet ☆12 Updated 7 months ago
- ☆15 Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆49 Updated last month
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆24 Updated last month
- ☆26 Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆44 Updated 3 weeks ago
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation ☆33 Updated last year
- ☆24 Updated last year
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR". ☆50 Updated 3 weeks ago
- ☆48 Updated last month
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆52 Updated 6 months ago
- ☆49 Updated 2 years ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder ☆45 Updated 2 years ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models ☆21 Updated last year