philgzl / brever
Speech enhancement in noisy and reverberant environments using deep neural networks
☆20Updated 2 weeks ago
Alternatives and similar repositories for brever:
Users that are interested in brever are comparing it to the libraries listed below
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 7 months ago
- ☆24Updated last year
- A neural speech codec based on discrete WavLM representations☆23Updated 6 months ago
- Repository of published DNN speech separation recipes for a number of datasets☆12Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- ☆60Updated last year
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa…☆20Updated last year
- Official implementation of Self-Remixing☆13Updated last year
- ☆16Updated 8 months ago
- ☆23Updated last year
- Streaming Vocos☆21Updated 2 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated 2 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆47Updated this week
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- ☆10Updated 4 months ago
- ☆48Updated 2 years ago
- Reimplementation of Miipher☆20Updated last year
- ☆26Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 7 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 8 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆51Updated 5 months ago