philgzl / breverView external linksLinks
Speech enhancement in noisy and reverberant environments using deep neural networks
☆22Oct 10, 2025Updated 4 months ago
Alternatives and similar repositories for brever
Users that are interested in brever are comparing it to the libraries listed below
Sorting:
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- text to speech☆10Mar 19, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- ☆11Nov 7, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Audio tokenization, in the fastest way possible!☆53Aug 26, 2024Updated last year
- Real-time end-to-end singing voice convertion☆23Nov 3, 2024Updated last year
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Nov 23, 2023Updated 2 years ago
- DysfluentWFST☆17Nov 13, 2025Updated 3 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆15Nov 11, 2024Updated last year
- ☆14Aug 1, 2025Updated 6 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Nov 16, 2025Updated 2 months ago
- ☆66Aug 16, 2023Updated 2 years ago
- ☆26Mar 20, 2024Updated last year
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆42Feb 13, 2024Updated 2 years ago
- ☆15Mar 31, 2025Updated 10 months ago
- ☆19Sep 20, 2024Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Dec 3, 2025Updated 2 months ago
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 9 months ago
- ☆13Sep 12, 2024Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 6 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- ☆57Apr 24, 2024Updated last year