ahmedshah1494 / speech_robust_bench
☆9Updated last month
Related projects ⓘ
Alternatives and complementary repositories for speech_robust_bench
- ☆17Updated 3 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆17Updated this week
- Speech enhancement in noisy and reverberant environments using deep neural networks☆15Updated last month
- ☆23Updated last year
- ☆10Updated 2 months ago
- ☆13Updated 2 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆18Updated this week
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆35Updated last month
- Source code for DM-Codec.☆18Updated 3 weeks ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆10Updated 10 months ago
- Production-ready vocoder using BigVSAN☆11Updated 8 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Aligner for text-to-speech☆15Updated 3 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 3 months ago
- Unofficial implementation of wavenext vocoder☆31Updated 2 months ago
- ☆12Updated 3 months ago
- ☆26Updated 8 months ago
- ☆15Updated 3 months ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆21Updated last month
- ☆25Updated 4 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆28Updated 3 weeks ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆42Updated 4 months ago
- ☆18Updated 2 months ago
- ☆34Updated 6 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆13Updated 3 weeks ago
- GPT for FACodec☆13Updated 7 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- source code of EfficientTTS 2☆12Updated 8 months ago