ahmedshah1494 / speech_robust_bench
☆10 Updated last month
Alternatives and similar repositories for speech_robust_bench:
Users who are interested in speech_robust_bench compare it to the repositories listed below.
- ☆13 Updated 7 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆14 Updated 3 weeks ago
- ☆24 Updated last year
- Collection of scripts from mHuBERT-147. ☆24 Updated 4 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆20 Updated 3 weeks ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- ☆14 Updated 8 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆20 Updated 2 weeks ago
- ☆26 Updated last month
- C++ version of pyannote audio overlapped speech detection pipeline ☆12 Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model ☆10 Updated last year
- ☆26 Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024 ☆16 Updated 4 months ago
- Streaming Vocos ☆22 Updated 2 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆21 Updated 7 months ago
- ☆13 Updated 6 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆51 Updated last month
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆17 Updated 3 weeks ago
- ☆12 Updated 2 months ago
- ☆14 Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition. ☆12 Updated 2 weeks ago
- Just another FastSpeech 2 but cleaner code :) ☆26 Updated 9 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 Updated last year
- ☆16 Updated 8 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆11 Updated 8 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement ☆14 Updated 3 weeks ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆12 Updated 9 months ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition… ☆11 Updated 8 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆13 Updated last week