ahmedshah1494 / speech_robust_bench
☆16Updated 6 months ago
Alternatives and similar repositories for speech_robust_bench
Users who are interested in speech_robust_bench are comparing it to the libraries listed below
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Updated 7 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 3 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆11Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆17Updated 5 months ago
- ☆17Updated last year
- Collection of scripts from mHuBERT-147.☆32Updated last year
- ☆91Updated last week
- ☆15Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated last year
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆48Updated 7 months ago
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆43Updated 7 months ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Updated 7 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆22Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆23Updated last month
- ☆44Updated 4 months ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆43Updated last month
- ☆61Updated last year
- ☆24Updated 6 months ago
- ☆17Updated 10 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆42Updated 9 months ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Unofficial implementation of wavenext vocoder☆52Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Updated last month
- ☆21Updated last year
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆37Updated 6 months ago
- A toolkit dedicated to speech evaluation.☆24Updated last year
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆69Updated last month
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Updated 3 months ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆56Updated last year