lcn-kul / xls-r-analysis-sqa
Analysis of XLS-R for Speech Quality Assessment
☆11Updated 3 months ago
Related projects
Alternatives and complementary repositories for xls-r-analysis-sqa
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆44Updated last week
- ☆64Updated last year
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆60Updated 9 months ago
- ☆27Updated 7 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆19Updated 11 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆40Updated 3 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆81Updated 8 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆25Updated last month
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- An example of a speech enhancement model deployed with TensorRT.☆38Updated 10 months ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆13Updated 2 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆30Updated 3 months ago
- ☆48Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆38Updated last month
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆33Updated last month
- ConMamba for Automatic Speech Recognition☆44Updated 3 months ago
- ☆68Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month
- PAM is a no-reference audio quality metric for audio generation tasks☆49Updated 4 months ago
- Query-conditioned target sound extraction model☆17Updated 3 weeks ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆52Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆16Updated 3 weeks ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆35Updated last year
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS …☆95Updated last month
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆36Updated last month
- ☆55Updated last month
- ☆59Updated last year