lcn-kul / xls-r-analysis-sqa
Analysis of XLS-R for Speech Quality Assessment
☆14Updated 11 months ago
Alternatives and similar repositories for xls-r-analysis-sqa
Users who are interested in xls-r-analysis-sqa are comparing it to the repositories listed below
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆160Updated 3 weeks ago
- ☆18Updated last year
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆82Updated 5 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆101Updated last year
- Official Repository For VoxBlink2☆85Updated last year
- This is the official implementation of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆84Updated 7 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆99Updated 5 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆58Updated 10 months ago
- ☆69Updated last year
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Updated 2 years ago
- Official repository for FlowSE (Interspeech 2025)☆84Updated 6 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆111Updated last year
- This GitHub repo is for the NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆87Updated 7 months ago
- Source code for Consistent ensemble distillation for audio tagging☆54Updated 6 months ago
- The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆185Updated 3 months ago
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆12Updated last year
- ☆106Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆98Updated 2 years ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆72Updated 3 months ago
- The VoxTube dataset official repository☆71Updated last year
- Reference-aware automatic speech evaluation toolkit☆176Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆74Updated last year
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆44Updated 7 months ago
- ☆199Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks☆76Updated last year
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆117Updated 5 months ago
- Repository for fine-tuning BEATs and using BEATs as feature extractor in a prototypical network. This repository has been used to complet…☆34Updated 2 weeks ago
- A simple package for Guided source separation (GSS)☆132Updated last year