lcn-kul / xls-r-analysis-sqaLinks
Analysis of XLS-R for Speech Quality Assessment
☆14Updated 8 months ago
Alternatives and similar repositories for xls-r-analysis-sqa
Users that are interested in xls-r-analysis-sqa are comparing it to the libraries listed below
Sorting:
- Official Repository For VoxBlink2☆84Updated last year
- ☆17Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆108Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆97Updated 9 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆72Updated 3 months ago
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆77Updated 5 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆151Updated last month
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆98Updated 11 months ago
- Official repository of NeXt-TDNN for speaker verification☆79Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆43Updated 5 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆74Updated last year
- Repository for fine-tuning BEATs and using BEATs as feature extractor in a prototypical network. This repository has been used to complet…☆34Updated 2 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆94Updated 3 months ago
- ☆186Updated 11 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆97Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆57Updated 8 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆101Updated 3 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆106Updated 10 months ago
- The VoxTube dataset official repository☆70Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆69Updated last year
- ☆91Updated this week
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆87Updated 7 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆95Updated 11 months ago
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆111Updated 2 months ago
- ☆69Updated last year
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆48Updated 7 months ago
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆180Updated last month
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆66Updated last month