lcn-kul / xls-r-analysis-sqaLinks
Analysis of XLS-R for Speech Quality Assessment
☆15Updated 11 months ago
Alternatives and similar repositories for xls-r-analysis-sqa
Users that are interested in xls-r-analysis-sqa are comparing it to the libraries listed below
Sorting:
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆103Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆162Updated last month
- ☆18Updated 2 years ago
- Official Repository For VoxBlink2☆85Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆98Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- ☆96Updated this week
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆58Updated 11 months ago
- Official repository of NeXt-TDNN for speaker verification☆81Updated last year
- ☆15Updated 2 years ago
- ☆70Updated last year
- A simple package for Guided source separation (GSS)☆132Updated last year
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆45Updated 8 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆85Updated last week
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆55Updated 9 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Updated last year
- Expressive Anechoic Recordings of Speech (EARS)☆208Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge☆87Updated 8 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆120Updated 5 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆75Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆94Updated 9 months ago
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆12Updated last year
- Reference-aware automatic speech evaluation toolkit☆176Updated last year
- The VoxTube dataset official repository☆71Updated last year
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆117Updated 4 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆96Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Updated 2 years ago
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆86Updated 8 months ago
- Official repository for FlowSE (Interspeech 2025)☆85Updated 6 months ago