lcn-kul / xls-r-analysis-sqaLinks
Analysis of XLS-R for Speech Quality Assessment
☆14Updated 9 months ago
Alternatives and similar repositories for xls-r-analysis-sqa
Users that are interested in xls-r-analysis-sqa are comparing it to the libraries listed below
Sorting:
- Official Repository For VoxBlink2☆84Updated last year
- ☆69Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆157Updated last week
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆75Updated 3 months ago
- Official repository of NeXt-TDNN for speaker verification☆79Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆98Updated 10 months ago
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆111Updated 2 months ago
- ☆18Updated last year
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆58Updated 9 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆94Updated 7 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆70Updated 3 years ago
- Official repository for FlowSE (Interspeech 2025)☆69Updated 4 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆98Updated 3 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆97Updated 2 years ago
- ☆189Updated 11 months ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆43Updated 6 months ago
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆82Updated 6 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆110Updated 11 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆95Updated last year
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆99Updated last year
- ☆82Updated 10 months ago
- ☆92Updated 3 weeks ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated last year
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …☆166Updated 6 months ago
- Repository for fine-tuning BEATs and using BEATs as feature extractor in a prototypical network. This repository has been used to complet…☆34Updated 2 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆69Updated 2 months ago
- This is the M-AILABS Speech Dataset☆90Updated last year