lcn-kul / xls-r-analysis-sqa
Analysis of XLS-R for Speech Quality Assessment
☆14Updated 8 months ago
Alternatives and similar repositories for xls-r-analysis-sqa
Users interested in xls-r-analysis-sqa are comparing it to the libraries listed below.
- The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆179Updated 3 weeks ago
- ☆16Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆49Updated 4 months ago
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆143Updated 3 weeks ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆97Updated 9 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- ☆69Updated last year
- Reference-aware automatic speech evaluation toolkit☆163Updated 10 months ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆217Updated last year
- ☆185Updated 10 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆68Updated 3 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆95Updated 10 months ago
- This is the M-AILABS Speech Dataset☆87Updated 10 months ago
- This GitHub repo is for the NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆104Updated 2 years ago
- ☆24Updated 3 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆57Updated 8 months ago
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆69Updated 2 months ago
- Official Repository For VoxBlink2☆84Updated last year
- Training code for FAcodec presented in NaturalSpeech3☆222Updated last year
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆239Updated last year
- Predicts the level of noise and reverberation in your audio files☆165Updated 4 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆172Updated this week
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆92Updated 2 months ago
- Official repository of NeXt-TDNN for speaker verification☆78Updated last year
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆97Updated 10 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- This is the official implementation of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆74Updated 4 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆152Updated 2 years ago
- The VoxTube dataset official repository☆70Updated last year
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆231Updated 5 months ago