lcn-kul / xls-r-analysis-sqa
Analysis of XLS-R for Speech Quality Assessment
☆13 · Updated last month
Alternatives and similar repositories for xls-r-analysis-sqa:
Users who are interested in xls-r-analysis-sqa are comparing it to the repositories listed below.
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆91 · Updated 7 months ago
- PAM is a no-reference audio quality metric for audio generation tasks ☆57 · Updated 8 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023. ☆95 · Updated last year
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024) ☆68 · Updated 2 months ago
- Official data preparation scripts for the URGENT 2024 Challenge ☆76 · Updated 2 months ago
- ☆65 · Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model ☆53 · Updated last year
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR". ☆45 · Updated last week
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆42 · Updated 3 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆84 · Updated 3 months ago
- ☆46 · Updated 6 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion. ☆52 · Updated 2 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation… ☆58 · Updated 8 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆59 · Updated 2 months ago
- Streaming Audiotransformers for online Audio tagging ☆43 · Updated 9 months ago
- ☆46 · Updated last year
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter ☆87 · Updated 8 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆85 · Updated last year
- ☆24 · Updated last year
- ☆69 · Updated 2 months ago
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a… ☆64 · Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆65 · Updated 3 weeks ago
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆61 · Updated 4 months ago
- ☆14 · Updated last year
- ☆26 · Updated 10 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆53 · Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024). ☆31 · Updated 2 weeks ago
- A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline ☆123 · Updated 3 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆66 · Updated 3 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆86 · Updated 4 months ago