aminEdraki / py-intelligibility
Python implementation of a few speech intelligibility prediction algorithms
☆13 Updated 10 months ago
Alternatives and similar repositories for py-intelligibility:
Users who are interested in py-intelligibility are comparing it to the libraries listed below
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement ☆14 Updated 6 months ago
- A Python implementation of Speech intelligibility in bits (SIIB) ☆24 Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆17 Updated 3 weeks ago
- Support for Clarity Enhancement and Prediction Challenges (obsolete - see README) ☆48 Updated 2 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement ☆21 Updated 3 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021 ☆39 Updated 3 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation ☆41 Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation ☆15 Updated last year
- Whisper Speech Quality Assessment (WhiSQA) ☆9 Updated 4 months ago
- Multipurpose Multi Speaker Mixture Signal Generator ☆44 Updated last month
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT" ☆15 Updated last year
- Translating Synthetic RIRs to Real RIRs ☆41 Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆19 Updated last year
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels. ☆22 Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning ☆21 Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022 ☆15 Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆37 Updated 6 months ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆36 Updated last year
- Fully Quantized Neural Networks For Speech Enhancement ☆61 Updated last year
- HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users. ☆14 Updated 2 months ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆23 Updated 4 years ago