aminEdraki / py-intelligibilityLinks
Python implementation of a few speech intelligibility prediction algorithms
☆15Updated last year
Alternatives and similar repositories for py-intelligibility
Users that are interested in py-intelligibility are comparing it to the libraries listed below
Sorting:
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Updated 2 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆25Updated 3 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆22Updated 2 years ago
- ☆32Updated 3 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 4 months ago
- Manage audio and video datasets☆33Updated last month
- Goodness of Pronunciation algorithm using PyKaldi☆18Updated 3 years ago
- ☆13Updated 3 years ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated last month
- Generative Adversarial Networks for different impaired speech conversions☆38Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- ☆22Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 4 years ago
- ☆16Updated 4 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 7 months ago
- Baseline kaldi script for UA-SPEECH corpus☆32Updated last year
- Support for Clarity Enhancement and Prediction Challenges (obsolete - see README)☆48Updated 3 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 10 months ago
- NVIDIA's FastPitch, extracted from the DeepLearningExamples repository☆13Updated last year
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Updated 3 years ago
- ☆27Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- Easy to use Audio Tagging in PyTorch☆22Updated 4 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Reproducible Subjective Evaluation☆60Updated last year