Python implementation of a few speech intelligibility prediction algorithms
☆15May 29, 2024Updated last year
Alternatives and similar repositories for py-intelligibility
Users that are interested in py-intelligibility are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Sep 6, 2024Updated last year
- Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"☆12Nov 25, 2021Updated 4 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆25Apr 4, 2022Updated 4 years ago
- Source code for Multi-resolution Common Fate Transform.☆12Jun 5, 2020Updated 5 years ago
- A list of datasets made available by members of the Aalto Acoustics Lab☆29Sep 6, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Mar 4, 2024Updated 2 years ago
- Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!☆10Apr 24, 2023Updated 2 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆65Aug 24, 2025Updated 7 months ago
- ☆17Oct 18, 2023Updated 2 years ago
- ☆62May 31, 2024Updated last year
- MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating the speech transmission quality.☆17Apr 15, 2025Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 8 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆19Dec 1, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 11 months ago
- Audio production style transfer with inference-time optimization☆50Nov 18, 2024Updated last year
- Easy to use Audio Tagging in PyTorch☆23Aug 22, 2021Updated 4 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆72Aug 13, 2024Updated last year
- Runpod WhisperX Docker Container Repo☆16Mar 10, 2024Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 3 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition☆18May 12, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- ☆41Jun 25, 2025Updated 9 months ago
- Ono laboratory audio signal processing exercise for beginners.☆19May 10, 2023Updated 2 years ago
- Sound Morphing Toolbox (SMT)☆31Jul 8, 2022Updated 3 years ago
- A database of clean and noisy speech for audio research☆10Jan 26, 2018Updated 8 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Detect text dialogue on manga using YoloV3 model trained with Manga109 dataset☆20Sep 27, 2023Updated 2 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆14Jun 15, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆44Oct 14, 2021Updated 4 years ago
- repository for questions that are asked (or you want answered!) during storytelling sessions☆12Sep 7, 2025Updated 7 months ago
- A minimal example of nbdev based on Allen Downey's Think Python 2nd Ed☆11Jul 29, 2022Updated 3 years ago
- Initial repo for behavioral analyses☆10Aug 24, 2022Updated 3 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆38May 12, 2024Updated last year
- The bare metal in my basement☆21Dec 4, 2025Updated 4 months ago
- ☆28Nov 7, 2023Updated 2 years ago