Python implementation of a few speech intelligibility prediction algorithms
☆15 · May 29, 2024 · Updated last year
Alternatives and similar repositories for py-intelligibility
Users who are interested in py-intelligibility are comparing it to the libraries listed below.
- Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations" ☆12 · Nov 25, 2021 · Updated 4 years ago
- A Python implementation of Speech intelligibility in bits (SIIB) ☆25 · Apr 4, 2022 · Updated 3 years ago
- Source code for Multi-resolution Common Fate Transform. ☆12 · Jun 5, 2020 · Updated 5 years ago
- A list of datasets made available by members of the Aalto Acoustics Lab ☆29 · Sep 6, 2024 · Updated last year
- ☆21 · Mar 4, 2024 · Updated 2 years ago
- Python script to download hundreds of images from 'Google Images'. It is ready-to-run code! ☆10 · Apr 24, 2023 · Updated 2 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based … ☆65 · Aug 24, 2025 · Updated 7 months ago
- WaveNet with TensorFlow 2.0 ☆23 · Mar 9, 2020 · Updated 6 years ago
- ☆62 · May 31, 2024 · Updated last year
- MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating the speech transmission quality. ☆17 · Apr 15, 2025 · Updated 11 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 · Aug 1, 2025 · Updated 7 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 · Dec 3, 2024 · Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition ☆18 · Dec 1, 2024 · Updated last year
- ☆39 · Jun 25, 2025 · Updated 9 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆13 · Sep 27, 2024 · Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆11 · May 14, 2025 · Updated 10 months ago
- Audio production style transfer with inference-time optimization ☆49 · Nov 18, 2024 · Updated last year
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model" ☆15 · Jun 28, 2024 · Updated last year
- Easy to use Audio Tagging in PyTorch ☆23 · Aug 22, 2021 · Updated 4 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆70 · Aug 13, 2024 · Updated last year
- Runpod WhisperX Docker Container Repo ☆15 · Mar 10, 2024 · Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a… ☆12 · Mar 24, 2023 · Updated 3 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech ☆11 · Jun 30, 2023 · Updated 2 years ago
- ☆14 · Aug 19, 2024 · Updated last year
- A minimal example of nbdev based on Allen Downey's Think Python 2nd Ed ☆10 · Jul 29, 2022 · Updated 3 years ago
- Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition ☆18 · May 12, 2022 · Updated 3 years ago
- ☆14 · Aug 16, 2023 · Updated 2 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement ☆22 · Sep 21, 2021 · Updated 4 years ago
- Ono laboratory audio signal processing exercise for beginners. ☆19 · May 10, 2023 · Updated 2 years ago
- A database of clean and noisy speech for audio research ☆10 · Jan 26, 2018 · Updated 8 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Detect text dialogue on manga using YoloV3 model trained with Manga109 dataset ☆20 · Sep 27, 2023 · Updated 2 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch). ☆14 · Jun 15, 2021 · Updated 4 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021 ☆44 · Oct 14, 2021 · Updated 4 years ago
- Initial repo for behavioral analyses ☆10 · Aug 24, 2022 · Updated 3 years ago
- Repository for Accent Recognition (Hackathon @SLT2022) ☆38 · May 12, 2024 · Updated last year
- ☆28 · Nov 7, 2023 · Updated 2 years ago
- Python implementation of performance metrics in Loizou's Speech Enhancement book ☆449 · Feb 15, 2025 · Updated last year
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆32 · Apr 10, 2023 · Updated 2 years ago