Python implementation of a few speech intelligibility prediction algorithms
☆15May 29, 2024Updated last year
Alternatives and similar repositories for py-intelligibility
Users that are interested in py-intelligibility are comparing it to the libraries listed below.
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Sep 6, 2024Updated last year
- A repository for code used to produce the results of the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆23Nov 25, 2024Updated last year
- Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"☆12Nov 25, 2021Updated 4 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆25Apr 4, 2022Updated 4 years ago
- Source code for Multi-resolution Common Fate Transform.☆12Jun 5, 2020Updated 5 years ago
- A list of datasets made available by members of the Aalto Acoustics Lab☆30Sep 6, 2024Updated last year
- ☆21Mar 4, 2024Updated 2 years ago
- Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!☆10Apr 24, 2023Updated 3 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆67Aug 24, 2025Updated 9 months ago
- ☆17Oct 18, 2023Updated 2 years ago
- ☆62May 31, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 9 months ago
- MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating the speech transmission quality.☆17Apr 15, 2025Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆19Dec 1, 2024Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Audio production style transfer with inference-time optimization☆52Nov 18, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆15Jun 28, 2024Updated last year
- Easy to use Audio Tagging in PyTorch☆23Aug 22, 2021Updated 4 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆72Aug 13, 2024Updated last year
- Runpod WhisperX Docker Container Repo☆16Mar 10, 2024Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 3 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition☆18May 12, 2022Updated 4 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Sep 21, 2021Updated 4 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- ☆45Jun 25, 2025Updated 11 months ago
- Ono laboratory audio signal processing exercise for beginners.☆19May 10, 2023Updated 3 years ago
- Sound Morphing Toolbox (SMT)☆31Jul 8, 2022Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆14Jun 15, 2021Updated 4 years ago
- Detect text dialogue on manga using YoloV3 model trained with Manga109 dataset☆20Sep 27, 2023Updated 2 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆44Oct 14, 2021Updated 4 years ago
- A minimal example of nbdev based on Allen Downey's Think Python 2nd Ed☆11Jul 29, 2022Updated 3 years ago
- repository for questions that are asked (or you want answered!) during storytelling sessions☆12Sep 7, 2025Updated 8 months ago
- Initial repo for behavioral analyses☆10Aug 24, 2022Updated 3 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆39May 12, 2024Updated 2 years ago