Universal multilingual automatic speech transcription into IPA
☆77Feb 28, 2025Updated last year
Alternatives and similar repositories for multipa
Users that are interested in multipa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆29Mar 14, 2025Updated last year
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated last year
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆301Oct 22, 2025Updated 5 months ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- A phoneme-allophone database for many languages☆53May 19, 2020Updated 5 years ago
- Keyword spotting and forced alignment in any language☆92Feb 12, 2026Updated last month
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46May 12, 2023Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated last year
- Convert native orthographies to the International Phonetic Alphabet☆18Jul 4, 2025Updated 8 months ago
- ☆56Dec 19, 2022Updated 3 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- VoxAngeles Corpus☆14Aug 23, 2025Updated 7 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)☆37Jul 24, 2025Updated 7 months ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 5 months ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- pytorch model for contexless-phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆17Mar 1, 2024Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆43Mar 13, 2026Updated last week
- An open-source parallel corpus for machine translation across Kazakh, English, Russian, and Turkish☆16Mar 29, 2024Updated last year
- ☆31Aug 23, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆21Mar 4, 2024Updated 2 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆100Nov 20, 2023Updated 2 years ago
- Search for pronuncations in different languages☆11Nov 2, 2024Updated last year
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆38Mar 3, 2025Updated last year
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆28Feb 21, 2025Updated last year