Universal multilingual automatic speech transcription into IPA
☆77Feb 28, 2025Updated last year
Alternatives and similar repositories for multipa
Users who are interested in multipa are comparing it to the libraries listed below.
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated 11 months ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆28Mar 14, 2025Updated 11 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆295Oct 22, 2025Updated 4 months ago
- ☆14Aug 19, 2024Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Simple inference for Vits2 TTS using ONNXRUNTIME and espeak-ng in C++☆18Apr 17, 2024Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- ☆17Mar 1, 2024Updated 2 years ago
- ☆71Jul 13, 2023Updated 2 years ago
- PyTorch model for contextless phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 9 months ago
- A phoneme-allophone database for many languages☆53May 19, 2020Updated 5 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- ☆21Mar 4, 2024Updated last year
- ☆57Apr 18, 2023Updated 2 years ago
- The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)☆36Jul 24, 2025Updated 7 months ago
- ☆56Dec 19, 2022Updated 3 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- Russian phonetic transcription☆11Nov 19, 2025Updated 3 months ago
- Grapheme to phoneme model for PyTorch☆43Jul 21, 2022Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46May 12, 2023Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A collection of all our phonemizers for dataset construction and inference☆27Feb 21, 2025Updated last year
- This is a winter of code project aimed at speech enhancement of text to speech models.☆24Feb 6, 2022Updated 4 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Jun 30, 2025Updated 8 months ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- VoxAngeles Corpus☆13Aug 23, 2025Updated 6 months ago