Labeled data for homograph disambiguation
☆62Jun 1, 2023Updated 2 years ago
Alternatives and similar repositories for WikipediaHomographData
Users that are interested in WikipediaHomographData are comparing it to the libraries listed below
Sorting:
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Prosodic Speech Segmentation with Transformers☆26Feb 25, 2024Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆147Apr 5, 2024Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- Convert English text from written expressions into spoken forms☆28Jun 22, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- a lightweight voice conversion☆86Updated this week
- ☆11May 7, 2022Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- Code repository for FreGrad☆52May 19, 2024Updated last year
- Massively multilingual pronunciation mining☆362Jan 13, 2026Updated last month
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- ☆14Jun 12, 2015Updated 10 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆175Oct 20, 2022Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Hebrew grapheme to phoneme (G2P)☆89Feb 18, 2026Updated last week
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆190Jan 26, 2026Updated last month
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆130Jul 30, 2024Updated last year
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆22Jan 22, 2024Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- Charsiu: A neural phonetic aligner.☆332Sep 19, 2022Updated 3 years ago
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆94Sep 1, 2021Updated 4 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- pytorch model for contexless-phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago