Labeled data for homograph disambiguation
☆62 Jun 1, 2023 Updated 2 years ago
Alternatives and similar repositories for WikipediaHomographData
Users that are interested in WikipediaHomographData are comparing it to the libraries listed below.
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS ☆24 Jan 29, 2022 Updated 4 years ago
- Prosodic Speech Segmentation with Transformers ☆26 Feb 25, 2024 Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 Aug 5, 2024 Updated last year
- Massively multilingual pronunciation mining ☆363 Updated this week
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Apr 13, 2022 Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆33 Oct 23, 2025 Updated 5 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆151 Apr 5, 2024 Updated 2 years ago
- Heteronym to Phoneme Parser ☆19 Nov 4, 2023 Updated 2 years ago
- g2p for english tts ☆19 Nov 10, 2022 Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ☆15 Nov 30, 2022 Updated 3 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆18 Apr 17, 2024 Updated last year
- a lightweight voice conversion ☆86 Feb 25, 2026 Updated last month
- Convert English text from written expressions into spoken forms ☆28 Jun 22, 2022 Updated 3 years ago
- ESLTTS dataset ☆16 Feb 6, 2025 Updated last year
- ☆55 Jan 13, 2023 Updated 3 years ago
- IPA Phonetic dataset lexicon ☆18 Mar 20, 2026 Updated 3 weeks ago
- Code repository for FreGrad ☆52 May 19, 2024 Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆195 Apr 2, 2026 Updated last week
- ☆11 May 7, 2022 Updated 3 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 Jul 25, 2022 Updated 3 years ago
- IPA Phonemizer/Dephonemizer for 140 human languages ☆58 Updated this week
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆45 May 25, 2021 Updated 4 years ago
- Hebrew grapheme to phoneme (G2P) ☆93 Mar 17, 2026 Updated 3 weeks ago
- Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model ☆17 Nov 24, 2016 Updated 9 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆77 Jul 16, 2023 Updated 2 years ago
- MOS score prediction by fine-tuned wav2vec2.0 model ☆177 Oct 20, 2022 Updated 3 years ago
- Charsiu: A neural phonetic aligner. ☆337 Sep 19, 2022 Updated 3 years ago
- ☆12 Nov 7, 2024 Updated last year
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search ☆95 Sep 1, 2021 Updated 4 years ago
- ☆13 Dec 7, 2022 Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆148 Aug 22, 2022 Updated 3 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization ☆103 Feb 5, 2024 Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 Dec 3, 2024 Updated last year
- text to speech ☆10 Mar 19, 2024 Updated 2 years ago
- A singing-voice database created by 22 people each singing five children's songs. ☆14 Aug 7, 2022 Updated 3 years ago
- Covering grammars for English and Russian text normalization ☆61 Sep 15, 2019 Updated 6 years ago
- ☆112 Mar 9, 2026 Updated last month
- Util code, issues, discussions ☆29 Aug 31, 2018 Updated 7 years ago
- Phonetisaurus G2P ☆516 Jun 1, 2024 Updated last year