google-research-datasets / WikipediaHomographData
Labeled data for homograph disambiguation
☆59 Updated 2 years ago
Alternatives and similar repositories for WikipediaHomographData
Users who are interested in WikipediaHomographData compare it to the repositories listed below.
- ☆80 Updated last year
- Covering grammars for English and Russian text normalization ☆61 Updated 5 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆45 Updated 4 years ago
- ☆40 Updated 3 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS ☆24 Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆42 Updated 2 years ago
- A collection of utilities for handling IPA phones. ☆25 Updated last year
- Prosodic Speech Segmentation with Transformers ☆25 Updated last year
- Convert English text from written expressions into spoken forms ☆25 Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆102 Updated 2 years ago
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- A system for singing voice synthesis ☆79 Updated 2 years ago
- A phoneme-allophone database for many languages ☆52 Updated 5 years ago
- Multilingual speech aligner ☆74 Updated last year
- Multilingual Grapheme to Phoneme ☆50 Updated 9 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆71 Updated 2 weeks ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 Updated 4 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆120 Updated 3 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN… ☆74 Updated 2 years ago
- Phoneme segmentation using pre-trained speech models ☆55 Updated 2 years ago
- asr2k ☆51 Updated last year
- ☆42 Updated 3 years ago
- Alignment files of LibriTTS. ☆64 Updated 5 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 Updated 3 years ago
- CMU multilingual speech repository ☆31 Updated 3 years ago
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal. ☆53 Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court ☆22 Updated 2 years ago
- Text-to-Speech tutorial at SLTU 2016 ☆35 Updated 9 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆93 Updated last year