repodiac / german_transliterateLinks
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
☆33Updated 4 years ago
Alternatives and similar repositories for german_transliterate
Users that are interested in german_transliterate are comparing it to the libraries listed below
Sorting:
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆32Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 3 years ago
- ☆80Updated 3 weeks ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆87Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆43Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated 3 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆169Updated 2 years ago
- Convert English text from written expressions into spoken forms☆26Updated 3 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆74Updated 2 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆123Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆173Updated this week
- multilingual speech aligner☆76Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- Linguistic processing for Common Voice☆57Updated last year
- Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated last year
- ☆64Updated last year
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆36Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆57Updated last year
- SelfRemaster: SSL Speech Restoration☆89Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆139Updated 3 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆95Updated last year
- ☆16Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago