repodiac / german_transliterate
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
☆31Updated 4 years ago
Alternatives and similar repositories for german_transliterate:
Users that are interested in german_transliterate are comparing it to the libraries listed below
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆27Updated last year
- ☆79Updated 7 months ago
- ☆62Updated 8 months ago
- ☆35Updated 3 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆47Updated 5 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆62Updated 10 months ago
- ☆38Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆72Updated 3 years ago
- This is the M-AILABS Speech Dataset☆36Updated last month
- multilingual speech aligner☆73Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆39Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆73Updated last year
- Convert English text from written expressions into spoken forms☆21Updated 2 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆147Updated last year
- Unofficial implementation of miipher☆114Updated 8 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- A collection of utilities for handling IPA phones.☆25Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆48Updated 8 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt…☆16Updated 4 years ago
- A sequence-to-sequence voice conversion toolkit.☆92Updated 6 months ago
- ☆33Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆151Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training