repodiac / german_transliterateLinks
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
☆33Updated 4 years ago
Alternatives and similar repositories for german_transliterate
Users that are interested in german_transliterate are comparing it to the libraries listed below
Sorting:
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆31Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆42Updated 2 years ago
- Convert English text from written expressions into spoken forms☆25Updated 3 years ago
- Linguistic processing for Common Voice☆55Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago
- ☆80Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆92Updated last year
- ☆37Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated last month
- ☆37Updated 2 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆162Updated 2 years ago
- Labeled data for homograph disambiguation☆59Updated 2 years ago
- multilingual speech aligner☆74Updated last year
- ☆63Updated last year
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆165Updated 2 weeks ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆52Updated 10 months ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- phone inventory library☆16Updated 2 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 9 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 2 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆57Updated 3 years ago