repodiac / german_transliterate
Python module to clean and transliterate (i.e. normalize) German text, including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode characters to ASCII) or to replace common abbreviations as a preprocessing step for various text mining tasks.
☆33 Updated 4 years ago
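To make the description above concrete, here is a minimal sketch of the kind of normalization it refers to: expanding a few abbreviations and mapping non-ASCII characters to ASCII. It uses only the Python standard library and does not call german_transliterate; the function names and the two small mapping tables are assumptions made for illustration, not the module's actual API or data.

```python
import re
import unicodedata

# Hypothetical lookup tables for illustration only; they are NOT taken
# from german_transliterate.
UMLAUTS = {"ä": "ae", "ö": "oe", "ü": "ue",
           "Ä": "Ae", "Ö": "Oe", "Ü": "Ue", "ß": "ss"}
ABBREVIATIONS = {"z.B.": "zum Beispiel", "usw.": "und so weiter", "ca.": "circa"}


def expand_abbreviations(text: str) -> str:
    """Replace each known abbreviation with its spelled-out form."""
    for abbr, full in ABBREVIATIONS.items():
        # (?<!\w) prevents a match that starts in the middle of a word.
        text = re.sub(r"(?<!\w)" + re.escape(abbr), full, text)
    return text


def to_ascii(text: str) -> str:
    """Map text to ASCII, using German conventions for umlauts and ß."""
    # Compose first so that 'a' + combining diaeresis becomes a single 'ä'.
    text = unicodedata.normalize("NFC", text)
    text = "".join(UMLAUTS.get(ch, ch) for ch in text)
    # Decompose what is left (é -> e + accent, ligatures -> letters) and
    # drop every character that still has no ASCII representation.
    text = unicodedata.normalize("NFKD", text)
    return text.encode("ascii", "ignore").decode("ascii")


def normalize(text: str) -> str:
    """Expand abbreviations, then transliterate to ASCII."""
    return to_ascii(expand_abbreviations(text))


print(normalize("Größe ca. 5 m, z.B. für Café"))
```

Handling the umlauts before the generic NFKD step is a deliberate choice in this sketch: plain decomposition would turn "ü" into "u", whereas the German convention is "ue".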
Alternatives and similar repositories for german_transliterate
Users who are interested in german_transliterate compare it to the libraries listed below.
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆31 Updated 2 years ago
- Convert English text from written expressions into spoken forms ☆25 Updated 2 years ago
- ☆80 Updated last year
- Linguistic processing for Common Voice ☆55 Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones. ☆35 Updated last year
- multilingual speech aligner ☆74 Updated last year
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆100 Updated 11 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆42 Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆66 Updated 3 weeks ago
- ☆38 Updated 3 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆45 Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 Updated 4 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆162 Updated last year
- Simple diarization model ☆49 Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆120 Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation. ☆88 Updated 2 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆38 Updated 2 years ago
- ☆63 Updated last year
- A collection of utilities for handling IPA phones. ☆25 Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆70 Updated 2 years ago
- phone inventory library ☆16 Updated 2 years ago
- ☆36 Updated last month
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆102 Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆51 Updated last week
- Prosodic Speech Segmentation with Transformers ☆25 Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆71 Updated 9 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆137 Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆34 Updated this week
- ☆37 Updated last year