repodiac / german_transliterate
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
☆32Updated 4 years ago
Alternatives and similar repositories for german_transliterate:
Users that are interested in german_transliterate are comparing it to the libraries listed below
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆30Updated 2 years ago
- ☆62Updated 10 months ago
- Convert English text from written expressions into spoken forms☆24Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆41Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆63Updated last year
- multilingual speech aligner☆73Updated last year
- ☆80Updated 10 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆30Updated 8 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 7 months ago
- ☆35Updated 3 weeks ago
- ☆21Updated 7 months ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆68Updated 7 months ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆44Updated 3 years ago
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- Online streaming speaker change detection model in Pytorch☆38Updated last year
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆22Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆122Updated 2 years ago
- Linguistic processing for Common Voice☆55Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- ☆27Updated last year
- Unofficial implementation of wavenext vocoder☆44Updated 7 months ago
- ☆36Updated 6 months ago
- A sequence-to-sequence voice conversion toolkit.☆96Updated 8 months ago