repodiac / german_transliterate

Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to clean messy text (e.g. map peculiar Unicode encodings to ASCII) or replace common abbreviations in text in combination with various text mining tasks.
30Updated 3 years ago

Related projects

Alternatives and complementary repositories for german_transliterate