olastor / german-word-frequencies
Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.
☆12Updated 4 years ago
Alternatives and similar repositories for german-word-frequencies
Users that are interested in german-word-frequencies are comparing it to the libraries listed below
Sorting:
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆80Updated 5 months ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆48Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆31Updated 2 months ago
- A library for fetching and reading Tatoeba's weekly exports☆22Updated last year
- Script for workflow to add morphological analysis into ELAN files☆13Updated 5 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆34Updated last year
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 3 years ago
- Massively multilingual pronunciation mining☆340Updated 3 weeks ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆158Updated this week
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and …☆9Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆45Updated 2 years ago
- Audiobook alignment for Indigenous languages☆40Updated this week
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆24Updated 8 years ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆47Updated 3 years ago
- ⚙️ Powerful JS library to manage audio recording : intelligent cutting, saturation control, various export options...☆41Updated last year
- ☆26Updated last year
- linguistics backend☆41Updated 2 years ago
- 🏆 • 5050 most frequent words in 109 languages☆42Updated 2 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated last month
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆52Updated last year
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆52Updated 2 weeks ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- ☆47Updated 9 months ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆14Updated 11 months ago
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆16Updated last month
- OpusFilter - Parallel corpus processing toolkit☆104Updated last month