olastor / german-word-frequenciesLinks
Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.
☆14Updated 4 years ago
Alternatives and similar repositories for german-word-frequencies
Users that are interested in german-word-frequencies are comparing it to the libraries listed below
Sorting:
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆32Updated 7 months ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆31Updated 3 months ago
- Aksharamukha Python Library☆52Updated 8 months ago
- JavaScript port of SymSpell for Node.js☆13Updated 3 years ago
- German stopwords collection☆86Updated 3 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆176Updated 4 months ago
- All languages stopwords collection☆458Updated last year
- Text to IPA converter in JavaScript☆58Updated 3 years ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆83Updated 4 months ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆502Updated 11 months ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆131Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆159Updated 9 months ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆43Updated 8 months ago
- A list of vocabulary lists☆22Updated 5 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆58Updated last year
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12Updated 8 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆34Updated this week
- 🎀 JavaScript API for spaCy with Python REST API☆198Updated 2 years ago
- English lexicon useful in NLP/NLU☆15Updated 2 years ago
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player☆155Updated 10 months ago
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆20Updated last year
- Massively multilingual pronunciation mining☆352Updated last month
- The Data Format for Digital Linguistics (DaFoDiL)☆22Updated 2 years ago
- Plan and train German transformer models.☆23Updated 4 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Rezonator: Dynamics of human engagement☆35Updated last month
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆51Updated 2 years ago