IlyaSemenov / wikipedia-word-frequencyLinks
Gather modern English word frequencies from all enwiki articles.
☆216Updated last year
Alternatives and similar repositories for wikipedia-word-frequency
Users that are interested in wikipedia-word-frequency are comparing it to the libraries listed below
Sorting:
- A modern, interlingual wordnet interface for Python☆251Updated this week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆102Updated last month
- A python module for English lemmatization and inflection.☆268Updated last year
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- The Open English WordNet☆576Updated this week
- Python Finite-State Toolkit☆56Updated last week
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆194Updated 4 years ago
- English data☆208Updated last week
- English Lemma Database - Compiled by Referencing British National Corpus☆31Updated 9 months ago
- This packages up data for the Open Multilingual Wordnet☆49Updated 3 weeks ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- WordNet in JSON format.☆91Updated 4 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 3 years ago
- Universal Dependencies online documentation☆285Updated this week
- A Python Wiktionary Parser☆361Updated 4 months ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆39Updated 8 months ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- Open Language Profiles — English profile datasets from CEFR-J☆130Updated 5 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆309Updated 4 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated last year
- Runnable morphological analysis tools from the UniMorph project☆16Updated 6 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆55Updated 10 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆248Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Various utilities for processing the data.☆209Updated this week
- Helsinki Finite-State Technology (library and application suite)☆131Updated last month
- Offline database of synonyms/thesaurus☆196Updated last year
- Wikitionary in accessible JSON format☆36Updated 2 years ago
- University of Colorado VerbNet☆107Updated last year