IlyaSemenov / wikipedia-word-frequency
Gather modern English word frequencies from all enwiki articles.
☆212Updated last year
Alternatives and similar repositories for wikipedia-word-frequency:
Users that are interested in wikipedia-word-frequency are comparing it to the libraries listed below
- A Python Wiktionary Parser☆358Updated 3 weeks ago
- A modern, interlingual wordnet interface for Python☆235Updated 3 weeks ago
- A list of vocabulary lists☆21Updated 4 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 5 months ago
- The Open English WordNet☆521Updated last month
- A python module for English lemmatization and inflection.☆265Updated last year
- German Morphological Analyzer☆47Updated 3 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- Universal Dependencies online documentation☆282Updated this week
- Lexical database for ~70k English words with morphological variables☆42Updated 3 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- WordNet in JSON format.☆90Updated 4 years ago
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆36Updated 5 months ago
- ☆64Updated 10 months ago
- English HPSG parser☆51Updated 6 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆27Updated 5 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- This packages up data for the Open Multilingual Wordnet☆47Updated last week
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- Offline database of synonyms/thesaurus☆192Updated last year
- English data☆205Updated 2 weeks ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Various utilities for processing the data.☆208Updated this week
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆29Updated last month
- Open Language Profiles — English profile datasets from CEFR-J☆122Updated 4 years ago
- Gale-Church sentence aligner with options for variable parameters☆17Updated 5 years ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆36Updated 2 years ago