IlyaSemenov / wikipedia-word-frequency
Gather modern English word frequencies from all enwiki articles.
☆227, updated last year
Alternatives and similar repositories for wikipedia-word-frequency
Users who are interested in wikipedia-word-frequency are comparing it to the libraries listed below.
- A Python Wiktionary Parser (☆368, updated 5 months ago)
- A modern, interlingual wordnet interface for Python (☆276, updated 3 weeks ago)
- A list of vocabulary lists (☆22, updated 5 years ago)
- Sentence aligner (☆122, updated 4 years ago)
- Machine-readable lists of lemma-token pairs in 23 languages. (☆354, updated 3 years ago)
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat… (☆162, updated last year)
- Machine-Translation-based sentence alignment tool for parallel text (☆313, updated 4 years ago)
- The Open English WordNet (☆686, updated this week)
- Morphological Dictionaries for German Language (☆30, updated 7 years ago)
- Verb forms dictionary (☆67, updated 8 years ago)
- Universal Dependencies online documentation (☆287, updated this week)
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. (☆256, updated 3 years ago)
- A python module for English lemmatization and inflection. (☆274, updated 2 years ago)
- A multilingual parallel corpus created from translations of the Bible. (☆191, updated 7 months ago)
- Wiktionary parser tool for many language editions. (☆54, updated 3 years ago)
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… (☆107, updated last month)
- WordNet in JSON format. (☆96, updated 5 years ago)
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation (☆198, updated 5 years ago)
- Improved Sentence Alignment in Linear Time and Space (☆186, updated 2 years ago)
- The World Atlas of Language Structures (☆72, updated last year)
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code (☆52, updated 10 months ago)
- British English pronunciation dictionary (☆98, updated 8 years ago)
- Open Language Profiles: English profile datasets from CEFR-J (☆160, updated 5 years ago)
- German Morphological Analyzer (☆51, updated 4 years ago)
- Lexical database for ~70k English words with morphological variables (☆48, updated 3 years ago)
- The Open Multilingual Wordnet (☆66, updated last year)
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) (☆52, updated 2 years ago)
- All the words from Google Books, sorted by frequency (☆121, updated 2 years ago)
- Python Finite-State Toolkit (☆60, updated this week)
- Helsinki Finite-State Technology (library and application suite) (☆136, updated 2 months ago)