oprogramador / most-common-words-by-language
List of the most common words in many languages
β171Updated this week
Alternatives and similar repositories for most-common-words-by-language:
Users that are interested in most-common-words-by-language are comparing it to the libraries listed below
- π β’ 5050 most frequent words in 109 languagesβ42Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.β212Updated last year
- Letterpress Word Listβ405Updated 9 years ago
- A list of the most popular English words.β371Updated 2 years ago
- Morphological Dictionaries for German Languageβ29Updated 7 years ago
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.β185Updated 5 years ago
- Webster's English Dictionary in JSON format, and related Swift parsing utilityβ429Updated 2 years ago
- Repository for Frequency Word List Generator and processed filesβ1,257Updated 3 years ago
- List of ~636,000 Spanish wordsβ51Updated 5 years ago
- Offline database of synonyms/thesaurusβ195Updated last year
- English Lemma Database - Compiled by Referencing British National Corpusβ30Updated 7 months ago
- Lists of most-frequently-used english words / nouns / verbs etc.β63Updated 4 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiserβ54Updated 8 years ago
- Spanish to English dictionary, frequency list, and lemma dataβ33Updated 3 weeks ago
- A very long list of English profanity.β258Updated 4 months ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)β45Updated 4 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.β51Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)β30Updated 3 years ago
- A modern, interlingual wordnet interface for Pythonβ243Updated this week
- Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and β¦β9Updated 4 years ago
- for splitting words into their component syllablesβ53Updated 8 years ago
- This repo contains a list of the 44,998 most common Japanese words in order of frequency, as determined by the University of Leeds Corpusβ¦β73Updated 6 years ago
- British English pronunciation dictionaryβ95Updated 7 years ago
- Wiktionary parser tool for many language editions.β54Updated 2 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techβ¦β75Updated 7 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Foβ¦β98Updated last week
- This repository contains all the words from every language that exists in the universe.β90Updated 4 months ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.β51Updated last year
- Lexical data at Unicodeβ68Updated 7 months ago
- All the words from Google Books, sorted by frequencyβ115Updated last year