first20hours / google-10000-englishLinks
This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of the Google's Trillion Word Corpus.
☆4,269Updated 2 years ago
Alternatives and similar repositories for google-10000-english
Users that are interested in google-10000-english are comparing it to the libraries listed below
Sorting:
- A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion☆11,829Updated 11 months ago
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.☆195Updated 5 years ago
- A list of the most popular English words.☆388Updated 3 years ago
- Common English Vocabulary Word List☆371Updated 6 years ago
- A JSON representation of Webster's Unabridged Dictionary☆691Updated 4 years ago
- Access a database of word frequencies, in various natural languages.☆1,592Updated 11 months ago
- Repository for Frequency Word List Generator and processed files☆1,411Updated 3 years ago
- The Open Source Dictionary☆585Updated 9 months ago
- Letterpress Word List☆416Updated 9 years ago
- Wiktionary dump file parser and multilingual data extractor☆1,058Updated this week
- Webster's English Dictionary in JSON format, and related Swift parsing utility☆452Updated 2 years ago
- hand-written dictionaries from the FreeDict project☆456Updated 5 months ago
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- Compact Language Detector 2☆884Updated 4 years ago
- A Python Wiktionary Parser☆368Updated 5 months ago
- SCOWL (and friends).☆459Updated last week
- The most popular spellchecking library.☆2,403Updated 3 months ago
- List of the most common words in many languages☆184Updated last week
- Monolingual wordlists with pronunciation information in IPA☆700Updated 7 months ago
- A collection of small corpuses of interesting data for the creation of bots and similar stuff.☆5,047Updated 2 months ago
- Machine-readable lists of lemma-token pairs in 23 languages.