hackerb9 / gwordlist
All the words from Google Books, sorted by frequency
☆114Updated last year
Alternatives and similar repositories for gwordlist:
Users that are interested in gwordlist are comparing it to the libraries listed below
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆61Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Gather modern English word frequencies from all enwiki articles.☆211Updated 11 months ago
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- List of the most common words in many languages☆167Updated this week
- WordNet in JSON format.☆90Updated 4 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆46Updated 3 months ago
- Open Language Profiles — English profile datasets from CEFR-J☆116Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated this week
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆13Updated 5 years ago
- Data for the International Phonetic Alphabet (IPA)☆27Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆61Updated last month
- A Python Wiktionary Parser☆357Updated last year
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆23Updated 4 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆48Updated last year
- SCOWL (and friends).☆413Updated 5 months ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆89Updated last year
- eXtensible Interlinear Glossed Text☆32Updated 2 years ago
- A list of vocabulary lists☆21Updated 4 years ago
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.☆177Updated 5 years ago
- English Resource Grammar☆20Updated 6 months ago
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated last year
- The World Atlas Of Language Structures Online☆126Updated last month
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- ☆59Updated 2 weeks ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆43Updated 4 years ago
- Collaborative data curation for Glottolog☆156Updated this week