hackerb9 / gwordlist
All the words from Google Books, sorted by frequency
☆127 Updated 2 years ago
Alternatives and similar repositories for gwordlist
Users who are interested in gwordlist are comparing it to the libraries listed below.
- Gather modern English word frequencies from all enwiki articles. ☆228 Updated last year
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code ☆103 Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆58 Updated 4 years ago
- Verb forms dictionary ☆70 Updated 8 years ago
- SCOWL (and friends). ☆464 Updated last week
- Text to IPA converter in JavaScript ☆58 Updated 3 years ago
- A browser-based tool to convert International Phonetic Alphabet (IPA) phonetic notation to speech using the meSpeak.js package ☆277 Updated 3 years ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆358 Updated 4 years ago
- WordNet in JSON format. ☆97 Updated 5 years ago
- A modern, interlingual wordnet interface for Python ☆282 Updated last week
- The Unicode Cookbook for Linguists ☆56 Updated 5 years ago
- The largest English-language thesaurus ☆313 Updated 4 months ago
- Collaborative data curation for Glottolog ☆184 Updated last week
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat… ☆164 Updated last year
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code ☆55 Updated last year
- Offline bilingual dictionaries made using data from Wiktionary ☆62 Updated 10 years ago
- The Open English WordNet ☆716 Updated last week
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary ☆82 Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆35 Updated 2 years ago
- Monolingual wordlists with pronunciation information in IPA ☆719 Updated 8 months ago
- Han character library for CJKV languages ☆165 Updated 4 years ago
- Helsinki Finite-State Technology (library and application suite) ☆136 Updated last month
- English Lemma Database - Compiled by Referencing British National Corpus ☆36 Updated last year
- Data for the International Phonetic Alphabet (IPA) ☆33 Updated 3 years ago
- Pronunciation dictionaries for several languages, based on Wiktionary data. ☆21 Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆107 Updated 2 months ago
- Automatically exported from code.google.com/p/foma ☆128 Updated 5 months ago
- hand-written dictionaries from the FreeDict project ☆462 Updated 6 months ago
- Wiktionary parser tool for many language editions. ☆54 Updated 3 years ago
- The World Atlas of Language Structures ☆74 Updated last year