hackerb9 / gwordlist
All the words from Google Books, sorted by frequency
☆109Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gwordlist
- WordNet in JSON format.☆90Updated 4 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆49Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆94Updated this week
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆67Updated 3 years ago
- Verb forms dictionary☆60Updated 7 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- British English pronunciation dictionary☆89Updated 7 years ago
- A simple phonetic respelling for the English language☆9Updated last year
- SCOWL (and friends).☆394Updated 2 months ago
- Gather modern English word frequencies from all enwiki articles.☆202Updated 8 months ago
- X-SAMPA to IPA converter☆25Updated 4 years ago
- Collaborative data curation for Glottolog☆152Updated this week
- Monolingual wordlists with pronunciation information in IPA☆552Updated last year
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.☆166Updated 4 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆62Updated last month
- The World Atlas of Language Structures☆55Updated 3 weeks ago
- Interactive visualization of Wiktionary words and etymologies.☆90Updated last week
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆43Updated last week
- A list of vocabulary lists☆21Updated 4 years ago
- Massively multilingual pronunciation mining☆320Updated last month
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Data for the International Phonetic Alphabet (IPA)☆26Updated last year
- Etymological graphs based on Wiktionary dumps☆18Updated last year
- International Phonetic Alphabet (IPA) Unicode Chart and Character Picker☆129Updated 3 years ago
- Text to IPA converter in JavaScript☆52Updated 2 years ago
- Crawler for linguistic corpora☆192Updated 11 months ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆66Updated last year
- pronunciation dictionaries for multiple languages☆83Updated 7 years ago
- PHOIBLE Online☆42Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year