hackerb9 / gwordlist
All the words from Google Books, sorted by frequency
☆117Updated last year
Alternatives and similar repositories for gwordlist
Users who are interested in gwordlist are comparing it to the libraries listed below
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆76Updated last year
- British English pronunciation dictionary☆95Updated 7 years ago
- CMUdict maintenance, and tools☆222Updated 5 months ago
- WordNet in JSON format.☆91Updated 4 years ago
- Verb forms dictionary☆66Updated 7 years ago
- The Open English WordNet☆576Updated last week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆102Updated last month
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆79Updated 3 years ago
- Monolingual wordlists with pronunciation information in IPA☆632Updated last month
- Converts English text to IPA notation☆388Updated 2 years ago
- The CMU Pronouncing Dictionary converted to IPA☆84Updated 5 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆55Updated 10 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆86Updated last year
- Convert phoneme codes and lexicon formats for English speech synths☆45Updated 3 months ago
- Data for the International Phonetic Alphabet (IPA)☆28Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.☆216Updated last year
- A modern, interlingual wordnet interface for Python☆251Updated this week
- Massively multilingual pronunciation mining☆342Updated 2 weeks ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆20Updated 3 years ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆52Updated last year
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆14Updated 5 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 3 years ago
- SCOWL (and friends).☆422Updated 2 months ago
- Pipeline to generate the Standardized Project Gutenberg Corpus☆184Updated last year
- A browser-based tool to convert International Phonetic Alphabet (IPA) phonetic notation to speech using the meSpeak.js package☆272Updated 2 years ago
- A word list containing 25 000 of the most popular English words, divided into syllables.☆47Updated 9 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆50Updated 7 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆63Updated 2 months ago
- pronunciation dictionaries for multiple languages☆88Updated 7 years ago
- English Resource Grammar☆21Updated 3 weeks ago