david47k / top-english-wordlists
Lists of most-frequently-used english words / nouns / verbs etc.
☆54Updated 4 years ago
Alternatives and similar repositories for top-english-wordlists:
Users that are interested in top-english-wordlists are comparing it to the libraries listed below
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆23Updated 7 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆56Updated last year
- All the words from Google Books, sorted by frequency☆112Updated last year
- A list of vocabulary lists☆21Updated 4 years ago
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated this week
- British English pronunciation dictionary☆92Updated 7 years ago
- Scrapes Google Books Ngram data to create a long word list☆13Updated 11 months ago
- Aksharamukha Python Library☆44Updated 3 months ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆26Updated this week
- WordNet in JSON format.☆91Updated 4 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆32Updated last week
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆61Updated 3 weeks ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆47Updated last year
- Machine-readable lists of lemma-token pairs in 23 languages.☆335Updated 3 years ago
- PyDictionary is an offline English dictionary made using Python along with the Wordnet Lexical Database and Enchant Spell Dictionary. The…☆17Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- Gather modern English word frequencies from all enwiki articles.☆206Updated 10 months ago
- Multilingual sentence alignment using sentence embeddings☆106Updated 2 months ago
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 4 months ago
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆50Updated 2 weeks ago
- ☆25Updated last year
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆18Updated 3 years ago
- An even smaller speech recognizer / force aligner☆32Updated last month
- 📦 A list, huge one (~200K) of human male/female first/last names.☆44Updated last year
- 🏆 • 5050 most frequent words in 109 languages☆40Updated 2 years ago
- Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.☆11Updated 3 years ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆150Updated last month
- 😎 Curated list of Tibetan NLP projects☆36Updated 4 years ago