david47k / top-english-wordlistsLinks
Lists of most-frequently-used english words / nouns / verbs etc.
☆85Updated 5 years ago
Alternatives and similar repositories for top-english-wordlists
Users that are interested in top-english-wordlists are comparing it to the libraries listed below
Sorting:
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆44Updated 8 months ago
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.☆194Updated 5 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆93Updated 2 years ago
- British English pronunciation dictionary☆95Updated 7 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆345Updated 3 years ago
- Gather modern English word frequencies from all enwiki articles.☆226Updated last year
- All the words from Google Books, sorted by frequency☆118Updated 2 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆24Updated 8 years ago
- Convert phoneme codes and lexicon formats for English speech synths☆47Updated 3 months ago
- A word list containing 25,000 of the most common English words, divided into syllables.☆51Updated 3 months ago
- An even smaller speech recognizer / force aligner☆36Updated 10 months ago
- Customizable machine translation in C++☆53Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆32Updated last year
- 📦 A list, huge one (~200K) of human male/female first/last names.☆54Updated last year
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆52Updated last year
- Converts English text to IPA notation☆390Updated 2 years ago
- Monolingual wordlists with pronunciation information in IPA☆678Updated 4 months ago
- Local cross-platform machine translation GUI, based on CTranslate2☆96Updated last year
- A program that sets the stress and the letter ё of Russian text and ebooks using Wiktionary data and grammar analysis.☆31Updated last year
- A list of vocabulary lists☆22Updated 5 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆31Updated 3 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆100Updated 2 months ago
- A list of awesome Machine Translation frameworks, libraries, software and papers☆191Updated last year
- CMUdict maintenance, and tools☆232Updated 9 months ago
- Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words☆98Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated last week
- Model for recasing and repunctuating ASR transcripts☆139Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Updated 2 years ago
- A list of the most popular English words.☆383Updated 3 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 5 years ago