david47k / top-english-wordlists
Lists of the most frequently used English words / nouns / verbs etc.
☆49 Updated 4 years ago
Related projects
Alternatives and complementary repositories for top-english-wordlists
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆22 Updated 7 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code ☆50 Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆94 Updated this week
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code ☆23 Updated 2 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser ☆51 Updated 8 years ago
- A list of vocabulary lists ☆21 Updated 4 years ago
- Scrapes Google Books Ngram data to create a long word list ☆13 Updated 8 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. ☆48 Updated 2 months ago
- An SQL file of the Oxford English Dictionary. It includes more than 41,000 words! Just import the SQL. ☆39 Updated 9 years ago
- Multilingual sentence alignment using sentence embeddings ☆101 Updated 2 weeks ago
- British English pronunciation dictionary ☆89 Updated 7 years ago
- Open Language Profiles — English profile datasets from CEFR-J ☆103 Updated 4 years ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries. ☆144 Updated 7 months ago
- Pronunciation dictionaries for several languages, based on Wiktionary data. ☆16 Updated 2 years ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking. ☆47 Updated 10 months ago
- PyDictionary is an offline English dictionary made using Python along with the Wordnet Lexical Database and Enchant Spell Dictionary. The… ☆16 Updated 3 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896 ☆43 Updated 5 years ago
- Customizable machine translation in C++ ☆43 Updated 7 months ago
- Tokenizes Chinese texts into words. ☆95 Updated last year
- All the words from Google Books, sorted by frequency ☆109 Updated last year
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary ☆67 Updated 3 years ago
- The Unicode Cookbook for Linguists ☆53 Updated 4 years ago
- A versioned Python wrapper package for cmudict (https://github.com/cmusphinx/cmudict). ☆62 Updated 2 months ago
- Improved Sentence Alignment in Linear Time and Space ☆163 Updated last year
- ☆26 Updated 2 years ago
- A text file containing English words, along with the definition, parts of speech (noun, verb, adjective, etc.), and a link to the url where … ☆10 Updated 6 months ago
- Etymological graphs based on Wiktionary dumps ☆18 Updated last year
- Python code based on the Scrapy package to crawl well-known online dictionaries such as Oxford, Longman, Cambridge, Webster, and Collins t… ☆100 Updated last year
- Translate HTML using Argos Translate ☆49 Updated last year
- Gather modern English word frequencies from all enwiki articles. ☆204 Updated 8 months ago