orgtre / google-books-ngram-frequencyLinks
Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code
☆100Updated 2 years ago
Alternatives and similar repositories for google-books-ngram-frequency
Users that are interested in google-books-ngram-frequency are comparing it to the libraries listed below
Sorting:
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated 2 weeks ago
- All the words from Google Books, sorted by frequency☆120Updated 2 years ago
- The Open English WordNet☆673Updated this week
- A modern, interlingual wordnet interface for Python☆276Updated this week
- Lists of most-frequently-used english words / nouns / verbs etc.☆92Updated 5 years ago
- The World Atlas of Language Structures☆72Updated last year
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆51Updated 10 months ago
- Wiktionary dump file parser and multilingual data extractor☆1,050Updated this week
- Multilingual sentence alignment using sentence embeddings☆131Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆33Updated last year
- British English pronunciation dictionary☆96Updated 8 years ago
- A Python Wiktionary Parser☆367Updated 4 months ago
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels …☆40Updated last month
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 7 years ago
- A list of vocabulary lists☆22Updated 5 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆56Updated 4 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆353Updated 3 years ago
- Verb forms dictionary☆67Updated 8 years ago
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.☆193Updated 5 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser☆55Updated 9 years ago
- Monolingual wordlists with pronunciation information in IPA☆696Updated 6 months ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆27Updated 5 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆62Updated 10 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆180Updated 6 months ago
- A list of awesome Machine Translation frameworks, libraries, software and papers☆194Updated last year
- The Open Parallel Corpus☆79Updated last week
- Sentence aligner☆121Updated 4 years ago
- List of Chinese characters ordered by frequency rank (from most common to least common). Based on Jun Da's Modern Chinese Character Frequ…☆36Updated 2 years ago
- Crawler for linguistic corpora☆212Updated 3 months ago