david47k / top-english-wordlists
Lists of most-frequently-used english words / nouns / verbs etc.
☆57Updated 4 years ago
Alternatives and similar repositories for top-english-wordlists:
Users that are interested in top-english-wordlists are comparing it to the libraries listed below
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- British English pronunciation dictionary☆92Updated 7 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆23Updated 7 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆98Updated last week
- Inflecting Finnish words (verb inflection, comparatives, cases, possessive suffixes, clitics) using Wiktionary-compatible declensions and…☆31Updated 4 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆29Updated 5 months ago
- Script and sample dataset of all urban dictionary entry names (around 1.4 million total)☆87Updated 2 years ago
- Translate HTML using Argos Translate☆50Updated last year
- Fifteen Thousand Useful Phrases, by Greenville Kleiser☆54Updated 8 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆62Updated last year
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆50Updated last year
- Extracts plain text, language identification and more metadata from WARC records☆21Updated last week
- All the words from Google Books, sorted by frequency☆114Updated last year
- A list of vocabulary lists☆21Updated 4 years ago
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- The Open Parallel Corpus☆66Updated last week
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆19Updated last year
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆28Updated last month
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆20Updated 3 years ago
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆45Updated 2 years ago
- Probably the most advanced command-line english dictionary ever.☆38Updated 5 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Multilingual sentence alignment using sentence embeddings☆110Updated 4 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆50Updated last month
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- Etymological graphs based on Wiktionary dumps☆19Updated last week
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- A list of awesome Machine Translation frameworks, libraries, software and papers☆189Updated 7 months ago