hermitdave / FrequencyWords
Repository for Frequency Word List Generator and processed files
☆1,204Updated 2 years ago
Alternatives and similar repositories for FrequencyWords:
Users that are interested in FrequencyWords are comparing it to the libraries listed below
- Wiktionary dump file parser and multilingual data extractor☆847Updated this week
- A Python Wiktionary Parser☆360Updated last year
- Converts English text to IPA notation☆371Updated last year
- Monolingual wordlists with pronunciation information in IPA☆574Updated last year
- Access a database of word frequencies, in various natural languages.☆1,425Updated 2 weeks ago
- Gather modern English word frequencies from all enwiki articles.☆207Updated 10 months ago
- List of the most common words in many languages☆164Updated last week
- Machine-readable lists of lemma-token pairs in 23 languages.☆335Updated 2 years ago
- A list of the most popular English words.☆364Updated 2 years ago
- Small example scripts for working with Japanese texts in Python☆26Updated 5 years ago
- Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.☆738Updated last month
- Modern spell checking library - accurate, fast, multi-language☆621Updated 4 months ago
- SCOWL (and friends).☆405Updated 4 months ago
- hand-written dictionaries from the FreeDict project☆401Updated 3 months ago
- LingPy: Python library for quantitative tasks in historical linguistics☆128Updated last year
- Crawler for linguistic corpora☆197Updated last year
- All languages stopwords collection☆426Updated last year
- All the words from Google Books, sorted by frequency☆112Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 3 months ago
- Use SRT subtitle files to study foreign languages (in progress)☆309Updated 7 months ago
- Universal Dependencies online documentation☆278Updated this week
- Sentence aligner☆109Updated 3 years ago
- A multilingual parallel corpus created from translations of the Bible.☆177Updated 3 months ago
- Anki plugin that reorders language cards based on the words you know☆261Updated last year
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆372Updated last month
- Dynamic JavaScript version of phpSyntaxTree - a tool to draw syntax trees from labelled bracket notation.☆84Updated 10 months ago
- CMU US English Dictionary☆643Updated last month
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)☆671Updated 4 months ago
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆278Updated last month
- Simple sentence mining tool for language learning☆410Updated last month