rspeer / wordfreq
Access a database of word frequencies, in various natural languages.
☆1,425 · Updated 2 weeks ago
Alternatives and similar repositories for wordfreq:
Users who are interested in wordfreq are comparing it to the libraries listed below.
- A Python Wiktionary Parser ☆360 · Updated last year
- A python module for English lemmatization and inflection. ☆265 · Updated last year
- Gather modern English word frequencies from all enwiki articles. ☆207 · Updated 10 months ago
- The Open English WordNet ☆493 · Updated this week
- All languages stopwords collection ☆426 · Updated last year
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆810 · Updated 3 weeks ago
- ☆807 · Updated last year
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors ☆498 · Updated 5 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 · Updated 3 years ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc. ☆629 · Updated 3 years ago
- Offline database of synonyms/thesaurus ☆192 · Updated 11 months ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles. ☆1,171 · Updated last week
- Repository for Frequency Word List Generator and processed files ☆1,204 · Updated 2 years ago
- A tool for extracting plain text from Wikipedia dumps ☆3,788 · Updated 7 months ago
- A fast, robust Python library to check for offensive language in strings. ☆635 · Updated 5 months ago
- spellchecking library for python ☆603 · Updated 6 months ago
- Multilingual text (NLP) processing toolkit ☆2,317 · Updated last year
- Beautiful visualizations of how language differs among document types. ☆2,272 · Updated 3 months ago
- Bitextor generates translation memories from multilingual websites ☆293 · Updated 2 months ago
- extract text from any document. no muss. no fuss. ☆3,956 · Updated last month
- A modern, interlingual wordnet interface for Python ☆229 · Updated last month
- Fast and secure translation on your local machine, powered by marian and Bergamot. ☆517 · Updated 2 months ago
- All the words from Google Books, sorted by frequency ☆112 · Updated last year
- Python parser for SubRip (srt) files ☆459 · Updated last year
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆372 · Updated last month
- The CMU Link Grammar natural language parser ☆390 · Updated 7 months ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆722 · Updated 3 weeks ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. ☆747 · Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆368 · Updated 2 years ago
- Sentence aligner ☆109 · Updated 3 years ago