rspeer / wordfreq
Access a database of word frequencies, in various natural languages.
☆1,491 Updated 5 months ago
Alternatives and similar repositories for wordfreq
Users who are interested in wordfreq compare it to the libraries listed below.
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles. ☆1,299 Updated 2 weeks ago
- Wiktionary dump file parser and multilingual data extractor ☆940 Updated 2 weeks ago
- SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm ☆3,259 Updated 2 months ago
- A Python Wiktionary Parser ☆361 Updated 4 months ago
- SCOWL (and friends). ☆422 Updated 2 months ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆340 Updated 3 years ago
- Pure Python spell-checker, (almost) full port of Hunspell ☆291 Updated last year
- NLP, before and after spaCy ☆2,226 Updated last year
- Gather modern English word frequencies from all enwiki articles. ☆216 Updated last year
- Heuristic based boilerplate removal tool ☆785 Updated 4 months ago
- Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate" etc. ☆632 Updated 4 years ago
- A modern, interlingual wordnet interface for Python ☆251 Updated last week
- Multilingual text (NLP) processing toolkit ☆2,345 Updated last year
- spellchecking library for python ☆610 Updated last year
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆831 Updated 2 months ago
- Compact Language Detector 2 ☆864 Updated 4 years ago
- The Open English WordNet ☆576 Updated last week
- A tool for extracting plain text from Wikipedia dumps ☆3,878 Updated last year
- Crawler for linguistic corpora ☆204 Updated last year
- hand-written dictionaries from the FreeDict project ☆420 Updated 8 months ago
- enchant spellchecking library ☆366 Updated this week
- Python stemming library using snowball stemmers ☆262 Updated 3 weeks ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆380 Updated 7 months ago
- ☆836 Updated 2 years ago
- 🦆 Contextually-keyed word vectors ☆1,655 Updated 2 months ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more ☆384 Updated 9 months ago
- A python module for English lemmatization and inflection. ☆268 Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆52 Updated 3 years ago
- All languages stopwords collection ☆449 Updated last year
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine ☆186 Updated 4 years ago