rspeer / wordfreq
Access a database of word frequencies, in various natural languages.
☆1,464Updated 4 months ago
Alternatives and similar repositories for wordfreq:
Users that are interested in wordfreq are comparing it to the libraries listed below
- The Open English WordNet☆546Updated last week
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆631Updated 3 years ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,271Updated 2 months ago
- A modern, interlingual wordnet interface for Python☆244Updated this week
- SCOWL (and friends).☆419Updated 3 weeks ago
- A Python library to inspect and modify the internal structure of a PDF file☆990Updated last week
- Heuristic based boilerplate removal tool☆769Updated 2 months ago
- A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.☆303Updated 5 months ago
- A Python Wiktionary Parser☆359Updated 2 months ago
- Multilingual text (NLP) processing toolkit☆2,333Updated last year
- Official implementation of the paper "Watermark Anything with Localized Messages"☆1,004Updated last month
- Compact Language Detector 2☆859Updated 3 years ago
- Pure Python spell-checker, (almost) full port of Hunspell☆290Updated last year
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- NLP, before and after spaCy☆2,225Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,768Updated this week
- Article extraction benchmark: dataset and evaluation scripts☆315Updated last year
- List of common stop words in various languages.☆337Updated 2 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆336Updated 3 years ago
- A fast, low-resource Natural Language Processing and Text Correction library written in Rust.☆624Updated last year
- tinyworldmap is a tiny world map for offline-first and low-bandwidth web apps☆1,414Updated last year
- Open Language Profiles — English profile datasets from CEFR-J☆123Updated 5 years ago
- Repository for Frequency Word List Generator and processed files☆1,263Updated 3 years ago
- SeekStorm - sub-millisecond full-text search library & multi-tenancy server in Rust☆1,673Updated last week
- ☆826Updated last year
- Python stemming library using snowball stemmers☆255Updated 6 months ago
- Just the facts -- web page content extraction☆1,263Updated 10 months ago
- Snowball compiler and stemming algorithms☆788Updated this week
- Python wrapper for LanguageTool grammar checker☆328Updated 3 years ago