rspeer / wordfreqLinks
Access a database of word frequencies, in various natural languages.
☆1,606Updated last year
Alternatives and similar repositories for wordfreq
Users that are interested in wordfreq are comparing it to the libraries listed below
Sorting:
- The Open English WordNet☆706Updated 2 weeks ago
- A Python Wiktionary Parser☆371Updated 6 months ago
- Wiktionary dump file parser and multilingual data extractor☆1,083Updated last week
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆632Updated 4 years ago
- A Python parser for MediaWiki wikicode☆856Updated 6 months ago
- A modern, interlingual wordnet interface for Python☆279Updated 2 weeks ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,348Updated last month
- Machine-readable lists of lemma-token pairs in 23 languages.☆358Updated 4 years ago
- Gather modern English word frequencies from all enwiki articles.☆228Updated last year
- The Open Source Dictionary☆592Updated 10 months ago
- ☆865Updated 2 years ago
- Compact Language Detector 2☆890Updated 4 years ago
- Tatoeba is a platform whose purpose is to create a collaborative and open dataset of sentences and their translations.☆835Updated last week
- All languages stopwords collection☆476Updated 2 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 3 years ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,625Updated 2 months ago
- A Python library to parse MediaWiki WikiText☆315Updated 8 months ago
- hand-written dictionaries from the FreeDict project☆462Updated 6 months ago
- Heuristic based boilerplate removal tool☆811Updated 11 months ago
- SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm☆3,365Updated last week
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆856Updated last week
- Just the facts -- web page content extraction☆1,279Updated 6 months ago
- Port of Google's language-detection library to Python.☆1,870Updated 10 months ago
- Snowball compiler and stemming algorithms☆834Updated last month
- English Lemma Database - Compiled by Referencing British National Corpus☆36Updated last year
- A python module for English lemmatization and inflection.☆274Updated 2 years ago
- Fast and secure translation on your local machine, powered by marian and Bergamot.☆588Updated 10 months ago
- Article extraction benchmark: dataset and evaluation scripts☆351Updated 4 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆57Updated 4 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆863Updated 2 years ago