rspeer / wordfreqLinks
Access a database of word frequencies, in various natural languages.
☆1,593Updated last year
Alternatives and similar repositories for wordfreq
Users that are interested in wordfreq are comparing it to the libraries listed below
Sorting:
- The Open English WordNet☆689Updated this week
- A Python Wiktionary Parser☆369Updated 5 months ago
- Wiktionary dump file parser and multilingual data extractor☆1,062Updated this week
- A Python parser for MediaWiki wikicode☆853Updated 6 months ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,339Updated 2 weeks ago
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- All languages stopwords collection☆472Updated 2 years ago
- A modern, interlingual wordnet interface for Python☆277Updated this week
- Heuristic based boilerplate removal tool☆809Updated 10 months ago
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.☆1,218Updated last month
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆631Updated 4 years ago
- Article extraction benchmark: dataset and evaluation scripts☆344Updated 3 months ago
- extract text from any document. no muss. no fuss.☆4,414Updated last year
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,180Updated 3 weeks ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆377Updated 3 years ago
- SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm☆3,348Updated 2 months ago
- ☆858Updated 2 years ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,609Updated last month
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆162Updated last year
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆855Updated last month
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆860Updated 2 years ago
- Compact Language Detector 2☆885Updated 4 years ago
- The most popular spellchecking library.☆2,406Updated 3 months ago
- Cross platform C++ library focusing on optimized machine translation on the consumer-grade device.☆488Updated last year
- A Python library to parse MediaWiki WikiText☆316Updated 7 months ago
- hand-written dictionaries from the FreeDict project☆456Updated 5 months ago
- Bitextor generates translation memories from multilingual websites☆299Updated last year
- Internet search engine for text-oriented websites. Indexing the small, old and weird web.☆1,661Updated 3 weeks ago
- Python stemming library using snowball stemmers☆275Updated 3 weeks ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆767Updated last month