rspeer / wordfreq
Access a database of word frequencies, in various natural languages.
☆1,446 Updated 2 months ago
Alternatives and similar repositories for wordfreq:
Users who are interested in wordfreq are comparing it to the libraries listed below.
- A modern, interlingual wordnet interface for Python ☆235 Updated 3 weeks ago
- The Open English WordNet ☆521 Updated last month
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆735 Updated 2 weeks ago
- Wiktionary dump file parser and multilingual data extractor ☆873 Updated this week
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆375 Updated 4 months ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine ☆186 Updated 4 years ago
- A Python parser for MediaWiki wikicode ☆783 Updated 2 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago
- A Python Wiktionary Parser ☆358 Updated 3 weeks ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box. ☆839 Updated 7 months ago
- ☆810 Updated last year
- SCOWL (and friends). ☆416 Updated 6 months ago
- A Python library to parse MediaWiki WikiText ☆301 Updated 5 months ago
- Heuristic based boilerplate removal tool ☆764 Updated 3 weeks ago
- Rapid fuzzy string matching in Python using various string metrics ☆2,962 Updated this week
- Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate", etc. ☆630 Updated 3 years ago
- NLP, before and after spaCy ☆2,216 Updated last year
- Multilingual word vectors in 78 languages ☆1,197 Updated 2 years ago
- Compact Language Detector 2 ☆853 Updated 3 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆821 Updated this week
- Port of Google's language-detection library to Python. ☆1,768 Updated 2 weeks ago
- Abydos NLP/IR library for Python ☆185 Updated 2 years ago
- Bitextor generates translation memories from multilingual websites ☆291 Updated 4 months ago
- Python package to calculate readability statistics of a text object (paragraphs, sentences, articles). ☆1,254 Updated 2 weeks ago
- Kagi Small Web ☆680 Updated this week
- Stand-alone language identification system ☆2,367 Updated 5 years ago
- Repository for Frequency Word List Generator and processed files ☆1,231 Updated 3 years ago
- 🪼 A Python library for doing approximate and phonetic matching of strings. ☆2,113 Updated 2 months ago
- Sentence aligner ☆112 Updated 3 years ago
- Multilingual text (NLP) processing toolkit ☆2,328 Updated last year