philipperemy / name-datasetLinks
The Python library for names.
☆926Updated 4 months ago
Alternatives and similar repositories for name-dataset
Users that are interested in name-dataset are comparing it to the libraries listed below
Sorting:
- Text databases of last names from various countries☆280Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆753Updated 3 weeks ago
- Spelling corrector in python☆486Updated last month
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 6 months ago
- 🧹 Python package for text cleaning☆983Updated 2 years ago
- 📛 Fuzzy Name Matching with Machine Learning☆264Updated last year
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆837Updated 3 months ago
- All languages stopwords collection☆451Updated last year
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆851Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation.☆774Updated last month
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆320Updated 2 weeks ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆866Updated 11 months ago
- Default English stopword lists from many different sources☆307Updated 2 years ago
- Single-document unsupervised keyword extraction☆1,764Updated 3 weeks ago
- Super Fast String Matching in Python☆369Updated 4 months ago
- A python utility for downloading Common Crawl data☆242Updated 2 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆385Updated 10 months ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,309Updated 3 weeks ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆251Updated 2 years ago
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.☆206Updated 6 months ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆342Updated 3 years ago
- Process Common Crawl data with Python and Spark☆442Updated 2 months ago
- Company Name Processor written in Python☆341Updated last year
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,020Updated last year
- Fuzzy matching and more functionality for spaCy.☆256Updated last year
- Full text geoparsing as a Python library☆750Updated 3 years ago
- A multilingual lexicon of words to hurt.☆90Updated 3 weeks ago
- Abydos NLP/IR library for Python☆188Updated 2 years ago
- Fixes contractions such as `you're` to `you are`☆317Updated 2 years ago