philipperemy / name-datasetLinks
The Python library for names.
☆939Updated 4 months ago
Alternatives and similar repositories for name-dataset
Users that are interested in name-dataset are comparing it to the libraries listed below
Sorting:
- Text databases of last names from various countries☆281Updated 2 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆855Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆839Updated 2 weeks ago
- All languages stopwords collection☆453Updated last year
- Company Name Processor written in Python☆341Updated last year
- Single-document unsupervised keyword extraction☆1,776Updated this week
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆755Updated last week
- 🧹 Python package for text cleaning☆985Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- Spelling corrector in python☆486Updated last month
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆386Updated 11 months ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆871Updated last year
- Fixes contractions such as `you're` to `you are`☆317Updated 2 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆342Updated 3 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 7 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,024Updated last year
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆175Updated 5 years ago
- Heuristic based boilerplate removal tool☆793Updated 6 months ago
- Full text geoparsing as a Python library☆751Updated 3 years ago
- Super Fast String Matching in Python☆369Updated 5 months ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆322Updated last month
- Fuzzy string matching, grouping, and evaluation.☆780Updated last month
- NeuSpell: A Neural Spelling Correction Toolkit☆696Updated 2 years ago
- Article extraction benchmark: dataset and evaluation scripts☆321Updated last year
- Process Common Crawl data with Python and Spark☆440Updated this week
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,477Updated 2 months ago
- Offline database of synonyms/thesaurus☆200Updated last year
- Python bindings to libpostal for fast international address parsing/normalization☆840Updated 6 months ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,312Updated this week
- Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.☆539Updated 4 months ago