philipperemy / name-dataset
The Python library for names.
☆897Updated 2 weeks ago
Alternatives and similar repositories for name-dataset:
Users that are interested in name-dataset are comparing it to the libraries listed below
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,266Updated last month
- All languages stopwords collection☆439Updated last year
- Heuristic based boilerplate removal tool☆766Updated 2 months ago
- Fuzzy string matching, grouping, and evaluation.☆759Updated 2 months ago
- Single-document unsupervised keyword extraction☆1,713Updated last month
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,331Updated last month
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆154Updated 5 months ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆311Updated 2 months ago
- A modern, interlingual wordnet interface for Python☆244Updated this week
- Offline database of synonyms/thesaurus☆195Updated last year
- A library implementing different string similarity and distance measures using Python.☆1,005Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆824Updated this week
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆268Updated last year
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆731Updated 8 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- a python library for parsing unstructured western names into name components.☆605Updated 5 months ago
- Spelling corrector in python☆480Updated 3 months ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆846Updated 8 months ago
- 🧹 Python package for text cleaning☆975Updated last year
- Multilingual text (NLP) processing toolkit☆2,332Updated last year
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆740Updated last month
- A Python library for calculating a large variety of metrics from text☆337Updated 4 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆998Updated last year
- Compact Language Detector 2☆859Updated 3 years ago
- Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.☆1,067Updated 2 years ago
- Text databases of last names from various countries☆280Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆519Updated 6 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆140Updated 4 months ago
- Full text geoparsing as a Python library☆748Updated 3 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆151Updated last year