philipperemy / name-datasetLinks
The Python library for names.
☆965Updated 9 months ago
Alternatives and similar repositories for name-dataset
Users that are interested in name-dataset are comparing it to the libraries listed below
Sorting:
- All languages stopwords collection☆472Updated 2 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆860Updated 2 years ago
- Company Name Processor written in Python☆350Updated 3 weeks ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆768Updated last month
- 🧹 Python package for text cleaning☆999Updated 2 years ago
- Heuristic based boilerplate removal tool☆810Updated 10 months ago
- Fuzzy string matching, grouping, and evaluation.☆787Updated 6 months ago
- Article extraction benchmark: dataset and evaluation scripts☆345Updated 3 months ago
- Text databases of last names from various countries☆280Updated 3 years ago
- Spelling corrector in python☆492Updated 6 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆418Updated 11 months ago
- 📛 Fuzzy Name Matching with Machine Learning☆266Updated last year
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆855Updated last month
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆512Updated last year
- A spaCy pipeline and model for NLP on unstructured legal text.☆669Updated last year
- Super Fast String Matching in Python☆371Updated 9 months ago
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,612Updated last month
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆400Updated last year
- Abydos NLP/IR library for Python☆193Updated 3 years ago
- Single-document unsupervised keyword extraction☆1,809Updated last month
- Information extraction from English and German texts based on predicate logic☆393Updated 3 years ago
- Python wrapper for Wikipedia☆710Updated 2 weeks ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- Process Common Crawl data with Python and Spark☆452Updated last month
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆744Updated last year
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆329Updated 2 months ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆528Updated last year
- A Python library for calculating a large variety of metrics from text☆359Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Updated 4 months ago
- NLP, before and after spaCy☆2,236Updated 2 years ago