philipperemy / name-datasetLinks
The Python library for names.
☆951Updated 6 months ago
Alternatives and similar repositories for name-dataset
Users that are interested in name-dataset are comparing it to the libraries listed below
Sorting:
- All languages stopwords collection☆458Updated last year
- Company Name Processor written in Python☆341Updated last year
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.☆858Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆760Updated last month
- Process Common Crawl data with Python and Spark☆442Updated 3 weeks ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆345Updated 3 years ago
- Text databases of last names from various countries☆281Updated 2 years ago
- 📛 Fuzzy Name Matching with Machine Learning☆264Updated last year
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 8 months ago
- Article extraction benchmark: dataset and evaluation scripts☆334Updated 3 weeks ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆323Updated 2 months ago
- Heuristic based boilerplate removal tool☆798Updated 7 months ago
- 🧹 Python package for text cleaning☆996Updated 2 years ago
- A package to structure Australian addresses☆196Updated 3 years ago
- A CSV file with US given names (first name) and their associated nicknames or diminutive names.☆305Updated 2 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆847Updated last month
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆739Updated last year
- Fuzzy string matching, grouping, and evaluation.☆783Updated 3 months ago
- Fuzzy matching and more functionality for spaCy.☆258Updated last year
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆390Updated last year
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆526Updated 11 months ago
- Library for unit extraction - fork of quantulum for python3☆142Updated last year
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,025Updated last year
- Offline database of synonyms/thesaurus☆202Updated last year
- Spelling corrector in python☆486Updated 3 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- Super Fast String Matching in Python☆369Updated 7 months ago
- Index Common Crawl archives in tabular format☆122Updated 2 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago