philipperemy / name-dataset
The Python library for names.
☆956 · Updated 7 months ago
Alternatives and similar repositories for name-dataset
Users who are interested in name-dataset are comparing it to the libraries listed below.
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies. ☆860 · Updated 2 years ago
- Text databases of last names from various countries ☆280 · Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆848 · Updated last month
- ✔️ Contextual word checker for better suggestions (not actively maintained) ☆416 · Updated 9 months ago
- 🧹 Python package for text cleaning ☆997 · Updated 2 years ago
- Company Name Processor written in Python ☆344 · Updated last year
- All languages stopwords collection ☆462 · Updated last year
- Fuzzy string matching, grouping, and evaluation. ☆784 · Updated 4 months ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how? ☆527 · Updated last year
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text ☆1,545 · Updated 2 weeks ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆763 · Updated last month
- Fixes contractions such as `you're` to `you are` ☆318 · Updated 2 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more ☆392 · Updated last year
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆272 · Updated 2 years ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆347 · Updated 3 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆506 · Updated last year
- Super Fast String Matching in Python ☆370 · Updated 7 months ago
- 📛 Fuzzy Name Matching with Machine Learning ☆265 · Updated last year
- Abydos NLP/IR library for Python ☆192 · Updated 2 years ago
- Heuristic based boilerplate removal tool ☆803 · Updated 8 months ago
- Full text geoparsing as a Python library ☆753 · Updated 4 years ago
- Spelling corrector in python ☆488 · Updated 4 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆141 · Updated last week
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆255 · Updated 3 years ago
- Ultimate Website Sitemap Parser ☆229 · Updated last week
- Article extraction benchmark: dataset and evaluation scripts ☆336 · Updated last month
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles. ☆1,325 · Updated this week
- Fuzzy matching and more functionality for spaCy. ☆258 · Updated last year
- Pipeline to generate the Standardized Project Gutenberg Corpus ☆203 · Updated last year
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches. ☆222 · Updated 2 years ago