smashew / NameDatabases
Text databases of last names from various countries
☆280 Updated 2 years ago
Alternatives and similar repositories for NameDatabases
Users who are interested in NameDatabases are comparing it to the libraries listed below.
- ☆52 Updated last year
- The Python library for names. ☆926 Updated 4 months ago
- Default English stopword lists from many different sources ☆307 Updated 2 years ago
- Word lists from the web. ☆90 Updated 9 years ago
- Offline database of synonyms/thesaurus ☆200 Updated last year
- A dataset of popular forenames and surnames by country ☆40 Updated 2 years ago
- A very long list of English profanity. ☆268 Updated 7 months ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆342 Updated 3 years ago
- List of common stop words in various languages. ☆337 Updated 2 years ago
- ☆81 Updated last month
- A dataset of multinational first names and last names ☆26 Updated 2 years ago
- Snowball compiler and stemming algorithms ☆805 Updated last week
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆272 Updated 2 years ago
- Extended list of German stopwords for use in Web Projects, Search Engines or every thing else. ☆104 Updated 5 years ago
- Gather modern English word frequencies from all enwiki articles. ☆222 Updated last year
- Convert Wikipedia database dumps into plaintext files ☆321 Updated 4 years ago
- English stopwords collection ☆163 Updated 8 years ago
- All languages stopwords collection ☆451 Updated last year
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies. ☆851 Updated 2 years ago
- A simple interface to the Project Gutenberg corpus. ☆330 Updated 2 years ago
- ✔️ Contextual word checker for better suggestions (not actively maintained) ☆417 Updated 6 months ago
- Python wrapper for Wikipedia ☆690 Updated last week
- Full list of US states and cities ☆284 Updated last year
- A multilingual lexicon of words to hurt. ☆90 Updated 3 weeks ago
- A set of utility scripts to process Wikipedia related data ☆38 Updated 3 years ago
- A Python Wiktionary Parser ☆362 Updated 3 weeks ago
- A comprehensive database of name variants ☆47 Updated 3 years ago
- Stopwords for 50 languages in JSON format ☆432 Updated 2 years ago
- ☆234 Updated 8 years ago
- Blazingly fast cleaning swear words (and their leetspeak) in strings ☆223 Updated last year