smashew / NameDatabasesLinks
Text databases of last names from various countries
☆280Updated 2 years ago
Alternatives and similar repositories for NameDatabases
Users that are interested in NameDatabases are comparing it to the libraries listed below
Sorting:
- The Python library for names.☆956Updated 7 months ago
- ☆52Updated last year
- Offline database of synonyms/thesaurus☆204Updated last year
- Machine-readable lists of lemma-token pairs in 23 languages.☆347Updated 3 years ago
- Default English stopword lists from many different sources☆309Updated 2 years ago
- List of common stop words in various languages.☆339Updated last week
- All languages stopwords collection☆462Updated last year
- A dataset of multinational first names and last names☆27Updated 2 years ago
- English stopwords collection☆163Updated 9 years ago
- A CSV file with US given names (first name) and their associated nicknames or diminutive names.☆306Updated 3 months ago
- Full list of US states and cities☆286Updated last year
- WordNet in JSON format.☆93Updated 5 years ago
- Snowball compiler and stemming algorithms☆814Updated this week
- Extended list of German stopwords for use in Web Projects, Search Engines or every thing else.☆104Updated last week
- A dataset of 200k English plaintext jokes.☆616Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆527Updated last year
- Modern spell checking library - accurate, fast, multi-language☆652Updated last year
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- A Python library for detecting and filtering profanity☆166Updated 4 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆389Updated 3 months ago
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/☆204Updated 7 years ago
- Gather modern English word frequencies from all enwiki articles.☆226Updated last year
- ✔️Contextual word checker for better suggestions (not actively maintained)☆416Updated 9 months ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆378Updated 2 years ago
- A comprehensive database of name variants☆46Updated 3 years ago
- Convert Wikipedia database dumps into plaintext files☆326Updated 4 years ago
- A dataset of popular forenames and surnames by country☆48Updated 2 years ago
- A compound word splitter for Python☆49Updated 4 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆392Updated last year
- A list of the most popular English words.☆383Updated 3 years ago