smashew / NameDatabases
Text databases of last names from various countries
☆281 Updated 2 years ago
Alternatives and similar repositories for NameDatabases
Users who are interested in NameDatabases compare it to the libraries listed below.
- The Python library for names. ☆939 Updated 4 months ago
- ☆52 Updated last year
- Offline database of synonyms/thesaurus ☆200 Updated last year
- List of common stop words in various languages. ☆337 Updated 2 years ago
- Word lists from the web. ☆90 Updated 9 years ago
- SCOWL (and friends). ☆441 Updated last month
- All languages stopwords collection ☆453 Updated last year
- Default English stopword lists from many different sources ☆308 Updated 2 years ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆342 Updated 3 years ago
- English stopwords collection ☆163 Updated 8 years ago
- Full list of bad words and top swear words banned by Google. ☆657 Updated last month
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc. ☆632 Updated 4 years ago
- A CSV file with US given names (first name) and their associated nicknames or diminutive names. ☆306 Updated last month
- A JSON representation of Webster's Unabridged Dictionary ☆688 Updated 4 years ago
- A dataset of popular forenames and surnames by country ☆40 Updated 2 years ago
- Snowball compiler and stemming algorithms ☆807 Updated last week
- List of major cities of the world as a datapackage ☆263 Updated last month
- ☆82 Updated 2 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆252 Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles. ☆222 Updated last year
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/ ☆197 Updated 6 years ago
- Convert Wikipedia database dumps into plaintext files ☆322 Updated 4 years ago
- Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies. ☆855 Updated 2 years ago
- Convert number words (eg. twenty one) to numeric digits (21) ☆178 Updated 2 years ago
- A lightweight and easily readable context-free grammar generator! ☆20 Updated 6 years ago
- A compound word splitter for Python ☆48 Updated 4 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more ☆386 Updated 11 months ago
- RosaeNLG is a Natural Language Generation library for node.js and browser rendering, based on the Pug template engine. ☆100 Updated 8 months ago
- A creative commons dataset of trivia questions and answers ☆225 Updated last year
- ✔️ Contextual word checker for better suggestions (not actively maintained) ☆417 Updated 7 months ago