smashew / NameDatabases
Text databases of last names from various countries
☆279Updated 2 years ago
Alternatives and similar repositories for NameDatabases:
Users that are interested in NameDatabases are comparing it to the libraries listed below
- ☆50Updated 7 months ago
- The Python library for names.☆861Updated 3 months ago
- 📦 A list, huge one (~200K) of human male/female first/last names.☆44Updated last year
- A dataset of multinational first names and last names☆26Updated last year
- SymSpellCompound: compound aware automatic spelling correction☆66Updated 6 years ago
- ☆323Updated 6 years ago
- a collection of functions that measure the readability of a given body of text☆191Updated 7 years ago
- MorphoDiTa: Morphologic Dictionary and Tagger☆71Updated last year
- A lightweight and easily readable context-free grammar generator!☆20Updated 6 years ago
- Working with hOCR in Javascript☆123Updated last year
- ☆80Updated last year
- Abydos NLP/IR library for Python☆184Updated 2 years ago
- Some labeled training and test data for email intent machine learning (based on sentence-level speech acts)☆108Updated 10 years ago
- A hypothetical proof-of-concept book recommendation system for Project Gutenberg, using Natural Language Processing.☆11Updated 8 years ago
- Spacy NER annotator using ipywidgets☆120Updated 10 months ago
- Stylometry library for Burrows' Delta method☆33Updated 8 months ago
- A Stylometry Library for Python☆137Updated last year
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.☆434Updated last year
- Genderizer is a language independent module which tries to detect gender by looking given first names and/or analyzing sample texts.☆64Updated 10 years ago
- Gather modern English word frequencies from all enwiki articles.☆206Updated 10 months ago
- Fast approximate strings search & spelling correction☆57Updated 3 years ago
- small Java library for splitting German compound words☆61Updated 8 months ago
- A multilingual lexicon of words to hurt.☆82Updated 2 months ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆138Updated 2 years ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 5 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 6 months ago
- German sentiment scores with SentiWS as extension for spaCy☆36Updated 2 years ago
- Offline database of synonyms/thesaurus☆192Updated last year
- FreeLing project source code☆252Updated last year
- A set of tools for topical text classification and scaling☆19Updated last year