solvenium / names-datasetLinks
A dataset of multinational first names and last names
☆26Updated 2 years ago
Alternatives and similar repositories for names-dataset
Users that are interested in names-dataset are comparing it to the libraries listed below
Sorting:
- Extract dates from text☆64Updated 4 years ago
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo…☆47Updated last year
- An Email Segmentation System☆9Updated 4 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- Python based Wikidata framework for easy dataframe extraction☆44Updated last year
- TeXoo – A Zoo of Text Extractors☆18Updated 5 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 10 months ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 6 years ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Scalable String Similarity Joins in Python☆39Updated 10 months ago
- Collaborative Synchronized Corpus Annotation Tool☆10Updated 6 years ago
- Meta-repository for the open-source version of the SUMMA Platform☆16Updated last year
- ☆81Updated 6 years ago
- A helper library full of URL-related heuristics.☆69Updated 2 months ago
- An authorship attribution project with particular emphasis on Twitter analysis☆16Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆49Updated 3 years ago
- Faster, modernized fork of the language identification tool langid.py☆56Updated 6 months ago
- A curated list of promising Web Data Extractors resources☆28Updated 5 years ago
- Privacy browser extension using machine learning to summarize privacy policies☆24Updated 8 months ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- simple rule based named entity recognition☆43Updated 3 years ago
- RxNLP APIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or URL, computing similarity …☆15Updated 5 years ago
- 📜Neural Text Simplification to Improve Chatbot Performance☆13Updated 6 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated this week
- Gender detection toolkit from names written in python and bash☆5Updated last year