solvenium / names-dataset
A dataset of multinational first names and last names
☆27 · Updated 2 years ago
Alternatives and similar repositories for names-dataset
Users who are interested in names-dataset compare it to the libraries listed below.
- Extract dates from text ☆65 · Updated 4 years ago
- Now included in rigour ☆152 · Updated last month
- Fast and robust date extraction from web pages, with Python or on the command-line ☆141 · Updated 2 months ago
- Tag news stories based on models trained on the NYT corpus. ☆42 · Updated 2 years ago
- Record Linkage ToolKit (Find and link entities) ☆109 · Updated 2 years ago
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo… ☆47 · Updated 2 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t… ☆33 · Updated 2 years ago
- This repository contains an implementation of a US address parser built using spaCy NLP library. ☆38 · Updated 2 years ago
- 🚀GUI for training spaCy models ☆55 · Updated 4 years ago
- Trying to generate name synonyms from wikidata ☆34 · Updated 5 years ago
- Matrix-based News Aggregation to Explore Media Bias ☆19 · Updated 7 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions. ☆125 · Updated last year
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec. ☆108 · Updated this week
- Index Common Crawl archives in tabular format ☆122 · Updated 2 months ago
- An email segmentation system (reference implementation of ECIR 2018 paper) ☆10 · Updated 5 years ago
- A comprehensive database of name variants ☆47 · Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆71 · Updated 2 years ago
- Abydos NLP/IR library for Python ☆191 · Updated 2 years ago
- Extracting addresses from text ☆42 · Updated 7 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆63 · Updated this week
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 · Updated last year
- GraphiPy: Universal Social Data Extractor ☆82 · Updated 2 years ago
- A helper library full of URL-related heuristics. ☆73 · Updated 3 weeks ago
- Python wrapper library for the Datamuse API ☆80 · Updated 2 years ago
- Tool to extract the text from web article URLs and get word frequencies, entity recognition, automatic summaries, and more ☆20 · Updated 6 years ago
- Example of building a working Spanish-to-English translation model with Marian NMT ☆23 · Updated 5 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words. ☆36 · Updated 2 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values … ☆15 · Updated 7 years ago
- Extract text from HTML ☆134 · Updated 5 years ago
- Extraction Toolkit ☆83 · Updated 3 years ago