solvenium / names-datasetLinks
A dataset of multinational first names and last names
☆26Updated 2 years ago
Alternatives and similar repositories for names-dataset
Users that are interested in names-dataset are comparing it to the libraries listed below
Sorting:
- Fast and robust date extraction from web pages, with Python or on the command-line☆141Updated last month
- Record Linkage ToolKit (Find and link entities)☆110Updated 2 years ago
- Now included in rigour☆151Updated 2 weeks ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆62Updated this week
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo…☆47Updated 2 years ago
- A helper library full of URL-related heuristics.☆70Updated this week
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Index Common Crawl archives in tabular format☆122Updated last month
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆184Updated this week
- Extracting addresses from text☆42Updated 7 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- Python wrapper library for the Datamuse API☆80Updated 2 years ago
- A comprehensive database of name variants☆47Updated 3 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆124Updated last year
- An email segmentation system (reference implementation of ECIR 2018 paper)☆10Updated 5 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of diff…☆88Updated 3 years ago
- This repository contains an implementation of a US address parser built using spaCy NLP library.☆38Updated 2 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Document Search Engine Tool☆74Updated 2 years ago
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents☆29Updated 6 years ago
- Extract dates from text☆65Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Abydos NLP/IR library for Python☆190Updated 2 years ago
- A curated list of promising Web Data Extractors resources☆29Updated 5 years ago
- A Python Package which helps to scrape all news details from any news websites☆217Updated 3 months ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆63Updated 8 months ago
- Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and more☆20Updated 6 years ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆156Updated 2 months ago