tfmorris / Names
A comprehensive database of name variants
☆46Updated 2 years ago
Alternatives and similar repositories for Names:
Users that are interested in Names are comparing it to the libraries listed below
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- A CSV file with US given names (first name) and their associated nicknames or diminutive names.☆296Updated 4 months ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆149Updated 2 months ago
- Korpuslinguistik war noch nie so einfach...☆23Updated 3 weeks ago
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆204Updated this week
- Loading OpenSanctions into Neo4J and Linkurious☆28Updated 3 months ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 4 years ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- A simple OpenRefine reconciliation service that runs on top of a CSV file☆120Updated 9 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,…☆75Updated 3 months ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- Social Feed Manager user interface application.☆155Updated 9 months ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Genderizer is a language independent module which tries to detect gender by looking given first names and/or analyzing sample texts.☆65Updated 10 years ago
- Wikidata authority file mapping tool☆11Updated 6 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- The linked open dataset described at http://datahub.io/dataset/vu-wordnet, and the tools used to create it☆25Updated 4 years ago
- Extract Data from Wikipedia Lists☆31Updated 7 years ago
- An implementation of latent Dirichlet allocation in javascript☆183Updated 2 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Near-duplicate detection tool☆23Updated 8 years ago
- Record Linkage ToolKit (Find and link entities)☆110Updated last year
- A review of the deprecated Freebase knowledge base and Metaweb Query Language (MQL). A brief comparison of MQL and GraphQL.☆41Updated 7 years ago
- Named-Entity Recognition extension for OpenRefine☆26Updated 2 years ago
- Minimal Named-Entity Recognizer (MER)☆57Updated 5 months ago
- A Utility Library for Wikipedia dumps☆33Updated 8 years ago
- A lightweight server to allow HTTP requests to the Stanford Named Entity Recognized and a heavily modified CLAVIN geoparser.☆119Updated 2 years ago
- A repo that contains outgoing links from DBpedia☆50Updated 4 years ago