tfmorris / Names
A comprehensive database of name variants
☆44 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for Names
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆72 · Updated 7 years ago
- Google Refine extension for adding columns (extending data) from DBpedia ☆39 · Updated 11 years ago
- Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date ☆41 · Updated 3 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization ☆13 · Updated 7 years ago
- Tools to download and process name data from various sources. ☆88 · Updated 11 years ago
- Advanced desktop search/corpus exploration prototype ☆21 · Updated 3 years ago
- A tool for calculating semantic similarity between words from a text corpus based on lexico-syntactic patterns. ☆28 · Updated 8 years ago
- Another next-generation event coding platform. ☆71 · Updated 5 years ago
- Events and Situations Ontology ☆13 · Updated 6 years ago
- Wikidata authority file mapping tool ☆11 · Updated 6 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆42 · Updated 6 years ago
- Record Linkage ToolKit (Find and link entities) ☆106 · Updated last year
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources ☆200 · Updated this week
- Specification of NAF, the NLP annotation format ☆21 · Updated 3 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ☆50 · Updated 4 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX) ☆24 · Updated 7 years ago
- Parser and standardizer for politician, individual and organization names. ☆128 · Updated 7 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities ☆97 · Updated last month
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆48 · Updated 2 years ago
- Turning news into events since 2014. ☆50 · Updated 7 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆46 · Updated 2 years ago
- 🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec ☆60 · Updated 3 years ago
- Trying to generate name synonyms from wikidata ☆33 · Updated 4 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 · Updated 5 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 · Updated 7 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆145 · Updated 9 months ago
- Loading OpenSanctions into Neo4J and Linkurious ☆27 · Updated last month
- 🚀 GUI for training spaCy models ☆53 · Updated 3 years ago