uhh-lt / newsleak
Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery
☆53Updated 2 months ago
Related projects:
- Trying to generate name synonyms from Wikidata☆33Updated 4 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆45Updated 2 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆54Updated last month
- Google Refine extension for adding columns (extending data) from DBpedia☆38Updated 10 years ago
- Events and Situations Ontology☆13Updated 6 years ago
- Tools for tracking stories on news homepages☆48Updated 4 years ago
- A PDF classifier ensemble with REST API service☆23Updated 3 years ago
- Specification of NAF, the NLP annotation format☆21Updated 3 years ago
- NYT Risk Semantics Project☆12Updated 8 years ago
- Searching large heterogeneous data dumps with Universal Sentence Encoder☆62Updated 3 years ago
- Extract Data from Wikipedia Lists☆30Updated 7 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- Wrapper for DKPro Core to extract linguistic information from books.☆16Updated 2 years ago
- A digital humanities operating system that runs on a USB disk.☆30Updated 7 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆106Updated 3 years ago
- spaCy-to-naf converter☆21Updated 3 months ago
- Topic Modeling Workflow in Python☆16Updated last year
- OpenRefine for Social Science Data☆23Updated this week
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 7 years ago
- A Named-Entity Recogniser based on Grobid.☆48Updated this week
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆188Updated 3 years ago
- Extract networks of entities from journalistic reporting☆46Updated last year
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 7 years ago
- Scrapes the web. Gets the news.☆13Updated 8 years ago
- Parse Popolo JSON data and navigate it with Python☆14Updated 4 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆95Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆23Updated 2 years ago
- Neo4j powered web application for multimedia collections: bring graph-based exploration and crowd-based indexation.☆39Updated 4 years ago
- ☆14Updated 3 years ago