ICIJ / datashare
A self-hosted search engine for documents.
☆638 · Updated this week
Alternatives and similar repositories for datashare
Users who are interested in datashare are comparing it to the repositories listed below.
- Data model and processing tools for investigative entity data ☆236 · Updated this week
- A cross-platform command line tool for parallelised content extraction and analysis. ☆245 · Updated 3 weeks ago
- Lightweight web scraping toolkit for documents and structured data. ☆312 · Updated last year
- An open database of international sanctions data, persons of interest and politically exposed persons ☆567 · Updated this week
- The data journalism platform with built-in training ☆306 · Updated 6 months ago
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources ☆212 · Updated this week
- Website crawler with built-in exploration and control web interface ☆354 · Updated this week
- Klaxon enables reporters and editors to monitor scores of sites on the web for newsworthy changes. ☆663 · Updated this week
- JavaScript app for displaying annotated network graphs based on data from LittleSis ☆102 · Updated 4 months ago
- Run Overview on your own system ☆124 · Updated 3 years ago
- Twitter stream + search API grabber ☆105 · Updated last year
- The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms. ☆318 · Updated this week
- A webmining CLI tool & library for Python. ☆327 · Updated 2 weeks ago
- Now included in rigour ☆151 · Updated last month
- Tool for the retrieval of corporate and financial data from the SEC ☆172 · Updated last month
- Data-driven, participatory fact-mapping ☆83 · Updated 8 years ago
- Social Feed Manager user interface application. ☆155 · Updated last year
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder) ☆446 · Updated 4 years ago
- Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner") ☆108 · Updated 4 months ago
- YTDT is a collection of simple tools for extracting data from the YouTube platform via the YouTube API v3. ☆127 · Updated 8 months ago
- Project moved to https://code.europa.eu/EDPS/website-evidence-collector ! The tool Website Evidence Collector (WEC) automates the website… ☆428 · Updated last year
- An ICIJ app to conduct data validation and cleaning. ☆20 · Updated 3 weeks ago
- Automatically archive links to videos, images, and social media content from Google Sheets (and more). ☆716 · Updated this week
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework. ☆115 · Updated 3 weeks ago
- Textricator is a tool to extract text from documents and generate structured data. ☆345 · Updated 3 months ago
- Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that co… ☆85 · Updated last year
- The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the co… ☆84 · Updated 3 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data. ☆65 · Updated last week
- A fork of Mozilla Lightbeam made for tracker research. ☆13 · Updated 6 months ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed… ☆150 · Updated last week