ICIJ / datashare
A self-hosted search engine for documents.
☆623Updated this week
Alternatives and similar repositories for datashare:
Users that are interested in datashare are comparing it to the libraries listed below
- Data model and processing tools for investigative entity data☆225Updated this week
- A cross-platform command line tool for parallelised content extraction and analysis.☆243Updated this week
- Websites crawler with built-in exploration and control web interface☆347Updated 2 months ago
- Twitter stream + search API grabber☆104Updated last year
- Lightweight web scraping toolkit for documents and structured data.☆311Updated last year
- Klaxon enables reporters and editors to monitor scores of sites on the web for newsworthy changes.☆654Updated this week
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆204Updated this week
- Social Feed Manager user interface application.☆155Updated 9 months ago
- Frontend interface for Datashare, a self-hosted search engine for documents.☆34Updated this week
- The data journalism platform with built in training☆305Updated 3 months ago
- ☆66Updated 5 years ago
- JavaScript app for displaying annotated network graphs based on data from LittleSis☆102Updated last month
- brozzler - distributed browser-based web crawler☆693Updated last week
- Run Overview on your own system☆123Updated 3 years ago
- A browser extension to collect social media data with.☆237Updated last week
- A fork of Mozilla Lightbeam made for tracker research.☆13Updated 4 months ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆186Updated 4 years ago
- A small command line tool and set of functions for studying coordination networks in Twitter and other social media data.☆76Updated 2 years ago
- The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.☆296Updated this week
- Extract and Visualize Data from URLs using Unfurl☆659Updated 2 weeks ago
- LittleSis is a free database of who-knows-who at the heights of business and government☆100Updated this week
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆149Updated 2 months ago
- ☆211Updated last week
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆56Updated 8 months ago
- A Tool To Push Web Resources Into Web Archives☆419Updated last year
- data for national legislatures worldwide☆243Updated last year
- Automatically archive links to videos, images, and social media content from Google Sheets (and more).☆656Updated this week
- Indelible links☆460Updated this week
- A webmining CLI tool & library for python.☆311Updated this week
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more …☆249Updated this week