ICIJ / datashare
A self-hosted search engine for documents.
☆629Updated this week
Alternatives and similar repositories for datashare:
Users that are interested in datashare are comparing it to the libraries listed below
- A cross-platform command line tool for parallelised content extraction and analysis.☆245Updated this week
- Lightweight web scraping toolkit for documents and structured data.☆311Updated last year
- Data model and processing tools for investigative entity data☆226Updated this week
- Search and browse documents and data; find the people and companies you look for.☆2,128Updated this week
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆209Updated this week
- Websites crawler with built-in exploration and control web interface☆350Updated this week
- Twitter stream + search API grabber☆104Updated last year
- The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.☆297Updated last week
- Automatically archive links to videos, images, and social media content from Google Sheets (and more).☆690Updated this week
- An open database of international sanctions data, persons of interest and politically exposed persons☆551Updated this week
- A webmining CLI tool & library for python.☆319Updated this week
- Exploration, monitoring and classification of incidents in time and space.☆360Updated 6 months ago
- Frontend interface for Datashare, a self-hosted search engine for documents.☆34Updated this week
- Social Feed Manager user interface application.☆155Updated 10 months ago
- Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Tex…☆1,025Updated last week
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more …☆265Updated this week
- JavaScript app for displaying annotated network graphs based on data from LittleSis☆102Updated 2 months ago
- Digital Methods Initiative - Twitter Capture and Analysis Toolset☆369Updated 5 months ago
- An ICIJ app to conduct data validation and cleaning.☆20Updated 2 months ago
- Run Overview on your own system☆124Updated 3 years ago
- ☆245Updated this week
- Want to contribute? These are difficult, long-term projects that could be valuable to open source investigators at Bellingcat and around …☆350Updated last year
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆150Updated 3 months ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆187Updated 4 years ago
- A browser extension to collect social media data with.☆238Updated 3 weeks ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆55Updated last week
- Retrieves archived tweets from Wayback Machine in HTML, CSV, and JSON☆103Updated this week
- Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")☆108Updated 2 months ago
- The Java Graphical Authorship Attribution Program☆271Updated 9 months ago
- A helper library full of URL-related heuristics.☆69Updated last month