alephdata / aleph
Search and browse documents and data; find the people and companies you look for.
☆2,036Updated this week
Related projects ⓘ
Alternatives and complementary repositories for aleph
- Data model and processing tools for investigative entity data☆218Updated this week
- A self-hosted search engine for documents.☆598Updated this week
- Lightweight web scraping toolkit for documents and structured data.☆309Updated 10 months ago
- An open database of international sanctions data, persons of interest and politically exposed persons☆500Updated this week
- Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Tex…☆978Updated last year
- Trigger Happy - The bus for your internet services☆1,345Updated 4 years ago
- A Python stream processing engine modeled after Yahoo! Pipes☆1,604Updated 2 years ago
- Open Source Self-Hosted Business Intelligence Platform☆1,088Updated last year
- A toolkit for making domain-specific probabilistic parsers☆797Updated last month
- Download the entire Wayback Machine archive for a given URL.☆2,899Updated 6 months ago
- GitHub repository for the SecureDrop whistleblower platform. Do not submit tips here!☆3,623Updated this week
- A cross-platform command line tool for parallelised content extraction and analysis.☆241Updated 2 months ago
- Generate links that users can use to submit messages encrypted with your public key.☆954Updated 2 weeks ago
- Collect and revisit web pages.☆1,485Updated last year
- The magma server daemon, is an encrypted email system with support for SMTP, POP, IMAP, HTTP and MOLTEN,. Additional support for DMTP and…☆1,817Updated 10 months ago
- brozzler - distributed browser-based web crawler☆672Updated last week
- ☆1,924Updated 3 years ago
- Decentralized feeds using BitTorrent's DHT. Idea from Arvid and The_8472 "DHT RSS feeds" http://libtorrent.org/dht_rss.html☆879Updated 8 years ago
- Log-based transactional graph engine☆1,137Updated last month
- A desktop application for viewing and analyzing tabular data☆3,181Updated this week
- A Python data analysis library that is optimized for humans instead of machines.☆1,173Updated 3 months ago
- Distributed Stream Processing☆1,480Updated 3 years ago
- Think of Local sheriff as a recon tool in your browser (WebExtension). While you normally browse the internet, Local Sheriff works in the…☆305Updated last year
- Wget-compatible web downloader and crawler.☆557Updated 6 months ago
- a set of tools to help with securely redacting and stripping metadata from documents before publishing☆533Updated 4 years ago
- Orchestra is a human-in-the-loop AI system for orchestrating project teams of experts and machines.☆669Updated 8 months ago
- Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.☆1,050Updated last year
- Websites crawler with built-in exploration and control web interface☆328Updated 2 months ago
- Documents with Scientific Intelligence☆802Updated this week
- Just the facts -- web page content extraction☆1,254Updated 4 months ago