niczem / trawler
Scraper for Facebook, Gab, Google and TikTok
☆22 Updated 6 months ago
Alternatives and similar repositories for trawler:
Users who are interested in trawler are comparing it to the repositories listed below:
- API client for Aleph, supports bulk entity and document upload. ☆28 Updated 3 months ago
- Extract networks of entities from journalistic reporting ☆47 Updated last year
- BotSlayer Community Edition ☆35 Updated last year
- ETL pipeline, graphical explorer and general toolbox for investigations with Follow the Money data ☆15 Updated last year
- DEPRECATED. Desktop graph visualization application ☆50 Updated 2 years ago
- A collaborative collection of datasets that are commonly used within "Follow the Money" investigations with European scope ☆13 Updated 7 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 Updated last year
- 🗞 Monitors data sources, alerts you when they change ☆12 Updated 3 years ago
- A list of over 5000 US news domains and their social media accounts ☆42 Updated last year
- An alpha project combining beneficial ownership and contracting data ☆13 Updated 3 years ago
- jq module to process Wikidata JSON format ☆11 Updated 5 years ago
- List of Sanctions and Most wanted ☆26 Updated 7 years ago
- A helper library full of URL-related heuristics. ☆64 Updated 3 months ago
- ☆12 Updated 5 years ago
- A Python library for defining rule-based overrides on messy data ☆13 Updated 2 months ago
- Codec is a collaborative tool for managing video evidence. ☆63 Updated 9 months ago
- Ask questions about government data. ☆37 Updated 6 years ago
- All the files and documentation necessary to reuse, remix and translate A Field Guide to "Fake News" and Other Information Disorders. ☆61 Updated 4 years ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org ☆36 Updated this week
- Web interface for network analysis. ☆21 Updated 2 years ago
- ☆22 Updated 4 years ago
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆14 Updated 3 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler. ☆37 Updated 2 weeks ago
- 📕 Writing tests, the DataMade way ☆16 Updated 4 years ago
- RTAA-72 is CVCIO's real-time intelligence dashboard for Twitter ☆21 Updated 2 years ago
- Template repository and README for submissions to Bellingcat's Global Hackathon ☆16 Updated 2 years ago
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med… ☆35 Updated 2 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists. ☆17 Updated 3 weeks ago
- Given a set of URLs, this package detects coordinated link sharing behavior on social media and outputs the network of entities that per… ☆74 Updated 5 months ago