niczem / trawler
Scraper for Facebook, Gab, Google and TikTok
☆21 Updated 10 months ago
Alternatives and similar repositories for trawler
Users that are interested in trawler are comparing it to the libraries listed below
- API client for Aleph, supports bulk entity and document upload. ☆28 Updated 7 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 Updated 2 months ago
- Template repository and README for submissions to Bellingcat's Global Hackathon ☆16 Updated 2 years ago
- DEPRECATED. Desktop graph visualization application ☆50 Updated 2 years ago
- Extract networks of entities from journalistic reporting ☆48 Updated last year
- ETL pipeline, graphical explorer and general toolbox for investigations with Follow the Money data ☆23 Updated last year
- A collaborative collection of datasets commonly used in "Follow the Money" investigations with European scope ☆13 Updated 11 months ago
- An alpha project combining beneficial ownership and contracting data ☆13 Updated 3 years ago
- This Windows CLI app lets you collect data from Twitter via the REST API and convert it into a CSV dataset that can be used with Gephi. Othe… ☆25 Updated 4 years ago
- ☆11 Updated 5 years ago
- RTAA-72 is CVCIO's real-time intelligence dashboard for Twitter ☆21 Updated 2 years ago
- 🗞 Monitors data sources, alerts you when they change ☆12 Updated 3 years ago
- A list of over 5000 US news domains and their social media accounts ☆44 Updated 2 years ago
- A helper library full of URL-related heuristics. ☆69 Updated last month
- Codec is a collaborative tool for managing video evidence. ☆66 Updated last year
- Converter for ICIJ Offshore Leaks data into FollowTheMoney format ☆12 Updated 3 years ago
- A Python library for defining rule-based overrides on messy data ☆13 Updated last month
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google" ☆34 Updated 4 years ago
- Frontend interface for Datashare, a self-hosted search engine for documents. ☆34 Updated this week
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX) ☆24 Updated 7 years ago
- Ask questions about government data. ☆37 Updated 6 years ago
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med… ☆36 Updated 2 years ago
- A financial disclosure data extraction tool. ☆16 Updated last year
- Explore 3,500 Facebook ads reported to have been bought by the Russian Internet Research Agency ☆16 Updated 2 years ago
- This repository contains data from the story "Facebook Isn’t Telling You How Popular Right-Wing Content Is on the Platform." ☆11 Updated 3 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data. ☆62 Updated 2 weeks ago
- ☆22 Updated 4 years ago
- Find RSS, Atom, XML, and RDF feeds on webpages ☆30 Updated 7 months ago
- Service for creating Twitter datasets for research and archiving. ☆26 Updated 2 years ago
- jq module to process Wikidata JSON format ☆11 Updated 5 years ago