niczem / trawlerLinks
scraper for facebook, gab, google and tiktok
☆21Updated 3 weeks ago
Alternatives and similar repositories for trawler
Users that are interested in trawler are comparing it to the libraries listed below
Sorting:
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 3 months ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 7 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- ☆11Updated 6 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- This windows CLI app lets you collect data from twitter via REST API and convert it into a CSV data set that can be used with Gephi. Othe…☆25Updated 4 years ago
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆13Updated this week
- RTAA-72, is CVCIO's real-time intelligence dashboard for Twitter☆21Updated 2 years ago
- DEPRECATED. Desktop graph visualization application☆50Updated 2 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- Template repository and README for submissions to Bellingcat's Global Hackathon☆16Updated 2 years ago
- A list of over 5000 US news domains and their social media accounts☆45Updated 2 years ago
- Webrecorder Automated In-Page Behavior Framework☆13Updated 4 years ago
- Stanford Internet Observatory publications☆14Updated 3 years ago
- 🗞 Monitors data sources, alerts you when they change☆12Updated 3 years ago
- A Python library for defining rule-based overrides on messy data☆14Updated last month
- All the files and documentation necessary to reuse, remix and translate A Field Guide to "Fake News" and Other Information Disorders.☆62Updated 4 years ago
- ☆34Updated last year
- jq module to process Wikidata JSON format☆11Updated 6 years ago
- Converter for ICIJ Offshore Leaks data into FollowTheMoney format☆12Updated 3 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Analysis for "Geofenced Searches on Twitter: A Case Study Detailing South Asia’s Covid Crisis", published on May 19, 2021.☆25Updated last year
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med…☆37Updated 2 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated 8 months ago
- A command line tool that queries the Open Corporates Database and returns data on corporations under the copyleft Open Database License.☆33Updated 2 years ago
- Data cleaning and validation functions for names, languages, identifiers, etc.☆21Updated this week
- A helper library full of URL-related heuristics.☆69Updated 2 months ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago