niczem / trawlerLinks
scraper for facebook, gab, google and tiktok
☆21Updated last week
Alternatives and similar repositories for trawler
Users that are interested in trawler are comparing it to the libraries listed below
Sorting:
- ☆11Updated 6 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 8 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 4 months ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- DEPRECATED. Desktop graph visualization application☆51Updated 2 years ago
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆13Updated 2 weeks ago
- A Python library for defining rule-based overrides on messy data☆16Updated 2 months ago
- Simple tool to pull posts and users from Gab☆16Updated last week
- An alpha project combining beneficial ownership and contracting data☆13Updated 4 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆23Updated last year
- A helper library full of URL-related heuristics.☆69Updated 3 weeks ago
- A list of over 5000 US news domains and their social media accounts☆45Updated 2 years ago
- List of Sanctions and Most wanted☆28Updated 8 years ago
- 🗞 Monitors data sources, alerts you when they change☆12Updated 3 years ago
- jq module to process Wikidata JSON format☆11Updated 6 years ago
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Updated last year
- This windows CLI app lets you collect data from twitter via REST API and convert it into a CSV data set that can be used with Gephi. Othe…☆25Updated 4 years ago
- Frontend interface for Datashare, a self-hosted search engine for documents.☆35Updated last week
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆39Updated last week
- Ask questions about government data.☆37Updated 6 years ago
- A verification “Swiss army knife” helping journalists, fact-checkers, and human rights defenders to save time and be more efficient in th…☆39Updated this week
- A framework for observing Twitter through interactive networks.☆73Updated 7 months ago
- A financial disclosure data extraction tool.☆16Updated last year
- All the files and documentation necessary to reuse, remix and translate A Field Guide to "Fake News" and Other Information Disorders.☆62Updated 4 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 8 months ago
- The Condemned Dataset☆22Updated 5 years ago
- Some tools to help analyze the twitter archive☆62Updated 2 weeks ago