niczem / trawlerLinks
scraper for facebook, gab, google and tiktok
☆21Updated last month
Alternatives and similar repositories for trawler
Users that are interested in trawler are comparing it to the libraries listed below
Sorting:
- API client for Aleph, supports bulk entity and document upload.☆28Updated 9 months ago
- DEPRECATED. Desktop graph visualization application☆51Updated 2 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 5 months ago
- Extract networks of entities from journalistic reporting☆48Updated 2 years ago
- The Condemned Dataset☆22Updated 5 years ago
- A list of over 5000 US news domains and their social media accounts☆44Updated 2 years ago
- All the files and documentation necessary to reuse, remix and translate A Field Guide to "Fake News" and Other Information Disorders.☆62Updated 4 years ago
- Frontend interface for Datashare, a self-hosted search engine for documents.☆37Updated last week
- ☆11Updated 6 years ago
- A framework for observing Twitter through interactive networks.☆73Updated 9 months ago
- Run Overview on your own system☆125Updated 4 years ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆55Updated this week
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- Mecodify tool for twitter data analysis and visualisation☆42Updated 2 years ago
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆75Updated 11 months ago
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆75Updated 2 years ago
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med…☆37Updated 3 years ago
- A minimum-dependency ECMAScript client library and CLI tool for Parler – a "free speech" social network that accepts real money to buy "i…☆69Updated last year
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System☆87Updated 4 years ago
- Estimating the age of web resources☆96Updated 2 months ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 5 years ago
- Social Feed Manager user interface application.☆156Updated last year
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆40Updated 3 weeks ago
- Twitter stream + search API grabber☆106Updated 2 years ago
- ☆66Updated 5 years ago
- Classifying the content of domains☆56Updated 2 years ago
- A webmining CLI tool & library for python.☆333Updated last month
- The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)☆65Updated last year
- ☆10Updated last year
- Webrecorder Automated In-Page Behavior Framework☆13Updated 4 years ago