the-markup / blacklight-collectorLinks
β220Updated 2 weeks ago
Alternatives and similar repositories for blacklight-collector
Users that are interested in blacklight-collector are comparing it to the libraries listed below
Sorting:
- πΈ Modular, multithreaded, puppeteer-based crawlerβ147Updated this week
- Deep links to opt-out of data sharing by 100+ companies.β149Updated last year
- A simply complicated guide to removing your info from data brokersβ216Updated 2 years ago
- Tranco: An improved top websites rankingβ162Updated 5 years ago
- Data from the largest and longest measurement of online tracking.β455Updated 3 weeks ago
- A browser extension to share data about your social feed with researchers and journalists to increase transparency.β87Updated 2 years ago
- The Digital Standard is an ambitious, community-led effort to build a framework to test and rate products and services on the basis of prβ¦β133Updated 2 years ago
- A tool to detect whether a PDF has a bad redactionβ148Updated 3 weeks ago
- β24Updated 4 years ago
- Code used to build a Tracker Radar data set from raw crawl data.β199Updated 4 months ago
- Global Privacy Control Specificationβ123Updated 2 weeks ago
- Monitor stories from news outlets for words or phrases that matter to youβ150Updated 5 months ago
- π‘ Collection of pages for testing various privacy and security features of browsers and browser extensions.β87Updated last week
- Automated training for Privacy Badger. Badger Sett automates browsers to visit websites to produce fresh Privacy Badger tracker data.β132Updated this week
- β17Updated 2 years ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β169Updated 2 weeks ago
- Documents versions that are not maintained by a dedicated actor. Maintained collaboratively by volunteer contributors.β143Updated last week
- This contains the data and methodology that we used for our story "Thereβs a Multibillion-Dollar Market for Your Phoneβs Location Data ."β74Updated 3 years ago
- Historical website privacy policies spanning over two decades.β131Updated last year
- Own your dataβ112Updated 3 weeks ago
- β167Updated 7 months ago
- Tracks contractual documents and exposes changes to the terms of online services.β125Updated 3 weeks ago
- Code for the twitter bot nyt_diffβ213Updated 8 months ago
- The Toolkit API, app, and browser extension. Start preserving now.β47Updated this week
- prefabricated twitter searches for civil society purposesβ48Updated last week
- β42Updated 3 weeks ago
- Collaborative data collection tool developed by the Associated Pressβ109Updated 2 years ago
- Most government websites end in .gov or .mil, but many do not. This repo contains USA.gov's list of public government domains and URLs thβ¦β225Updated last month
- List of newsrooms around the world that are using software engineering, data science, osint, and various tech to elevate reporting.β100Updated 4 years ago
- Web Extension version of the Firefox Lightbeam add-onβ193Updated 2 years ago