ArchiveTeam / NewsGrabber-Warrior
☆8 · Updated 5 years ago
Related projects:
- Bot for operating snscrape in #archivebot on efnet ☆10 · Updated 4 years ago
- Bot powering the @LinkArchiver Twitter tool to send tweeted URLs to the Wayback Machine ☆46 · Updated 6 years ago
- Materials to reproduce findings in our story, "Google's Top Search Result? Surprise! It's Google" ☆34 · Updated 4 years ago
- Grabbing all news. ☆62 · Updated 4 years ago
- ☆15 · Updated 5 years ago
- Save My News: A personal, permanent clipping service ☆26 · Updated 11 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 · Updated last year
- Find rss, atom, xml, and rdf feeds on webpages ☆30 · Updated last year
- Tools for tracking stories on news homepages ☆48 · Updated 4 years ago
- export data from twitter archive and visualize it ☆25 · Updated last year
- Scraping Assisted by Learning ☆35 · Updated last week
- ☆25 · Updated this week
- ☆33 · Updated this week
- Scripts for FOIA The Dead, a morbid transparency project ☆36 · Updated last year
- ☆11 · Updated 2 years ago
- Internet Archive Data Mining Tools ☆44 · Updated 3 years ago
- Presentations on Quantified Self and Self-Tracking with Python ☆29 · Updated last year
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists. ☆15 · Updated this week
- how hard is it to get a list of all local news sites in the United States (LOL) ☆8 · Updated 4 years ago
- GenderTracker is a service that decomposes articles and computes various gender-related metrics based on the content. ☆25 · Updated 10 years ago
- The news homepage archive ☆81 · Updated 2 years ago
- A simple Python wrapper and command-line interface for archive.org's "Save Page Now" capturing service ☆167 · Updated 2 weeks ago
- Literate data analysis with iPython notebooks and Jekyll. ☆92 · Updated 10 years ago
- a smaller, cleaner, campaign finance app that complements the new FEC site ☆21 · Updated 7 months ago
- A WordPress plugin for aggregating data via the hypothes.is API. ☆25 · Updated 6 years ago
- A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity ☆91 · Updated 5 years ago
- Python tools for processing data from the Catalog of Copyright Entries ☆37 · Updated 4 years ago
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'" ☆38 · Updated 3 months ago
- A tool for working with tweet archives. ☆15 · Updated last year
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac… ☆49 · Updated 2 months ago