mstem / archive.org-getter
Ruby script to download bulk results from Archive.org's TV News database of closed captions
☆14Updated 12 years ago
Alternatives and similar repositories for archive.org-getter:
Users that are interested in archive.org-getter are comparing it to the libraries listed below
- Research-grade URL expansion for Python.☆27Updated 6 years ago
- Patterns in NYT production from 1987 to 2007☆11Updated 7 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js☆55Updated 4 years ago
- Twitter stream and social network crawling tools☆16Updated 8 years ago
- The BITS Lab STACK tool for social media collection and analysis.☆39Updated 2 years ago
- Python library for interacting with smapp collections☆18Updated 8 years ago
- Web Archives for Historical Research☆13Updated 7 years ago
- a general list of resources and articles for people interested in getting into data journalism☆16Updated last year
- A collection of stuff for my Data Journalism class at the University of Nebraska-Lincoln.☆99Updated 7 years ago
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- NICAR 2019 workshop on using Python and PDFplumber to extract text from PDFs☆12Updated 6 years ago
- A repo for tracking the number of followers of Congress, the Cabinet, and Governors☆17Updated 5 years ago
- ☆14Updated 8 years ago
- Loose Miscellany☆21Updated 7 years ago
- smappdragon is a set of tools for working with twitter data.☆29Updated 6 years ago
- The public GitHub repository for MUDDLE: a digital lit-mag devoted to celebrating the messiness of composition. Created by Taylor Brown a…☆15Updated 5 years ago
- An ultra-simple example of how to use Python to write stories based on a set of data.☆29Updated 11 years ago
- Tracing policy ideas from think tanks and lobbyists through state legislative bills☆44Updated 8 years ago
- Python scraper to get weekly CDC flu surveillance data☆25Updated 10 years ago
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- The what (and how) digital humanities and news nerds want to explore together☆64Updated 9 years ago
- A Django app to refine, review and republish campaign finance data drawn from the California Secretary of State’s CAL-ACCESS database☆17Updated 9 years ago
- The documentation and scripts for the Local News Dataset☆25Updated 2 years ago
- Python interface for LegiScan API☆19Updated 10 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- GenderTracker is a service that decomposes articles and computes various gender-related metrics based on the content.☆25Updated 11 years ago
- Search the Internet Archive, retrieve metadata, and download files☆60Updated 4 months ago
- A library that will eventually help people wanting to do Data Mining on Twitter☆22Updated 2 years ago