mstem / archive.org-getter
Ruby script to download bulk results from Archive.org's TV News database of closed captions
☆14Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for archive.org-getter
- ☆14Updated 7 years ago
- Humanities Data Curation Record☆11Updated 7 years ago
- Python interface for LegiScan API☆17Updated 9 years ago
- Analysis repository for "The Spine of American Law: Digital Text Analysis and U.S. Legal Practice"☆19Updated 6 years ago
- Research-grade URL expansion for Python.☆25Updated 6 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆41Updated 7 years ago
- Web Archives for Historical Research☆13Updated 7 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js☆55Updated 4 years ago
- Search the Internet Archive, retrieve metadata, and download files☆58Updated 4 years ago
- The public GitHub repository for MUDDLE: a digital lit-mag devoted to celebrating the messiness of composition. Created by Taylor Brown a…☆14Updated 4 years ago
- This is a public repository for sharing, improving, and versioning "The Topic Modeling Game," a lesson developed by Lisa Rhody to teach t…☆9Updated 6 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated last year
- The BITS Lab STACK tool for social media collection and analysis.☆39Updated last year
- The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man☆20Updated 5 years ago
- ARCHIVED An R client for 'HathiTrust' API☆8Updated 2 years ago
- Processing OpenCitations Data☆17Updated 7 years ago
- Website for America's Public Bible☆11Updated 4 years ago
- A python client for the DPLA API☆43Updated 2 years ago
- Twitter stream and social network crawling tools☆16Updated 8 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆87Updated last year
- A simple catalog of Twitter ID Datasets☆28Updated 2 months ago
- A statistics extension for Google Refine.☆25Updated 11 years ago
- command line resource for working with digital primary sources☆27Updated 6 years ago
- Tracing policy ideas from think tanks and lobbyists through state legislative bills☆42Updated 8 years ago
- A library for extracting and parsing Wikipedia talk pages☆13Updated 7 years ago
- A Twitter data collection and appraisal application.☆50Updated last year
- a bunch of scripts for investigaing reddit☆11Updated 7 years ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- Topic Modeling Workflow in Python☆16Updated last year
- Grabbing all news.☆62Updated 4 years ago