mstem / archive.org-getterLinks
Ruby script to download bulk results from Archive.org's TV News database of closed captions
☆14Updated 12 years ago
Alternatives and similar repositories for archive.org-getter
Users that are interested in archive.org-getter are comparing it to the libraries listed below
Sorting:
- The BITS Lab STACK tool for social media collection and analysis.☆39Updated 2 years ago
- Research-grade URL expansion for Python.☆27Updated 7 years ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆76Updated 2 months ago
- Python library for interacting with smapp collections☆18Updated 9 years ago
- Twitter stream and social network crawling tools☆17Updated 8 years ago
- Data and analysis for the BuzzFeed News article, "We Got Government Data On 20 Years Of Workplace Sexual Harassment Claims. These Charts …☆27Updated 7 years ago
- Patterns in NYT production from 1987 to 2007☆11Updated 7 years ago
- Tracing policy ideas from think tanks and lobbyists through state legislative bills☆47Updated 9 years ago
- Text Thresher crowd sourced text annotator☆17Updated 7 years ago
- NICAR 2019 workshop on using Python and PDFplumber to extract text from PDFs☆12Updated 6 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Python tools for text☆16Updated 5 years ago
- Mine tweets with Python and parse/analyze with R and a Shiny Dashboard☆39Updated 7 years ago
- Presentation for the NYU Data Lab December 2015☆14Updated 9 years ago
- A starter kit with code for data collection, preparation, and analysis of digital trace data collected on Twitter☆44Updated 4 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 8 years ago
- smappdragon is a set of tools for working with twitter data.☆29Updated 7 years ago
- Software for preprocessing textual data in multiple languages for textual analysis.☆23Updated 9 years ago
- Amsterdam Content Analysis Toolkit☆46Updated 3 years ago
- Computational Historical Thinking: With Applications in R☆61Updated 5 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js☆54Updated 5 years ago
- Notebooks and files for the Python for Journalists course on Datajournalism.com☆61Updated 5 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Closed Caption Transcripts of News Videos from archive.org 2014--2023☆49Updated 6 months ago
- This repository is the central communication and project management interface for the Social Media Observatory hosted by the Leibniz Insi…☆27Updated 4 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Analysis related to article on FOIA Online Database.☆11Updated 8 years ago