notnews / archive_news_ccLinks
Closed Caption Transcripts of News Videos from archive.org 2014--2023
☆50Updated 8 months ago
Alternatives and similar repositories for archive_news_cc
Users that are interested in archive_news_cc are comparing it to the libraries listed below
Sorting:
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 4 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 5 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated last month
- smappdragon is a set of tools for working with twitter data.☆29Updated 7 years ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private…☆23Updated 4 years ago
- MPEDS Annotation Interface☆18Updated 3 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆71Updated 4 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 7 years ago
- ☆37Updated 7 years ago
- pysmap is a high level interface for working with twitter data.☆21Updated 5 years ago
- ☆76Updated this week
- Data and analysis for the BuzzFeed News article, "We Got Government Data On 20 Years Of Workplace Sexual Harassment Claims. These Charts …☆27Updated 8 years ago
- Determines the ethnicity based on your last name☆10Updated 11 years ago
- Topic supervised non-negative matrix factorization with sparse matrices☆12Updated 5 years ago
- All the data behind How Good Are FiveThirtyEight Forecasts?☆27Updated 2 years ago
- An R package to assess the effects of text preprocessing decisions.☆66Updated 4 years ago
- Code supporting the dissertation "Agents in Conflict," George Mason University, 2016☆20Updated 9 years ago
- Lectures from my DS Text as Data course offered in Spring 2018.☆77Updated 4 years ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆78Updated 2 months ago
- The documentation and scripts for the Local News Dataset☆25Updated 3 years ago
- CNN Transcripts 2000--2025☆23Updated 8 months ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆74Updated last year
- Fast, flexible name matching for large datasets☆71Updated 4 months ago
- A Python package for downloading data from the UK Parliament's Data Platform.☆29Updated 5 years ago
- R package associated with Benoit, Munger and Spirling (2017) paper(s)☆43Updated 4 years ago
- Python module to extract articles from NexisUni and Factiva.☆39Updated 6 years ago
- 2020-election-night-model☆60Updated 5 years ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆40Updated 6 months ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29Updated 5 years ago
- Calculate readability scores☆43Updated 6 years ago