notnews / archive_news_cc
Closed Caption Transcripts of News Videos from archive.org 2014--2023
☆47 Updated last year
Related projects
Alternatives and complementary repositories for archive_news_cc
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison. ☆14 Updated 3 years ago
- R package associated with Benoit, Munger and Spirling (2017) paper(s) ☆43 Updated 3 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 6 years ago
- MPEDS Annotation Interface ☆18 Updated 2 years ago
- Fine-tuned transformers for protest event detection. ☆9 Updated 3 years ago
- An R package to assess the effects of text preprocessing decisions. ☆66 Updated 3 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak… ☆67 Updated 3 years ago
- NamSor API v2 R SDK - classify personal names accurately by gender, country of origin, or ethnicity. ☆12 Updated 3 years ago
- A set of Jupyter notebooks demonstrating how to use the Media Cloud API. ☆35 Updated 11 months ago
- Python module to extract articles from NexisUni and Factiva. ☆36 Updated 5 years ago
- Text-Based Ideal Points ☆44 Updated last year
- Natural Language Processing for Political Science ☆21 Updated 7 years ago
- A data package containing lexicons and dictionaries for text analysis ☆110 Updated 3 years ago
- ☆46 Updated 6 years ago
- Implements the model described in "Identification, Interpretability, and Bayesian Word Embeddings" ☆18 Updated 5 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure" ☆28 Updated 4 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance ☆12 Updated 8 years ago
- A Los Angeles Times analysis found that LAPD officers search blacks and Latinos far more often than whites during traffic stops even thou… ☆11 Updated 5 years ago
- Scale ideological slant of Tweets ☆21 Updated 5 years ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private… ☆23 Updated 3 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets ☆15 Updated 4 years ago
- Hierarchical clustering of 2011-2022 Congress Twitter ☆31 Updated 2 years ago
- ☆16 Updated 3 years ago
- DEPRECATED - The Concept Mover's Distance Method is now available in the text2map package. Concept Mover's Distance is a way to measure… ☆27 Updated 3 years ago
- Software for preprocessing textual data in multiple languages for textual analysis. ☆23 Updated 8 years ago
- The FBAdLibrarian is a simple tool that can pull ad data and collect images offered by Facebook’s Ad Library API. ☆15 Updated last year
- Fast, flexible name matching for large datasets ☆70 Updated 11 months ago
- Investigating how COVID-19 shaped Anti-Asian Climate ☆12 Updated 3 years ago
- Paper and related materials for Rodriguez & Spirling (JOP, 2022) word embeddings overview and assessment ☆45 Updated 2 years ago