facebookresearch / URL-Sanitization
The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private and/or sensitive data. It is part of the Facebook URL shares release effort, which is led by Election Research Commission (ERC).
☆23Updated 3 years ago
Alternatives and similar repositories for URL-Sanitization
Users that are interested in URL-Sanitization are comparing it to the libraries listed below
Sorting:
- The documentation and scripts for the Local News Dataset☆25Updated 3 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆69Updated 3 years ago
- COMM 4940: Governing Human-Algorithm Behavior☆21Updated 10 months ago
- Tools for collecting social media data around focal events☆84Updated 3 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- Investigating how COVID-19 shaped Anti-Asian Climate☆12Updated 3 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆113Updated 5 months ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- Analyzing MTurk demographics☆14Updated last year
- Data, code, and methodology supporting BuzzFeed News' analysis of the 2016 U.S. Census Survey of Income and Program Participation☆9Updated 2 years ago
- ☆70Updated 4 months ago
- RECSM-UPF Summer School: Social Media and Big Data Research☆22Updated 7 years ago
- Utilities for retrieving whitehouse.gov transcripts and matching news quotes to them☆16Updated 10 years ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆37Updated last week
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- R package associated with Benoit, Munger and Spirling (2017) paper(s)☆43Updated 4 years ago
- Python module to extract articles from NexisUni and Factiva.☆38Updated 5 years ago
- Hierarchical clustering of 2011-2022 Congress Twitter☆29Updated 2 years ago
- MPEDS Annotation Interface☆18Updated 2 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 3 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- Teaching materials and resources for computational and quantitative social science methods.☆23Updated 3 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- 2020 Computational Journalism Class☆20Updated 2 years ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆72Updated this week
- Replication code for "Indian judges show no gender or religious in-group bias" (Ash et al. 2020)☆28Updated 4 months ago
- Datakit plugin to help manage Github integration on data projects.☆12Updated 2 years ago
- The FBAdLibrarian is a simple tool that can pull ad data and collects images offered by Facebook’s Ad Library API.☆15Updated 2 years ago
- smappdragon is a set of tools for working with twitter data.☆29Updated 6 years ago
- DEPRECATED - The Concept Mover's Distance Method is now available in the text2map package. Concept Mover's Distance is a way to measure…☆27Updated 3 years ago