facebookresearch / URL-SanitizationLinks
The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private and/or sensitive data. It is part of the Facebook URL shares release effort, which is led by Election Research Commission (ERC).
☆23Updated 4 years ago
Alternatives and similar repositories for URL-Sanitization
Users that are interested in URL-Sanitization are comparing it to the libraries listed below
Sorting:
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 5 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆71Updated 4 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 5 years ago
- Code supporting the dissertation "Agents in Conflict," George Mason University, 2016☆20Updated 9 years ago
- ☆37Updated 7 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆15Updated 6 years ago
- smappdragon is a set of tools for working with twitter data.☆29Updated 7 years ago
- The documentation and scripts for the Local News Dataset☆25Updated 3 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 7 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 4 years ago
- COMM 4940: Governing Human-Algorithm Behavior☆22Updated last month
- Text Thresher crowd sourced text annotator☆17Updated 8 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated last month
- Closed Caption Transcripts of News Videos from archive.org 2014--2023☆50Updated 8 months ago
- MPEDS Annotation Interface☆18Updated 3 years ago
- Tracing policy ideas from think tanks and lobbyists through state legislative bills☆47Updated 9 years ago
- 2020-election-night-model☆60Updated 4 years ago
- RECSM-UPF Summer School: Social Media and Big Data Research☆23Updated 8 years ago
- ☆75Updated last week
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆39Updated 6 months ago
- Experiments to help discussion on Wikipedia talk pages☆68Updated 3 weeks ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- Hierarchical clustering of 2011-2022 Congress Twitter☆29Updated 3 years ago
- Implements the model described in "Identification, Interpretability, and Bayesian Word Embeddings"☆19Updated 6 years ago
- Data on newspaper presidential endorsements☆30Updated 5 years ago
- Replication code for "Indian judges show no gender or religious in-group bias" (Ash et al. 2020)☆30Updated 11 months ago
- Data and analysis for the BuzzFeed News article, "We Got Government Data On 20 Years Of Workplace Sexual Harassment Claims. These Charts …☆27Updated 8 years ago
- Data, analytic code, and findings related to the BuzzFeed News article, "Inside The Partisan Fight For Your News Feed," published August …☆46Updated 8 years ago
- Analyzing MTurk demographics☆14Updated 2 years ago
- Code for the paper "Benchmarking sentiment analysis methods for large-scale texts: A case for using continuum-scored words and word shift…☆16Updated 8 years ago