facebookresearch / URL-Sanitization
The code processes URLs in an attempt to consolidate different web addresses that point to the same destination and to remove potentially private and/or sensitive data. It is part of the Facebook URL shares release effort, which is led by the Election Research Commission (ERC).
☆23 Updated 3 years ago
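The two goals in the description above (mapping different addresses for the same destination to one form, and dropping potentially sensitive data) can be illustrated with a minimal sketch. This is not the repository's actual code: the function name `sanitize_url`, the `SENSITIVE_PARAMS` list, and the specific normalization rules (lowercasing, stripping `www.`, sorting query parameters, dropping fragments) are all assumptions chosen for illustration, using only Python's standard `urllib.parse` module.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical list of query parameters treated as tracking or personal data.
# A real project would need a much more carefully curated list.
SENSITIVE_PARAMS = {"utm_source", "utm_medium", "utm_campaign", "fbclid", "email", "token"}

DEFAULT_PORTS = {"http": 80, "https": 443}


def sanitize_url(url: str) -> str:
    """Return a normalized URL with sensitive parts removed (illustrative only)."""
    parts = urlsplit(url.strip())
    scheme = parts.scheme.lower()

    # .hostname is lowercased and excludes any "user:password@" prefix,
    # so embedded credentials are dropped here.
    host = parts.hostname or ""
    if host.startswith("www."):
        # Assumption: "www." and the bare domain serve the same content.
        host = host[4:]

    # Keep the port only when it is not the default for the scheme.
    port = parts.port
    netloc = host if port is None or port == DEFAULT_PORTS.get(scheme) else f"{host}:{port}"

    # Treat "/path/" and "/path" as the same address.
    path = parts.path.rstrip("/") or "/"

    # Drop sensitive parameters and sort the rest so ordering does not matter.
    kept = sorted(
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in SENSITIVE_PARAMS
    )

    # The fragment is discarded because it is never sent to the server.
    return urlunsplit((scheme, netloc, path, urlencode(kept), ""))
```

With these rules, `http://Example.com/a?b=2&a=1` and `http://www.example.com/a/?a=1&b=2&utm_medium=x` both reduce to the same string. Some of these choices, such as stripping `www.` or trailing slashes, can merge addresses that a particular site treats as distinct, so they would need checking against real data.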
Alternatives and similar repositories for URL-Sanitization:
Users who are interested in URL-Sanitization are comparing it to the libraries listed below:
- Investigating how COVID-19 shaped Anti-Asian Climate ☆12 Updated 3 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google" ☆34 Updated 4 years ago
- The documentation and scripts for the Local News Dataset ☆25 Updated 2 years ago
- Given a set of URLs, this package detects coordinated link sharing behavior on social media and outputs the network of entities that per… ☆75 Updated 7 months ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 6 years ago
- A financial disclosure data extraction tool. ☆14 Updated last year
- A set of Jupyter notebooks demonstrating how to use the Media Cloud API. ☆37 Updated last year
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance ☆12 Updated 8 years ago
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak… ☆68 Updated 3 years ago
- MPEDS Annotation Interface ☆18 Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 Updated last year
- Data Donation Module: A Django application to set up and manage data donation projects. ☆23 Updated last week
- Machine-learning Protest Event Data System ☆37 Updated 4 months ago
- Hierarchical clustering of 2011-2022 Congress Twitter ☆30 Updated 2 years ago
- Browser extension to simulate browsing behaviour in search engines. ☆32 Updated 2 months ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims. ☆53 Updated 3 years ago
- This repository contains the raw data, code, and sources used to create an individual level and state municipal incorporation date datase… ☆23 Updated last month
- Tools for collecting social media data around focal events ☆84 Updated 3 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 6 years ago
- An R package for accessing the Facebook Ad Library API ☆74 Updated last year
- COMM 4940: Governing Human-Algorithm Behavior ☆21 Updated 9 months ago
- OSoMe Twitter tools, including a package like tweepy but for the v2 Twitter API. ☆30 Updated 2 years ago
- The COVID-19 Digital Observatory collects, aggregates, and distributes data from social media, search engine results, and Wikipedia to su… ☆10 Updated 4 years ago
- Ethnicolr implementation with new models in PyTorch ☆11 Updated 2 weeks ago
- Data, code, and methodology supporting BuzzFeed News' analysis of the 2016 U.S. Census Survey of Income and Program Participation ☆9 Updated 2 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison. ☆14 Updated 3 years ago
- Twitter & CrowdTangle Data Access and Analysis Workshop for the Social Identity and Morality Lab ☆12 Updated 3 years ago
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'" ☆38 Updated 9 months ago
- R package associated with Benoit, Munger and Spirling (2017) paper(s) ☆43 Updated 3 years ago
- ☆70 Updated 2 months ago