facebookresearch / URL-SanitizationLinks
The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private and/or sensitive data. It is part of the Facebook URL shares release effort, which is led by Election Research Commission (ERC).
☆23Updated 3 years ago
Alternatives and similar repositories for URL-Sanitization
Users that are interested in URL-Sanitization are comparing it to the libraries listed below
Sorting:
- Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Tak…☆69Updated 3 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- The documentation and scripts for the Local News Dataset☆25Updated 3 years ago
- R package associated with Benoit, Munger and Spirling (2017) paper(s)☆43Updated 4 years ago
- Tools for collecting social media data around focal events☆84Updated 3 years ago
- Data, code, and methodology supporting BuzzFeed News' analysis of the 2016 U.S. Census Survey of Income and Program Participation☆9Updated 2 years ago
- MPEDS Annotation Interface☆18Updated 2 years ago
- COMM 4940: Governing Human-Algorithm Behavior☆21Updated last year
- Investigating how COVID-19 shaped Anti-Asian Climate☆12Updated 3 years ago
- TikTok Content Scraper -- No API-Key needed, minimal dependencies, citable | Download videos (MP4), slides (JPEG) and metadata of author,…☆27Updated 3 weeks ago
- A maximum-strength name parser for record linkage.☆37Updated last week
- Text Thresher crowd sourced text annotator☆17Updated 7 years ago
- Hierarchical clustering of 2011-2022 Congress Twitter☆29Updated 2 years ago
- Twitter & Crowdtangle Data Access and Analysis Workshop for the Social Identity and Morality Lab☆12Updated 3 years ago
- Analyzing MTurk demographics☆14Updated last year
- This is a library of R scripts for the large-scale analysis of texts.☆13Updated 4 months ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆52Updated 3 years ago
- https://www.washingtonpost.com/graphics/2020/investigations/helicopter-protests-washington-dc-national-guard/☆23Updated 5 years ago
- OCCRP and media partners collected data on COVID-19 related spending from across Europe from February to October 2020☆13Updated 4 years ago
- smappdragon is a set of tools for working with twitter data.☆29Updated 6 years ago
- Tutorials for Stance Detection: A practical guide☆22Updated 2 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 4 years ago
- RECSM-UPF Summer School: Social Media and Big Data Research☆22Updated 7 years ago
- Accessing the Facebook Marketing API using httr in R, for demographic researchers☆21Updated 7 years ago
- NamSor API v2 R SDK - classify personal names accurately by gender, country of origin, or ethnicity.☆12Updated 4 years ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara☆14Updated last year
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- ☆38Updated last year