fpdetective / modCrawlerLinks
Crawler based on a modified browser to detect online tracking.
☆11Updated 2 years ago
Alternatives and similar repositories for modCrawler
Users that are interested in modCrawler are comparing it to the libraries listed below
Sorting:
- Artifact release for our IEEE Symposium on Security and Privacy 2021 paper entitled Fingerprinting the Fingerprinters: Learning to Detect…☆71Updated 4 years ago
- Simple Python 3 web crawler☆13Updated 5 years ago
- Python parser for Adblock Plus filters☆200Updated 6 years ago
- extract difference between two html pages☆32Updated last week
- Python client library for Google Safe Browsing API☆84Updated 2 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated last week
- Library for scraping, parsing, and analyzing privacy policies.☆17Updated 2 years ago
- List of Sanctions and Most wanted☆28Updated 8 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- A crawler that uses OpenWPM.☆12Updated 4 years ago
- Detect and classify pagination links☆104Updated last week
- code and data used to build a training dataset for dragnet models☆10Updated 5 years ago
- 🌐 Identifying & Linking Vendor Migrants and Aliases on Darknet Markets. (ACL 2023)☆22Updated 2 years ago
- Privacy browser extension for analyzing web traffic of visited websites☆31Updated this week
- DomainsProject.org HTTP worker☆24Updated 3 years ago
- Code and data release for our PETS 2018 paper: "I never signed up for this! Privacy implications of email tracking".☆46Updated 3 years ago
- Scripts for building a geo-located web corpus using Common Crawl data☆11Updated last month
- Repository for the defense mechanism of the paper "The Dangers of Human Touch: Fingerprinting Browser Extensions through User Actions"☆10Updated 3 years ago
- Script and sample dataset of all urban dictionary entry names (around 1.4 million total)☆95Updated 3 years ago
- Simple duckduckgo results scraping☆68Updated 8 years ago
- Auxiliary stuff☆36Updated this week
- A classifier for detecting soft 404 pages☆57Updated last week
- Run information flow experiments on the Web