fpdetective / modCrawler
Crawler based on a modified browser to detect online tracking.
☆11 Updated 2 years ago
Alternatives and similar repositories for modCrawler
Users who are interested in modCrawler are comparing it to the libraries listed below.
- Artifact release for our IEEE Symposium on Security and Privacy 2021 paper entitled Fingerprinting the Fingerprinters: Learning to Detect… ☆71 Updated 4 years ago
- extract difference between two html pages ☆32 Updated 7 years ago
- List of Sanctions and Most wanted ☆29 Updated 8 years ago
- Python parser for Adblock Plus filters ☆199 Updated 6 years ago
- Simple heuristic for measuring web page similarity (& data set) ☆91 Updated 7 years ago
- Python library for IP2Proxy database lookup. It can be used to find the IP addresses which are used as VPN anonymizer, open proxies, web… ☆36 Updated 3 weeks ago
- API client for Aleph, supports bulk entity and document upload. ☆28 Updated last year
- Public code release for The Web Never Forgets paper ☆68 Updated 3 years ago
- FBLYZE is a Facebook scraping system and analysis system. ☆65 Updated 4 years ago
- Show summary of a large number of URLs in a Jupyter Notebook ☆17 Updated 4 years ago
- Site Hound (previously THH) is a Domain Discovery Tool ☆23 Updated 4 years ago
- Code release for: Cookies that give you away: The surveillance implications of web tracking ☆53 Updated 6 years ago
- A platform to study browser fingerprinting ☆72 Updated 2 years ago
- Simple Python 3 web crawler ☆13 Updated 5 years ago
- A collection of tools for working with and analyzing Tracking Protection as implemented in Firefox ☆19 Updated 2 years ago
- Resources, articles, thoughts, datasets, papers on TI tradecraft ☆11 Updated 7 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t… ☆33 Updated 2 years ago
- Statistical Anomaly Detector of Internet Traffic (SADIT) ☆22 Updated 8 years ago
- Python project to create a classifier to guess if a Twitter account is a man, a woman or a bot. ☆18 Updated 6 years ago
- Code and data release for our PETS 2018 paper: "I never signed up for this! Privacy implications of email tracking". ☆46 Updated 3 years ago
- Auxiliary stuff ☆35 Updated last week
- A polite and user-friendly downloader for Common Crawl data ☆59 Updated 3 months ago
- Open Source testing framework for image correlation, distance and analysis ☆44 Updated 2 years ago
- Scrapes the obfuscated proxy list at proxylist.hidemyass.com ☆32 Updated 6 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 Updated 7 years ago
- The FourthParty web measurement platform. ☆44 Updated 10 years ago
- Library for scraping, parsing, and analyzing privacy policies. ☆17 Updated 2 years ago
- Scrape google search results ☆93 Updated 7 years ago
- A generic crawler ☆78 Updated 7 years ago
- Python client library for Google Safe Browsing API ☆84 Updated 2 years ago