fpdetective / modCrawlerLinks
Crawler based on a modified browser to detect online tracking.
☆11Updated 2 years ago
Alternatives and similar repositories for modCrawler
Users that are interested in modCrawler are comparing it to the libraries listed below
Sorting:
- List of Sanctions and Most wanted☆28Updated 8 years ago
- Simple Python 3 web crawler☆13Updated 5 years ago
- extract difference between two html pages☆32Updated last week
- Python client library for Google Safe Browsing API☆84Updated 2 years ago
- Script and sample dataset of all urban dictionary entry names (around 1.4 million total)☆95Updated 3 years ago
- Public code release for The Web Never Forgets paper☆69Updated 4 years ago
- Python library for IP2Proxy database lookup. It can be used to find the IP addresses which are used as VPN anonymizer, open proxies, web…☆36Updated 3 weeks ago
- A Stylometry Library for Python☆147Updated 2 years ago
- Resources, articles, thoughts, datasets, papers on TI tradecraft☆11Updated 7 years ago
- Library for scraping, parsing, and analyzing privacy policies.☆17Updated 2 years ago
- Auxiliary stuff☆36Updated this week
- code and data used to build a training dataset for dragnet models☆10Updated 5 years ago
- The FourthParty web measurement platform.☆44Updated 10 years ago
- Collect email addresses by crawling search engine results.☆30Updated 3 years ago
- Code and data release for our PETS 2018 paper: "I never signed up for this! Privacy implications of email tracking".☆46Updated 3 years ago
- A crawler that uses OpenWPM.☆12Updated 4 years ago
- OSINT tool for Instagram☆15Updated 6 years ago
- FBLYZE is a Facebook scraping system and analysis system.☆67Updated 4 years ago
- A platform to study browser fingerprinting☆73Updated 2 years ago
- A very fast whois crawler☆40Updated 6 years ago
- Links to resources on misinformation, disinformation, fake news, whatever it's called this week☆52Updated 3 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated last week
- A polite and user-friendly downloader for Common Crawl data☆67Updated 5 months ago
- Data Feed Manager (news watch orchestrator to predict topic with deepdetect and store cleaned text in elasticsearch)☆40Updated 3 years ago
- A crawler based on Tor Browser and Selenium☆55Updated 4 years ago
- Formasaurus tells you the type of an HTML form and its fields using machine learning☆119Updated last week
- Extract text from HTML☆134Updated last week
- Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one i…☆11Updated 9 years ago
- Website to view the information fingerprint that your browser exposes to third parties.☆61Updated 7 years ago
- Adaptive crawler which uses Reinforcement Learning methods☆168Updated last week