jkelin / forward-proxy-managerLinks
Request distributor for web scraping
☆14Updated 8 months ago
Alternatives and similar repositories for forward-proxy-manager
Users that are interested in forward-proxy-manager are comparing it to the libraries listed below
Sorting:
- The Architecture of a Web Crawler: Building a Google-Inspired Distributed Web Crawler☆125Updated last year
- Minimal set of tools to conduct stealthy scraping.☆162Updated 2 years ago
- undetected chromedriver in Docker container based on Alpine Linux☆27Updated last year
- Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.☆488Updated 3 weeks ago
- A test suite of common scraper detection techniques. See how detectable your scraper stack is.☆141Updated 3 years ago
- A Scrapy middleware to bypass the CloudFlare's anti-bot protection☆111Updated 4 years ago
- use multiple proxies with Scrapy☆771Updated 2 weeks ago
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driver☆90Updated 7 months ago
- anti-bot-detection with rod☆313Updated last year
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.☆217Updated 3 weeks ago
- Undetectable browser automation in Docker using Python/Zendriver. Full VNC debugging support.☆78Updated 7 months ago
- create your rotating proxy server with docker. self hosted rotating proxy service.☆178Updated 3 months ago
- fork of fhttp with fixed window_update and removal of gzip/flate/brotli decoding☆21Updated 4 years ago
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆70Updated 4 years ago
- Introduction to JA3 Fingerprint and how to impersonate it.☆62Updated 5 years ago
- Patching CDP (Chrome DevTools Protocol) leaks on OS level. Easy to use with Playwright, Selenium, and other web automation tools.☆154Updated 4 months ago
- a stealthy browser automation framework☆842Updated 9 months ago
- A suite of tools for protecting the web's open knowledge.☆127Updated last year
- The Web Scraping Club Free Repository☆158Updated 3 months ago
- A Python library for solving reCAPTCHA v2 and v3 with Playwright☆474Updated 2 weeks ago
- This repository provides usage examples for the Python module Newspaper3k.☆151Updated 2 years ago
- 🧱 A uniform template to use as a foundation for Puppeteer bot construction.☆68Updated 4 years ago
- Distributed crawling infrastructure running on top of severless computation, cloud storage (such as S3) and sophisticated queues.☆436Updated 3 years ago
- Anonymous automation via selenium with fingerprint replacement technology.☆122Updated 3 months ago
- rotating open proxy multiplexer☆192Updated 3 weeks ago
- A fork of https://github.com/AtuboDad/playwright_stealth☆171Updated 3 weeks ago
- A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them☆72Updated 3 years ago
- playwright stealth☆883Updated last year
- 🦉Gracefully face reCAPTCHA challenge with ultralytics YOLOv8-seg, CLIPs VIT-B/16 and CLIP-Seg/RD64. Implemented in playwright or an easy…☆225Updated last month
- 📡 Renew the IP address of a tethered Android device via Node asynchronously.☆75Updated 2 years ago