scrapy-plugins / scrapy-playwright
Playwright integration for Scrapy
★1,126 · Updated 3 weeks ago
Alternatives and similar repositories for scrapy-playwright:
Users who are interested in scrapy-playwright are comparing it to the libraries listed below.
- Scrapy middleware to handle JavaScript pages using Selenium ★939 · Updated 8 months ago
- playwright stealth ★632 · Updated 7 months ago
- Command line client for Scrapyd server ★773 · Updated last week
- Use multiple proxies with Scrapy ★754 · Updated 2 years ago
- Random User-Agent middleware based on fake-useragent ★694 · Updated last year
- Scrapy Extension for monitoring spiders execution. ★539 · Updated 3 months ago
- Parsing JavaScript objects into Python data structures ★202 · Updated last week
- A service daemon to run Scrapy spiders ★3,013 · Updated 3 weeks ago
- HTTP API for Scrapy spiders ★850 · Updated 8 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ★1,202 · Updated last week
- Random proxy middleware for Scrapy ★1,665 · Updated 5 years ago
- Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. … ★3,245 · Updated 3 weeks ago
- Scrapy+Splash for JavaScript integration ★3,192 · Updated last month
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community. ★543 · Updated 2 years ago
- ★128 · Updated last year
- Extends Selenium's Python bindings to give you the ability to inspect requests made by the browser. ★1,947 · Updated last year
- Page Object pattern for Scrapy ★120 · Updated last month
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors). ★1,227 · Updated 3 weeks ago
- Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy ★364 · Updated last month
- ★164 · Updated 5 years ago
- Scrapy download handler that can impersonate browsers' TLS signatures or JA3 fingerprints. ★134 · Updated last month
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls ★269 · Updated 2 weeks ago
- A Scrapy middleware to bypass Cloudflare's anti-bot protection ★108 · Updated 3 years ago
- ★258 · Updated 4 years ago
- Lightweight, scriptable browser as a service with an HTTP API ★4,130 · Updated 7 months ago
- ★2,126 · Updated 3 months ago
- Downloader Middleware to support Playwright in Scrapy & Gerapy ★110 · Updated 3 years ago
- A scalable frontier for web crawlers ★1,307 · Updated last month
- Python binding for curl-impersonate fork via cffi. An HTTP client that can impersonate browser TLS/JA3/HTTP2 fingerprints. ★3,215 · Updated this week
- Undetected version of the Playwright testing and automation library. ★499 · Updated 3 weeks ago