ThomasAitken / scrapy-sessions
A session-management extension for Scrapy.
☆10Updated last year
Alternatives and similar repositories for scrapy-sessions:
Users that are interested in scrapy-sessions are comparing it to the libraries listed below
- Pyppeteer integration for Scrapy☆58Updated 4 years ago
- Page Object pattern for Scrapy☆121Updated 2 months ago
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.☆149Updated last month
- aiohttp-like interface to chromium. based on selenium_driverless to bypass cloudflare☆53Updated 5 months ago
- Web scraping Page Objects core library☆99Updated 2 months ago
- Library to populate items using XPath and CSS with a convenient API☆48Updated last month
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- A Scrapy middleware to bypass the CloudFlare's anti-bot protection☆110Updated 3 years ago
- Zyte API integration for Scrapy☆38Updated 2 weeks ago
- A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.☆91Updated 3 months ago
- A scrapy extension to sync `.scrapy` folder to an S3 bucket☆17Updated 3 years ago
- Python client and types generator for the Chrome DevTools Protocol (CDP)☆70Updated last month
- A complimentary proxy to help to use SPM with headless browsers☆108Updated last year
- Scrapy + Puppeteer☆110Updated 3 years ago
- A reimplementation of the Selenium API, emulating human interactions☆75Updated last year
- Browser fingerprint data generator☆52Updated 3 weeks ago
- ScrapingAnt API client for Python.☆41Updated 9 months ago
- Scrapy project template. Use it to quickly spin up a new web scraping project☆17Updated 5 months ago
- Modern tests to detect automated browser behavior. Cover most important leaks from Puppeteer and Playwright.☆70Updated 6 months ago
- 💻 A random user-agent generator.☆102Updated last week
- A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them☆65Updated 2 years ago
- Patching CDP (Chrome DevTools Protocol) leaks on OS level. Easy to use with Playwright, Selenium, and other web automation tools.☆112Updated 8 months ago
- ☆29Updated 3 years ago
- ☆58Updated last year
- Common interface for data container classes☆67Updated last month
- A middleware of cookies persistence for Scrapy☆28Updated last month
- undetected chromedriver Docker☆31Updated last year
- Implement scrapy with asyncio☆63Updated 6 months ago
- The most advanced debugging and testing tool for Scrapy☆16Updated 2 years ago
- Asyncio web crawling framework. Work in progress.☆18Updated 8 months ago