ispras / scrapy-puppeteerLinks
Library that helps use puppeteer in scrapy.
☆52Updated last month
Alternatives and similar repositories for scrapy-puppeteer
Users that are interested in scrapy-puppeteer are comparing it to the libraries listed below
Sorting:
- Page Object pattern for Scrapy☆123Updated last week
- Common interface for data container classes☆68Updated 2 weeks ago
- Pyppeteer integration for Scrapy☆58Updated 4 years ago
- Scrapy + Puppeteer☆110Updated 4 years ago
- Web scraping Page Objects core library☆102Updated 2 weeks ago
- A complimentary proxy to help to use SPM with headless browsers☆108Updated 2 years ago
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- ☆29Updated 4 years ago
- Scrapfly Python SDK for headless browsers and proxy rotation☆45Updated 2 months ago
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- 🕷️ Scrapyd is an application for deploying and running Scrapy spiders.☆85Updated 2 months ago
- Parse numbers written in natural language☆119Updated 8 months ago
- Scrapy project boilerplate done right☆48Updated 5 months ago
- More flexible and featured Frontera scheduler for Scrapy☆37Updated last month
- Parsing JavaScript objects into Python data structures☆209Updated 2 weeks ago
- A special service that runs puppeteer instances.☆17Updated last month
- estela, an elastic web scraping cluster 🕸☆184Updated last month
- Zyte API integration for Scrapy☆38Updated 2 months ago
- Software stack with latest Scrapy and updated deps☆63Updated last week
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆142Updated 6 months ago
- Extract text from HTML☆134Updated 4 years ago
- Automatic unit test generation for Scrapy.☆57Updated 4 years ago
- Python client for Zyte API☆26Updated last month
- Trying scrapy-playwright on Scrapy Cloud☆22Updated last year
- Repository for the Mastering Web Scraping in Python: Scaling to Distributed Crawling blogpost with the final code.☆44Updated 3 years ago
- Spider templates for automatic crawlers.☆30Updated 2 weeks ago
- A pure-Python robots.txt parser with support for modern conventions.☆70Updated 3 weeks ago
- Library to populate items using XPath and CSS with a convenient API☆48Updated 2 weeks ago
- ☆20Updated 3 months ago
- JavaScript support and proxy rotation for Scrapy with ScrapingBee.☆38Updated last year