ScrapeOps / scrapeops-scrapy-sdk
Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the box.
☆36Updated 6 months ago
Alternatives and similar repositories for scrapeops-scrapy-sdk:
Users that are interested in scrapeops-scrapy-sdk are comparing it to the libraries listed below
- Web scraping Page Objects core library☆96Updated last week
- Library that helps use puppeteer in scrapy.☆52Updated 3 weeks ago
- 🕷️ Scrapyd is an application for deploying and running Scrapy spiders.☆82Updated last week
- A python package for finding e-mails, checking deliverability and more.☆60Updated 9 months ago
- More flexible and featured Frontera scheduler for Scrapy☆36Updated 2 months ago
- Page Object pattern for Scrapy☆118Updated last week
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- Scrapy + Puppeteer☆111Updated 3 years ago
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- Common interface for data container classes☆66Updated last week
- ☆29Updated 3 years ago
- Spider templates for automatic crawlers.☆27Updated 2 weeks ago
- Extract text from HTML☆133Updated 4 years ago
- Software stack with latest Scrapy and updated deps☆63Updated last week
- Parsing JavaScript objects into Python data structures☆202Updated last month
- estela, an elastic web scraping cluster 🕸☆176Updated 3 weeks ago
- JavaScript support and proxy rotation for Scrapy with ScrapingBee.☆38Updated 9 months ago
- Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 6 years ago
- ☆164Updated 4 years ago
- This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.☆29Updated last year
- admin ui for scrapy/open source scrapinghub☆58Updated 3 years ago
- ScrapingAnt API client for Python.☆36Updated 7 months ago
- Detect and classify pagination links☆101Updated 4 years ago
- Pyppeteer integration for Scrapy☆59Updated 3 years ago
- 🕶 Awesome list of Scrapy tools and libraries☆59Updated 4 years ago
- Web grep: search all rendered resources used by a URI☆85Updated 7 months ago
- Scrapy Extension for monitoring spiders execution.☆539Updated 2 months ago
- Scrapy project boilerplate done right☆45Updated last week
- For the filthiest web scrapers that have no time for rate-limits.☆18Updated 4 years ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆263Updated last year