Pyppeteer integration for Scrapy
☆58Feb 26, 2021Updated 5 years ago
Alternatives and similar repositories for scrapy-pyppeteer
Users that are interested in scrapy-pyppeteer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use pyppeteer from a Scrapy spider☆59Feb 5, 2020Updated 6 years ago
- Scrapy + Puppeteer☆110Jun 11, 2021Updated 5 years ago
- Scrapy Pyppeteer Demo☆12Jul 30, 2020Updated 5 years ago
- Downloader Middleware to support Pyppeteer in Scrapy & Gerapy☆132Dec 27, 2021Updated 4 years ago
- scrapy-redis-expiredupefilter是基于scrapy-redis修改来的一款scrapy分布式爬虫框架,它支持为请求指纹设置生命周期,请求指纹生命周期结束后将在不影响其他指纹的情况下自动清除。☆10Aug 6, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A linter for Scrapy projects.☆22Feb 25, 2026Updated 3 months ago
- Library that helps use puppeteer in scrapy.☆51Aug 1, 2025Updated 10 months ago
- Scrapy schema validation pipeline and Item builder using JSON Schema☆45Mar 26, 2021Updated 5 years ago
- Scrapy Extension for monitoring spiders execution.☆559May 28, 2026Updated 3 weeks ago
- A Scrapy extension to log items coverage when the spider shuts down☆18Apr 11, 2020Updated 6 years ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key☆21Feb 8, 2017Updated 9 years ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Oct 19, 2019Updated 6 years ago
- Downloader Middleware to support Selenium in Scrapy & Gerapy☆32Sep 13, 2020Updated 5 years ago
- Sentry component for Scrapy☆84Aug 21, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Crochet-based blocking API for Scrapy.☆47Feb 24, 2017Updated 9 years ago
- MongoDB extensions for Scrapy☆44Oct 2, 2014Updated 11 years ago
- A decorator to write coroutine-like spider callbacks.☆109Dec 26, 2022Updated 3 years ago
- 企查查企业分类信息采集☆43Apr 2, 2020Updated 6 years ago
- Page Object pattern for Scrapy☆127Jun 8, 2026Updated 2 weeks ago
- 基于Scrapy和DrissionPage的爬虫项目☆24Mar 19, 2025Updated last year
- Create Django REST APIs the right way, no magic intended☆11Dec 8, 2022Updated 3 years ago
- Configure WAL-E S3 backups for Postgres☆14Mar 27, 2018Updated 8 years ago
- web版抖音采集的一种解决方案☆19Jul 8, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于puppeteer和NodeJS的服务端渲染,提供Docker一键部署及API调用接口。☆19Aug 30, 2022Updated 3 years ago
- Scrapy spider middleware to clean up query parameters in request URLs☆24Jun 30, 2016Updated 9 years ago
- Scrapy Redis with Bloom Filter,support redis sentinel and cluster☆25Mar 31, 2023Updated 3 years ago
- 通用文章提取,正文,标题,时间,作者,图片,音视频,联系方式等☆23Mar 19, 2023Updated 3 years ago
- Random User-Agent middleware based on fake-useragent☆688Sep 18, 2023Updated 2 years ago
- Scrapy Redis Bloom Filter☆175Jul 25, 2021Updated 4 years ago
- 🎭 Playwright integration for Scrapy☆1,423Updated this week
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Mar 16, 2022Updated 4 years ago
- Convert Javascript code to an XML document☆188Mar 14, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Scrapy Tutorial☆11Feb 19, 2017Updated 9 years ago
- A Python wrapper for working with Scrapyd's API.☆269Jul 31, 2024Updated last year
- A curated list of awesome open source projects for creating agentic operating systems, AI agents, and tools for the future of autonomous …☆35Mar 25, 2026Updated 3 months ago
- Modular, way of implementing rate-limiting in python with a few handy default implementations☆64Mar 27, 2023Updated 3 years ago
- ☆10Oct 24, 2021Updated 4 years ago
- Distributed crawling/scraping, Kafka And Redis based components for Scrapy☆46Nov 13, 2020Updated 5 years ago
- adsl拨号代理池☆10May 22, 2023Updated 3 years ago