elacuesta/scrapy-pyppeteer

Pyppeteer integration for Scrapy

☆58

Alternatives and similar repositories for scrapy-pyppeteer

Users that are interested in scrapy-pyppeteer are comparing it to the libraries listed below.

lopuhin / scrapy-pyppeteer
Use pyppeteer from a Scrapy spider
☆59 · Feb 5, 2020 · Updated 6 years ago
clemfromspace / scrapy-puppeteer
Scrapy + Puppeteer
☆110 · Jun 11, 2021 · Updated 5 years ago
Gerapy / GerapyPyppeteer
Downloader Middleware to support Pyppeteer in Scrapy & Gerapy
☆132 · Dec 27, 2021 · Updated 4 years ago
AaronJny / scrapy_redis_expiredupefilter
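scrapy-redis-expiredupefilter is a distributed Scrapy crawling framework adapted from scrapy-redis. It supports setting a lifetime for request fingerprints; when a fingerprint's lifetime ends, it is removed automatically without affecting other fingerprints.
☆10 · Aug 6, 2019 · Updated 6 years ago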
scrapinghub / autoextract-spiders
Pre-built Scrapy spiders for AutoExtract
☆19 · Apr 24, 2024 · Updated 2 years ago
ispras / scrapy-puppeteer
Library that helps use puppeteer in scrapy.
☆51 · Aug 1, 2025 · Updated 11 months ago
scrapy / scrapy-lint
A linter for Scrapy projects.
☆22 · Jul 7, 2026 · Updated last week
scrapy-plugins / scrapy-jsonschema
Scrapy schema validation pipeline and Item builder using JSON Schema
☆45 · Mar 26, 2021 · Updated 5 years ago
scrapinghub / spidermon
Scrapy Extension for monitoring spiders execution.
☆560 · May 28, 2026 · Updated last month
stummjr / scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
☆18 · Apr 11, 2020 · Updated 6 years ago
scrapinghub / flatson
Tool to flatten stream of JSON-like objects, configured via schema
☆33 · Oct 19, 2019 · Updated 6 years ago
scrapy-plugins / scrapy-splitvariants
Scrapy spider middleware to split an item into multiple items using a multi-valued key
☆21 · Feb 8, 2017 · Updated 9 years ago
Gerapy / GerapySelenium
Downloader Middleware to support Selenium in Scrapy & Gerapy
☆32 · Sep 13, 2020 · Updated 5 years ago
llonchj / scrapy-sentry
Sentry component for Scrapy
☆84 · Aug 21, 2023 · Updated 2 years ago
rmax / scrapydo
Crochet-based blocking API for Scrapy.
☆47 · Feb 24, 2017 · Updated 9 years ago
scrapinghub / scmongo
MongoDB extensions for Scrapy
☆44 · Oct 2, 2014 · Updated 11 years ago
rmax / scrapy-inline-requests
A decorator to write coroutine-like spider callbacks.
☆109 · Dec 26, 2022 · Updated 3 years ago
x-bessie / AggregationNews
A news aggregation website with a distributed crawler, implemented in Java EE using the SSM framework
☆18 · Dec 15, 2022 · Updated 3 years ago
scrapinghub / scrapy-poet
Page Object pattern for Scrapy
☆127 · Jun 8, 2026 · Updated last month
xyuns-cc / Scrapy_DrissionPage
A crawler project based on Scrapy and DrissionPage
☆24 · Mar 19, 2025 · Updated last year
soul-cat / DouYinSign
A solution for scraping the web version of Douyin
☆19 · Jul 8, 2020 · Updated 6 years ago
scrapy-plugins / scrapy-querycleaner
Scrapy spider middleware to clean up query parameters in request URLs
☆24 · Jun 30, 2016 · Updated 10 years ago
mic1on / puppeteer-render
Server-side rendering based on puppeteer and NodeJS, with one-click Docker deployment and an API for making calls.
☆19 · Aug 30, 2022 · Updated 3 years ago
leffss / ScrapyRedisBloomFilterBlockCluster
Scrapy Redis with Bloom Filter, supports redis sentinel and cluster
☆25 · Mar 31, 2023 · Updated 3 years ago
kingking888 / CommNewsExtractor
General-purpose article extraction: body text, title, time, author, images, audio and video, contact information, and more
☆23 · Mar 19, 2023 · Updated 3 years ago
alecxe / scrapy-fake-useragent
Random User-Agent middleware based on fake-useragent
☆688 · Sep 18, 2023 · Updated 2 years ago
Python3WebSpider / ScrapyRedisBloomFilter
Scrapy Redis Bloom Filter
☆175 · Jul 25, 2021 · Updated 4 years ago
scrapy-plugins / scrapy-playwright
🎭 Playwright integration for Scrapy
☆1,428 · Updated this week
scrapy-plugins / scrapy-magicfields
Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.
☆56 · Mar 16, 2022 · Updated 4 years ago
scrapinghub / js2xml
Convert Javascript code to an XML document
☆188 · Mar 14, 2022 · Updated 4 years ago
maxnoodles / wechat_app_spider
Scrapes WeChat official account information from the mobile app using airtest + mitmproxy
☆38 · Nov 14, 2019 · Updated 6 years ago
djm / python-scrapyd-api
A Python wrapper for working with Scrapyd's API.
☆269 · Jul 31, 2024 · Updated last year
ThomasAitken / Scrapy-Testmaster
The most advanced debugging and testing tool for Scrapy
☆16 · Apr 19, 2023 · Updated 3 years ago
sigmavirus24 / rush
A modular way of implementing rate limiting in Python with a few handy default implementations
☆64 · Mar 27, 2023 · Updated 3 years ago
mathculthello / math.ru
The math.ru website
☆14 · Jan 7, 2023 · Updated 3 years ago
kerlomz / muggle-speech
Chinese speech recognition
☆24 · May 25, 2022 · Updated 4 years ago
jbrocher / hashnode-testing-database-fastapi
☆10 · Oct 24, 2021 · Updated 4 years ago
tenlee2012 / scrapy-kafka-redis
Distributed crawling/scraping, Kafka And Redis based components for Scrapy
☆45 · Nov 13, 2020 · Updated 5 years ago
Gerapy / GerapyProxy
A package for supporting proxy in Scrapy & Gerapy
☆11 · Jul 15, 2020 · Updated 5 years ago