zytedata/zyte-spider-templates

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zytedata/zyte-spider-templates)

zytedata / zyte-spider-templates

Spider templates for automatic crawlers.

☆35

Alternatives and similar repositories for zyte-spider-templates

Users that are interested in zyte-spider-templates are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zytedata / zyte-spider-templates-project
View on GitHub
☆23Mar 18, 2026Updated 4 months ago
zytedata / html-text
View on GitHub
☆20Oct 6, 2025Updated 9 months ago
scrapy / xtractmime
View on GitHub
https://mimesniff.spec.whatwg.org/ implementation for Python
☆13Jul 9, 2026Updated 2 weeks ago
scrapinghub / andi
View on GitHub
Library for annotation-based dependency injection
☆24Jul 21, 2026Updated last week
scrapinghub / shub-workflow
View on GitHub
☆14Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
scrapy / pypydispatcher
View on GitHub
A fork of http://pydispatcher.sourceforge.net/ with PyPy support
☆16Jul 3, 2017Updated 9 years ago
scrapinghub / web-poet
View on GitHub
Web scraping Page Objects core library
☆107Jul 10, 2026Updated 2 weeks ago
zytedata / clear-html
View on GitHub
Remove DIVs, style stuff and normalize HTML preserving structure information
☆14Oct 24, 2025Updated 9 months ago
scrapy-plugins / scrapy-zyte-api
View on GitHub
Zyte API integration for Scrapy
☆43Updated this week
scrapy / itemadapter
View on GitHub
Common interface for data container classes
☆70Updated this week
tiefling-cat / ru-syntax
View on GitHub
Repository for ru-syntax command line tool.
☆15Mar 8, 2022Updated 4 years ago
scrapinghub / scrapy-poet
View on GitHub
Page Object pattern for Scrapy
☆127Jun 8, 2026Updated last month
zytedata / zyte-autoextract
View on GitHub
Python clients for Zyte AutoExtract API
☆41Jan 17, 2022Updated 4 years ago
scrapy / scrapy-bench
View on GitHub
A CLI for benchmarking Scrapy.
☆32Jun 28, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
scrapy / protego
View on GitHub
A pure-Python robots.txt parser with support for modern conventions.
☆90Updated this week
scrapinghub / page_clustering
View on GitHub
A simple algorithm for clustering web pages, suitable for crawlers
☆33Mar 6, 2017Updated 9 years ago
rmax / scrapy-boilerplate
View on GitHub
Small set of utilities to simplify writing Scrapy spiders.
☆50Jul 24, 2015Updated 11 years ago
TeamHG-Memex / extract-html-diff
View on GitHub
extract difference between two html pages
☆33Apr 8, 2026Updated 3 months ago
rytilahti / homeassistant-mpris-bridge
View on GitHub
Control your Home Assistant media players from your desktop using MPRIS
☆32Aug 23, 2024Updated last year
judell / av
View on GitHub
HTML5 audio/video clipper
☆12Mar 7, 2018Updated 8 years ago
luizdepra / r8
View on GitHub
A simple CHIP8 interpreter made with Rust.
☆11Apr 23, 2026Updated 3 months ago
fattmarley / cbbscraper
View on GitHub
College Basketball web scraper that pulls predicted scores from Kenpom HaslaMetrics and BartTorvik as well as betting lines from Fanduel …
☆11Jan 7, 2021Updated 5 years ago
lopuhin / kaggle-jigsaw-2019
View on GitHub
☆14Jun 27, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
scrapinghub / scrapy-autounit
View on GitHub
Automatic unit test generation for Scrapy.
☆58Jul 12, 2021Updated 5 years ago
datasette / datasette-scribe
View on GitHub
☆10Jun 23, 2026Updated last month
TeamHG-Memex / tor-proxy
View on GitHub
a tor socks proxy docker image
☆12Apr 8, 2026Updated 3 months ago
ghk / kawaldesa
View on GitHub
Aplikasi transparansi penyaluran dan realisasi dana desa
☆13Dec 9, 2015Updated 10 years ago
dialogue-evaluation / morphoRuEval-2017
View on GitHub
☆50Nov 20, 2017Updated 8 years ago
teamSolutionAnalysts / link-preview
View on GitHub
NodeJS Plugin to fetch URL Meta Data for Preview
☆12Mar 13, 2019Updated 7 years ago
scrapinghub / webpager
View on GitHub
Paginating the web
☆37Feb 11, 2014Updated 12 years ago
develer-staff / qt-pyqt-sdk-builder
View on GitHub
Create your custom Qt + PyQt SDK for multiple platforms
☆10Jun 7, 2019Updated 7 years ago
bdarnell / auto2to3
View on GitHub
Wrapper to run 2to3 automatically at import time
☆13Dec 9, 2011Updated 14 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dholth / hello-pyrust
View on GitHub
A “Hello World” of calling Rust code from a Python program with CFFI, in order to show packaging issues
☆11Jul 14, 2016Updated 10 years ago
infoculture / mosopendata
View on GitHub
Parser and data from data.mos.ru. / Парсер и данные для портала открытых данных Москвы data.mos.ru
☆18Aug 24, 2014Updated 11 years ago
scrapinghub / aile
View on GitHub
Automatic Item List Extraction
☆85Jun 15, 2016Updated 10 years ago
TeamHG-Memex / undercrawler
View on GitHub
A generic crawler
☆81Apr 8, 2026Updated 3 months ago
acdha / django-performance-tools
View on GitHub
EXPERIMENTAL Django performance monitoring utilities
☆15Nov 5, 2013Updated 12 years ago
scrapinghub / mdr
View on GitHub
A python library detect and extract listing data from HTML page.
☆110May 5, 2017Updated 9 years ago
ZuInnoTe / scrapy-contrib-bigexporters
View on GitHub
Scrapy exporter for Big Data formats
☆16Mar 10, 2026Updated 4 months ago