scrapy-plugins/scrapy-magicfields

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scrapy-plugins/scrapy-magicfields)

scrapy-plugins / scrapy-magicfields

Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.

☆56

Alternatives and similar repositories for scrapy-magicfields

Users that are interested in scrapy-magicfields are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

scrapy-plugins / scrapy-deltafetch
View on GitHub
Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls
☆276Feb 26, 2025Updated last year
scrapy-plugins / scrapy-splitvariants
View on GitHub
Scrapy spider middleware to split an item into multiple items using a multi-valued key
☆21Feb 8, 2017Updated 9 years ago
scrapy-plugins / scrapy-querycleaner
View on GitHub
Scrapy spider middleware to clean up query parameters in request URLs
☆24Jun 30, 2016Updated 10 years ago
scrapy-plugins / scrapy-dotpersistence
View on GitHub
A scrapy extension to sync `.scrapy` folder to an S3 bucket
☆18Mar 28, 2022Updated 4 years ago
scrapy-plugins / scrapy-pagestorage
View on GitHub
A scrapy extension to store requests and responses information in storage service
☆27Mar 11, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
stummjr / scrapy-fieldstats
View on GitHub
A Scrapy extension to log items coverage when the spider shuts down
☆18Apr 11, 2020Updated 6 years ago
scrapy-plugins / scrapy-jsonschema
View on GitHub
Scrapy schema validation pipeline and Item builder using JSON Schema
☆45Mar 26, 2021Updated 5 years ago
scrapinghub / spidermon
View on GitHub
Scrapy Extension for monitoring spiders execution.
☆561May 28, 2026Updated last month
scrapinghub / web-poet
View on GitHub
Web scraping Page Objects core library
☆107Jul 10, 2026Updated last week
scrapy / scrapy-bench
View on GitHub
A CLI for benchmarking Scrapy.
☆32Jun 28, 2025Updated last year
scrapedia / scrapy-pipelines
View on GitHub
A collection of pipelines for Scrapy
☆16Apr 27, 2026Updated 2 months ago
scrapinghub / js2xml
View on GitHub
Convert Javascript code to an XML document
☆188Mar 14, 2022Updated 4 years ago
scrapy-plugins / scrapy-streaming
View on GitHub
☆19Oct 12, 2016Updated 9 years ago
Tiago-Lira / scrapyd-mongodb
View on GitHub
Library designed to replace the SQLite backend by a MongoDB backend on Scrapy queue management
☆17Sep 2, 2017Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
AccordBox / awesome-scrapy
View on GitHub
A curated list of awesome packages, articles, and other cool resources from the Scrapy community.
☆561Dec 28, 2022Updated 3 years ago
llonchj / scrapy-sentry
View on GitHub
Sentry component for Scrapy
☆84Aug 21, 2023Updated 2 years ago
scrapinghub / arche
View on GitHub
Analyze scraped data
☆47Dec 9, 2019Updated 6 years ago
stav / wgrep
View on GitHub
Web grep: search all rendered resources used by a URI
☆91Nov 21, 2025Updated 8 months ago
mrt-kousha / scrapy
View on GitHub
In this repository, I try to share some of the little tips and tricks and amazing spiders that I used to work with on the scrapy framewor…
☆12Feb 2, 2020Updated 6 years ago
scrapy / scrapyd-client
View on GitHub
Command line client for Scrapyd server
☆772Feb 27, 2026Updated 4 months ago
scrapy-plugins / scrapy-jsonrpc
View on GitHub
Scrapy extension to control spiders using JSON-RPC
☆299Aug 26, 2019Updated 6 years ago
scrapinghub / shub
View on GitHub
Scrapinghub Command Line Client
☆129Updated this week
istresearch / scrapy-cluster
View on GitHub
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,225Nov 7, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
scrapinghub / scrapyrt
View on GitHub
HTTP API for Scrapy spiders
☆882Jun 29, 2026Updated 3 weeks ago
speakol-ads / scrapy-x
View on GitHub
a high-performance, lightweight and human friendly serving engine for scrapy
☆29Mar 17, 2025Updated last year
scrapedia / r18
View on GitHub
A scrapy spider for R18
☆16Jun 1, 2026Updated last month
stummjr / books_crawler
View on GitHub
A Scrapy crawler for http://books.toscrape.com
☆27May 26, 2017Updated 9 years ago
scrapinghub / testspiders
View on GitHub
Useful test spiders for Scrapy
☆184Jan 20, 2020Updated 6 years ago
schubergphilis / data-migrator
View on GitHub
A declarative data-migration package
☆16Dec 7, 2024Updated last year
alecxe / scrapy-fake-useragent
View on GitHub
Random User-Agent middleware based on fake-useragent
☆688Sep 18, 2023Updated 2 years ago
scrapinghub / scmongo
View on GitHub
MongoDB extensions for Scrapy
☆44Oct 2, 2014Updated 11 years ago
scrapinghub / extruct
View on GitHub
Extract embedded metadata from HTML markup
☆966Apr 1, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
scrapy-plugins / scrapy-zyte-smartproxy
View on GitHub
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
☆363May 4, 2026Updated 2 months ago
scrapy / queuelib
View on GitHub
Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python
☆299Jun 26, 2026Updated 3 weeks ago
realslimshanky / Spider-Sense
View on GitHub
A browser extension to monitor your spiders deployed on Scrapy Cloud.
☆16Mar 8, 2025Updated last year
scrapy-plugins / scrapy-splash
View on GitHub
Scrapy+Splash for JavaScript integration
☆3,229Feb 11, 2025Updated last year
sebdah / scrapy-mongodb
View on GitHub
MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item…
☆358Apr 6, 2021Updated 5 years ago
scrapinghub / splash
View on GitHub
Lightweight, scriptable browser as a service with an HTTP API
☆4,190Aug 2, 2024Updated last year
codinglab2017 / Coding_Lab
View on GitHub
☆19Jun 15, 2017Updated 9 years ago