scrapy/itemadapter

Common interface for data container classes

☆70

Alternatives and similar repositories for itemadapter

Users who are interested in itemadapter are comparing it to the libraries listed below.

scrapy / xtractmime
https://mimesniff.spec.whatwg.org/ implementation for Python
☆13 · Jul 9, 2026 · Updated 2 weeks ago
scrapy / itemloaders
Library to populate items using XPath and CSS with a convenient API
☆49 · Updated this week
scrapy / protego
A pure-Python robots.txt parser with support for modern conventions.
☆90 · Updated this week
zytedata / python-zyte-api
Python client for Zyte API
☆30 · Updated this week
scrapinghub / scrapy-poet
Page Object pattern for Scrapy
☆127 · Jun 8, 2026 · Updated last month
ejulio / spider-feeder
A library to make it easier to load input URLs to start scrapy processes
☆14 · Feb 21, 2021 · Updated 5 years ago
zytedata / zyte-autoextract
Python clients for Zyte AutoExtract API
☆41 · Jan 17, 2022 · Updated 4 years ago
zytedata / zyte-spider-templates
Spider templates for automatic crawlers.
☆35 · Mar 26, 2026 · Updated 3 months ago
TeamHG-Memex / html-text
Extract text from HTML
☆135 · Apr 8, 2026 · Updated 3 months ago
zytedata / zyte-spider-templates-project
☆23 · Mar 18, 2026 · Updated 4 months ago
TeamHG-Memex / extract-html-diff
extract difference between two html pages
☆33 · Apr 8, 2026 · Updated 3 months ago
scrapinghub / web-poet
Web scraping Page Objects core library
☆107 · Jul 10, 2026 · Updated 2 weeks ago
scrapy / parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
☆1,344 · Jul 16, 2026 · Updated last week
scrapinghub / andi
Library for annotation-based dependency injection
☆24 · Updated this week
scrapy-plugins / scrapy-zyte-smartproxy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
☆363 · May 4, 2026 · Updated 2 months ago
zytedata / clear-html
Remove DIVs, style stuff and normalize HTML preserving structure information
☆14 · Oct 24, 2025 · Updated 9 months ago
scrapy / cssselect
CSS Selectors for Python
☆309 · Updated this week
scrapy-plugins / scrapy-dotpersistence
A scrapy extension to sync `.scrapy` folder to an S3 bucket
☆18 · Mar 28, 2022 · Updated 4 years ago
stummjr / scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
☆18 · Apr 11, 2020 · Updated 6 years ago
scrapinghub / shub-workflow
☆14 · Jul 16, 2026 · Updated last week
scrapy-plugins / scrapy-jsonschema
Scrapy schema validation pipeline and Item builder using JSON Schema
☆45 · Mar 26, 2021 · Updated 5 years ago
further-reading / scrapy-gui
A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.
☆109 · May 21, 2024 · Updated 2 years ago
adipasquale / techcrunch-incremental-scrapy-spider-with-mongodb
Techcrunch Incremental Scrapy Spider With MongoDB
☆16 · Dec 25, 2018 · Updated 7 years ago
seomoz / rep-cpp
Robot exclusion protocol in C++
☆11 · Jul 26, 2024 · Updated last year
tgen / jetstream
Workflow management system written as a pure Python package and command-line utility. It supports complex workflows modeled as directed- …
☆18 · Mar 20, 2026 · Updated 4 months ago
scrapy / queuelib
Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python
☆299 · Jun 26, 2026 · Updated 3 weeks ago
scrapinghub / shublang
Pluggable DSL that uses pipes to perform a series of linear transformations to extract data
☆16 · Jul 9, 2024 · Updated 2 years ago
scrapy-plugins / scrapy-playwright
🎭 Playwright integration for Scrapy
☆1,434 · Updated this week
simonw / datasette-export-notebook
Datasette plugin providing instructions for exporting data to Jupyter or Observable
☆13 · Sep 15, 2023 · Updated 2 years ago
rmax / scrapy-inline-requests
A decorator to write coroutine-like spider callbacks.
☆109 · Dec 26, 2022 · Updated 3 years ago
ThetaTau / CMT
App for the Theta Tau Chapter Management Tool
☆10 · Updated this week
realslimshanky / Spider-Sense
A browser extension to monitor your spiders deployed on Scrapy Cloud.
☆16 · Mar 8, 2025 · Updated last year
scrapinghub / scrapy-autounit
Automatic unit test generation for Scrapy.
☆58 · Jul 12, 2021 · Updated 5 years ago
rochacbruno / my-awesome-stars
☆18 · Updated this week
jayhogan / clicklist-client
Client for Kroger Clicklist API
☆10 · Mar 17, 2017 · Updated 9 years ago
rclement / datasette-ml
A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models
☆17 · Updated this week
scrapy / scrapy-lint
A linter for Scrapy projects.
☆22 · Jul 7, 2026 · Updated 2 weeks ago
rajivsarvepalli / mock-alchemy
SQLAlchemy mock helpers.
☆80 · Nov 1, 2023 · Updated 2 years ago
scrapy-plugins / scrapy-magicfields
Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.
☆56 · Mar 16, 2022 · Updated 4 years ago