scrapy/itemloaders

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scrapy/itemloaders)

scrapy / itemloaders

Library to populate items using XPath and CSS with a convenient API

☆49

Alternatives and similar repositories for itemloaders

Users that are interested in itemloaders are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

scrapy / itemadapter
View on GitHub
Common interface for data container classes
☆70Jul 12, 2026Updated last week
scrapinghub / scrapy-autoextract
View on GitHub
Zyte Automatic Extraction integration for Scrapy
☆58Apr 13, 2026Updated 3 months ago
realslimshanky / Spider-Sense
View on GitHub
A browser extension to monitor your spiders deployed on Scrapy Cloud.
☆16Mar 8, 2025Updated last year
ejulio / spider-feeder
View on GitHub
A library to make it easier to load input URLs to start scrapy processes
☆14Feb 21, 2021Updated 5 years ago
scrapy-plugins / scrapy-jsonschema
View on GitHub
Scrapy schema validation pipeline and Item builder using JSON Schema
☆45Mar 26, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
scrapinghub / arche
View on GitHub
Analyze scraped data
☆47Dec 9, 2019Updated 6 years ago
scrapy / w3lib
View on GitHub
Python library of web-related functions
☆419Updated this week
scrapy / protego
View on GitHub
A pure-Python robots.txt parser with support for modern conventions.
☆90Updated this week
scrapinghub / js2xml
View on GitHub
Convert Javascript code to an XML document
☆188Mar 14, 2022Updated 4 years ago
scrapinghub / web-poet
View on GitHub
Web scraping Page Objects core library
☆107Jul 10, 2026Updated 2 weeks ago
scrapinghub / scrapy-poet
View on GitHub
Page Object pattern for Scrapy
☆127Jun 8, 2026Updated last month
croqaz / awesome-scrapy
View on GitHub
🕶 Awesome list of Scrapy tools and libraries
☆60Jul 6, 2020Updated 6 years ago
scrapinghub / andi
View on GitHub
Library for annotation-based dependency injection
☆24Updated this week
scrapinghub / number-parser
View on GitHub
Parse numbers written in natural language
☆130Oct 23, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
scrapinghub / scrapy-autounit
View on GitHub
Automatic unit test generation for Scrapy.
☆58Jul 12, 2021Updated 5 years ago
zytedata / zyte-autoextract
View on GitHub
Python clients for Zyte AutoExtract API
☆41Jan 17, 2022Updated 4 years ago
scrapy / queuelib
View on GitHub
Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python
☆299Jun 26, 2026Updated 3 weeks ago
zytedata / python-zyte-api
View on GitHub
Python client for Zyte API
☆30Updated this week
scrapinghub / shub
View on GitHub
Scrapinghub Command Line Client
☆129Updated this week
rennerocha / mediafeed
View on GitHub
Web application to help categorize and aggregate subscriptions of media channels for easy access. (working only with Youtube channels at …
☆16Aug 27, 2020Updated 5 years ago
scrapy / parsel
View on GitHub
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
☆1,344Jul 16, 2026Updated last week
zytedata / clear-html
View on GitHub
Remove DIVs, style stuff and normalize HTML preserving structure information
☆14Oct 24, 2025Updated 9 months ago
osantana / development-guidelines
View on GitHub
Guidelines for Software Development Projects
☆22Apr 1, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
scrapinghub / spidermon
View on GitHub
Scrapy Extension for monitoring spiders execution.
☆561May 28, 2026Updated last month
scrapinghub / price-parser
View on GitHub
Extract price amount and currency symbol from a raw text string
☆346Mar 19, 2026Updated 4 months ago
scrapinghub / shub-workflow
View on GitHub
☆14Jul 16, 2026Updated last week
scrapy-plugins / scrapy-splitvariants
View on GitHub
Scrapy spider middleware to split an item into multiple items using a multi-valued key
☆21Feb 8, 2017Updated 9 years ago
scrapinghub / product-extraction-benchmark
View on GitHub
☆16Apr 10, 2026Updated 3 months ago
Nykakin / chompjs
View on GitHub
Parsing JavaScript objects into Python data structures
☆222May 17, 2026Updated 2 months ago
zytedata / zyte-spider-templates
View on GitHub
Spider templates for automatic crawlers.
☆35Mar 26, 2026Updated 3 months ago
colmex / frontera_example
View on GitHub
Example frontera project
☆12Aug 10, 2017Updated 8 years ago
scrapy / cssselect
View on GitHub
CSS Selectors for Python
☆309Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
further-reading / scrapy-gui
View on GitHub
A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.
☆109May 21, 2024Updated 2 years ago
cryptosubtlety / final-security-bug
View on GitHub
Google Tink's critical Ed25519 bug related to Java "final" keyword
☆11Apr 5, 2020Updated 6 years ago
steves-dev / cookiecutter-fastapi-panel-python
View on GitHub
Cookiecutter template for FastAPI + Panel projects in Python
☆10Apr 18, 2022Updated 4 years ago
mgedmin / bootable-iso
View on GitHub
Bootable USB disk that lets you choose an ISO image
☆16Oct 19, 2020Updated 5 years ago
scrapinghub / webstruct
View on GitHub
NER toolkit for HTML data
☆259May 3, 2024Updated 2 years ago
scrapy-plugins / scrapy-zyte-api
View on GitHub
Zyte API integration for Scrapy
☆43Updated this week
scrapinghub / python-scrapinghub
View on GitHub
A client interface for Scrapinghub's API
☆206Updated this week