Library to populate items using XPath and CSS with a convenient API
☆49 · Jan 29, 2026 · Updated 5 months ago
Alternatives and similar repositories for itemloaders
Users that are interested in itemloaders are comparing it to the libraries listed below.
- Zyte Automatic Extraction integration for Scrapy ☆58 · Apr 13, 2026 · Updated 2 months ago
- A browser extension to monitor your spiders deployed on Scrapy Cloud. ☆16 · Mar 8, 2025 · Updated last year
- Scrapy schema validation pipeline and Item builder using JSON Schema ☆45 · Mar 26, 2021 · Updated 5 years ago
- Analyze scraped data ☆47 · Dec 9, 2019 · Updated 6 years ago
- A pure-Python robots.txt parser with support for modern conventions. ☆89 · Jun 25, 2026 · Updated last week
- Python library of web-related functions ☆420 · Jun 22, 2026 · Updated last week
- Convert JavaScript code to an XML document ☆188 · Mar 14, 2022 · Updated 4 years ago
- Web scraping Page Objects core library ☆107 · Jun 22, 2026 · Updated last week
- Page Object pattern for Scrapy ☆127 · Jun 8, 2026 · Updated 3 weeks ago
- Automatic unit test generation for Scrapy. ☆58 · Jul 12, 2021 · Updated 4 years ago
- Python clients for Zyte AutoExtract API ☆41 · Jan 17, 2022 · Updated 4 years ago
- Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python ☆299 · Jun 26, 2026 · Updated last week
- A Scrapy extension to sync `.scrapy` folder to an S3 bucket ☆18 · Mar 28, 2022 · Updated 4 years ago
- Scrapinghub Command Line Client ☆130 · Jun 26, 2026 · Updated last week
- Web application to help categorize and aggregate subscriptions of media channels for easy access. (working only with YouTube channels at … ☆16 · Aug 27, 2020 · Updated 5 years ago
- Extract text from HTML ☆135 · Apr 8, 2026 · Updated 2 months ago
- A linter for Scrapy projects. ☆22 · Feb 25, 2026 · Updated 4 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,339 · Jun 25, 2026 · Updated last week
- Remove DIVs, style stuff and normalize HTML preserving structure information ☆14 · Oct 24, 2025 · Updated 8 months ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key ☆21 · Feb 8, 2017 · Updated 9 years ago
- Scrapy Extension for monitoring spiders execution. ☆560 · May 28, 2026 · Updated last month
- Extract price amount and currency symbol from a raw text string ☆345 · Mar 19, 2026 · Updated 3 months ago
- QMPDClient official repository ☆38 · Nov 18, 2015 · Updated 10 years ago
- Performance-focused replacement for Python urllib ☆21 · Apr 13, 2026 · Updated 2 months ago
- Spider templates for automatic crawlers. ☆35 · Mar 26, 2026 · Updated 3 months ago
- The most advanced debugging and testing tool for Scrapy ☆16 · Apr 19, 2023 · Updated 3 years ago
- Example frontera project ☆12 · Aug 10, 2017 · Updated 8 years ago
- A simple, Qt-Webengine powered web browser with built-in functionality for basic Scrapy webscraping support. ☆109 · May 21, 2024 · Updated 2 years ago
- Google Tink's critical Ed25519 bug related to Java "final" keyword ☆11 · Apr 5, 2020 · Updated 6 years ago
- PugPE website ☆16 · Jul 19, 2023 · Updated 2 years ago
- A tool to generate fixed-width CNAB240 files to perform bulk payments ☆21 · Jul 1, 2022 · Updated 4 years ago
- NER toolkit for HTML data ☆259 · May 3, 2024 · Updated 2 years ago
- Bootable USB disk that lets you choose an ISO image ☆16 · Oct 19, 2020 · Updated 5 years ago
- A client interface for Scrapinghub's API ☆206 · Jun 24, 2026 · Updated last week
- Scrapy downloader middleware that stores response HTMLs to disk. ☆18 · Apr 14, 2026 · Updated 2 months ago
- A Prometheus exporter for the LYWSD03MMC BLE thermometer ☆16 · Dec 11, 2023 · Updated 2 years ago
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls ☆276 · Feb 26, 2025 · Updated last year
- A decorator to write coroutine-like spider callbacks. ☆109 · Dec 26, 2022 · Updated 3 years ago
- Python client for Zyte API ☆30 · Jun 17, 2026 · Updated 2 weeks ago