Library to populate items using XPath and CSS with a convenient API
☆48Jan 29, 2026Updated 3 months ago
Alternatives and similar repositories for itemloaders
Users that are interested in itemloaders are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Jan 16, 2024Updated 2 years ago
- A library to make it easier to load input URLs to start scrapy processes☆14Feb 21, 2021Updated 5 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schema☆45Mar 26, 2021Updated 5 years ago
- Analyze scraped data☆46Dec 9, 2019Updated 6 years ago
- A pure-Python robots.txt parser with support for modern conventions.☆86Jan 29, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python library of web-related functions☆419Apr 27, 2026Updated last week
- Convert Javascript code to an XML document☆188Mar 14, 2022Updated 4 years ago
- Web scraping Page Objects core library☆105Apr 21, 2026Updated last week
- Page Object pattern for Scrapy☆127Updated this week
- Library for annotation-based dependency injection☆24Mar 3, 2026Updated 2 months ago
- Automatic unit test generation for Scrapy.☆57Jul 12, 2021Updated 4 years ago
- Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python☆298Jan 29, 2026Updated 3 months ago
- A scrapy extension to sync `.scrapy` folder to an S3 bucket☆18Mar 28, 2022Updated 4 years ago
- Scrapinghub Command Line Client☆130Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Web application to help categorize and aggregate subscriptions of media channels for easy access. (working only with Youtube channels at …☆16Aug 27, 2020Updated 5 years ago
- Extract text from HTML☆135Apr 8, 2026Updated 3 weeks ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,324Jan 29, 2026Updated 3 months ago
- Remove DIVs, style stuff and normalize HTML preserving structure information☆14Oct 24, 2025Updated 6 months ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key☆21Feb 8, 2017Updated 9 years ago
- Extract price amount and currency symbol from a raw text string☆345Mar 19, 2026Updated last month
- Parsing JavaScript objects into Python data structures☆218Aug 4, 2025Updated 9 months ago
- Spider templates for automatic crawlers.☆34Mar 26, 2026Updated last month
- The most advanced debugging and testing tool for Scrapy☆16Apr 19, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Example frontera project☆12Aug 10, 2017Updated 8 years ago
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆109May 21, 2024Updated last year
- Google Tink's critical Ed25519 bug related to Java "final" keyword☆11Apr 5, 2020Updated 6 years ago
- Site do PugPE☆16Jul 19, 2023Updated 2 years ago
- Gilfoyle is a report generation tool for Python which makes it quick and easy to create stylish reports or presentations using data.☆27May 13, 2024Updated last year
- Golang Web Service Example using Databento and DuckDB☆20May 22, 2025Updated 11 months ago
- Cookiecutter template for FastAPI + Panel projects in Python☆10Apr 18, 2022Updated 4 years ago
- Default Twisted does not ship with a CONNECT-enabled HTTP(s) proxy. This code provides one.☆51Feb 21, 2017Updated 9 years ago
- A client interface for Scrapinghub's API☆205Apr 7, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scrapy downloader middleware that stores response HTMLs to disk.☆18Apr 14, 2026Updated 3 weeks ago
- Zyte API integration for Scrapy☆40Updated this week
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls☆276Feb 26, 2025Updated last year
- A decorator to write coroutine-like spider callbacks.☆109Dec 26, 2022Updated 3 years ago
- certstream + analytics☆11Jan 17, 2020Updated 6 years ago
- Using mainly CSS animations and a scattering of JavaScript, make any page a winter wonderland of snow (originally built in January 2010)☆11Apr 7, 2016Updated 10 years ago
- A component that tries to avoid downloading duplicate content☆28Apr 8, 2026Updated 3 weeks ago