scrapinghub / scrapy-autounitView external linksLinks
Automatic unit test generation for Scrapy.
☆57Jul 12, 2021Updated 4 years ago
Alternatives and similar repositories for scrapy-autounit
Users that are interested in scrapy-autounit are comparing it to the libraries listed below
Sorting:
- Web scraping Page Objects core library☆104Jan 27, 2026Updated 2 weeks ago
- Page Object pattern for Scrapy☆126Jan 28, 2026Updated 2 weeks ago
- A library to make it easier to load input URLs to start scrapy processes☆14Feb 21, 2021Updated 4 years ago
- Library for annotation-based dependency injection☆24Dec 9, 2025Updated 2 months ago
- A Scrapy extension to log items coverage when the spider shuts down☆19Apr 11, 2020Updated 5 years ago
- Library to populate items using XPath and CSS with a convenient API☆47Jan 29, 2026Updated 2 weeks ago
- Remove DIVs, style stuff and normalize HTML preserving structure information☆14Oct 24, 2025Updated 3 months ago
- Scrapy extension which writes crawled items to Kafka☆30Updated this week
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Jan 16, 2024Updated 2 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40May 21, 2024Updated last year
- Detect and classify pagination links☆15Sep 9, 2020Updated 5 years ago
- A complimentary proxy to help to use SPM with headless browsers☆108May 29, 2023Updated 2 years ago
- Scrapy downloader middleware that stores response HTMLs to disk.☆18Jan 14, 2026Updated last month
- Sentry component for Scrapy☆86Aug 21, 2023Updated 2 years ago
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community.☆556Dec 28, 2022Updated 3 years ago
- Analyze scraped data☆46Dec 9, 2019Updated 6 years ago
- Extract embedded metadata from HTML markup☆946Oct 1, 2025Updated 4 months ago
- Modular, way of implementing rate-limiting in python with a few handy default implementations☆63Mar 27, 2023Updated 2 years ago
- Pyppeteer integration for Scrapy☆58Feb 26, 2021Updated 4 years ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Oct 19, 2019Updated 6 years ago
- Paginating the web☆37Feb 11, 2014Updated 12 years ago
- internal API for call processing☆11Jan 30, 2026Updated 2 weeks ago
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆110May 21, 2024Updated last year
- ☆29Apr 28, 2021Updated 4 years ago
- Convert Javascript code to an XML document☆187Mar 14, 2022Updated 3 years ago
- Common interface for data container classes☆68Jan 8, 2026Updated last month
- CuVS integration for Lucene☆37Jun 17, 2025Updated 7 months ago
- Example project to show how I set up Pytest, Playwright, and Django☆36Mar 21, 2023Updated 2 years ago
- A pure-Python robots.txt parser with support for modern conventions.☆79Jan 29, 2026Updated 2 weeks ago
- More flexible and featured Frontera scheduler for Scrapy☆36Jun 6, 2025Updated 8 months ago
- My main collection, containing all important roles for my daily work.☆11Jan 30, 2026Updated 2 weeks ago
- A collection of github workflow patterns☆10Feb 1, 2024Updated 2 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Dec 17, 2021Updated 4 years ago
- Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.☆76Mar 18, 2022Updated 3 years ago
- Extract price amount and currency symbol from a raw text string☆347Updated this week
- Refactoring and upgrade of AllStarLink's app_rpt, etc.☆11Feb 6, 2026Updated last week
- Scanning in the middlelayer, v2☆10Updated this week
- ☆17Jan 23, 2026Updated 3 weeks ago
- Python clients for Zyte AutoExtract API☆41Jan 17, 2022Updated 4 years ago