stummjr / scrapy-fieldstatsView external linksLinks
A Scrapy extension to log items coverage when the spider shuts down
☆19Apr 11, 2020Updated 5 years ago
Alternatives and similar repositories for scrapy-fieldstats
Users that are interested in scrapy-fieldstats are comparing it to the libraries listed below
Sorting:
- ☆11Jul 6, 2020Updated 5 years ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key☆21Feb 8, 2017Updated 9 years ago
- In this repository, I try to share some of the little tips and tricks and amazing spiders that I used to work with on the scrapy framewor…☆12Feb 2, 2020Updated 6 years ago
- Today I Learnt ...☆16Sep 2, 2025Updated 5 months ago
- Crochet-based blocking API for Scrapy.☆46Feb 24, 2017Updated 8 years ago
- A library to make it easier to load input URLs to start scrapy processes☆14Feb 21, 2021Updated 4 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆57Mar 16, 2022Updated 3 years ago
- ☆18Oct 6, 2025Updated 4 months ago
- JAV site scrapers☆18Jul 6, 2022Updated 3 years ago
- Scrapy project to scrape public web directories (educational)☆22Mar 18, 2017Updated 8 years ago
- A scrapy extension to sync `.scrapy` folder to an S3 bucket☆18Mar 28, 2022Updated 3 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schema☆45Mar 26, 2021Updated 4 years ago
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community.☆556Dec 28, 2022Updated 3 years ago
- A decorator to write coroutine-like spider callbacks.☆109Dec 26, 2022Updated 3 years ago
- Automatic unit test generation for Scrapy.☆57Jul 12, 2021Updated 4 years ago
- A scrapy extension to store requests and responses information in storage service☆27Mar 11, 2022Updated 3 years ago
- A scrapy spider for R18☆16Nov 18, 2025Updated 2 months ago
- Scrapy extension which writes crawled items to Kafka☆30Updated this week
- Pyppeteer integration for Scrapy☆58Feb 26, 2021Updated 4 years ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Oct 19, 2019Updated 6 years ago
- ☆29Apr 28, 2021Updated 4 years ago
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆41May 29, 2017Updated 8 years ago
- Enhanment Scrapping API for six hotel booking website from Expedia.com, Booking.com, Bookhotelbeds.com. Hotels.com, Bestday.com, despegar…☆11May 7, 2018Updated 7 years ago
- A collection of github workflow patterns☆10Feb 1, 2024Updated 2 years ago
- A CLI for benchmarking Scrapy.☆32Jun 28, 2025Updated 7 months ago
- Useful test spiders for Scrapy☆184Jan 20, 2020Updated 6 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40May 21, 2024Updated last year
- ☆12Jul 9, 2020Updated 5 years ago
- A generic crawler☆78Updated this week
- ☆10Aug 19, 2022Updated 3 years ago
- Scrapy Extension for monitoring spiders execution.☆553Updated this week
- Analyzes target website for anti-scraping protections and performance. Saves screenshots/HTML snapshots.☆11Aug 13, 2025Updated 6 months ago
- Use Googlemap on your Django app☆18Aug 17, 2012Updated 13 years ago
- Starlette / Zeit Now app for converting HEIC to JPEG☆14Apr 28, 2020Updated 5 years ago
- Python-based high-performance web tools - see http://chris.improbable.org/2010/01/30/quickly-testing-your-sites-using-webtoolbox/ for an …☆21Oct 24, 2017Updated 8 years ago
- A Feedly styled RSS reader with TT-RSS functionality.☆11Mar 22, 2019Updated 6 years ago
- Will send the same request to one or more sources to exchange cost for reduced latency for inference☆11Dec 17, 2024Updated last year
- Web page preview and analysis tool☆12Jan 11, 2023Updated 3 years ago
- An airflow deployment configuration with sane defaults☆10Jun 6, 2019Updated 6 years ago