orangain / scrapy-s3pipelineView external linksLinks
Scrapy pipeline to store chunked items into Amazon S3 or Google Cloud Storage bucket.
☆76Mar 18, 2022Updated 3 years ago
Alternatives and similar repositories for scrapy-s3pipeline
Users that are interested in scrapy-s3pipeline are comparing it to the libraries listed below
Sorting:
- A library to make it easier to load input URLs to start scrapy processes☆14Feb 21, 2021Updated 4 years ago
- Library to populate items using XPath and CSS with a convenient API☆47Jan 29, 2026Updated 2 weeks ago
- Web scraping Page Objects core library☆104Jan 27, 2026Updated 3 weeks ago
- Scrapy schema validation pipeline and Item builder using JSON Schema☆45Mar 26, 2021Updated 4 years ago
- Docker container running scrapyd with HTTP authentication☆41May 14, 2024Updated last year
- ☆13Mar 13, 2016Updated 9 years ago
- Page Object pattern for Scrapy☆126Jan 28, 2026Updated 2 weeks ago
- Bridge between Mattermost and various services using the Openresty platform☆13Aug 28, 2017Updated 8 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Jan 16, 2024Updated 2 years ago
- Extensions for using Scrapy on Amazon AWS☆32Dec 5, 2012Updated 13 years ago
- ☆16Apr 24, 2024Updated last year
- More flexible and featured Frontera scheduler for Scrapy☆36Jun 6, 2025Updated 8 months ago
- Bootable USB disk that lets you choose an ISO image☆16Oct 19, 2020Updated 5 years ago
- Custom email-based reports for any Django project☆29Oct 23, 2011Updated 14 years ago
- ☆18Oct 6, 2025Updated 4 months ago
- Python client for Zyte API☆28Feb 10, 2026Updated last week
- Library designed to replace the SQLite backend by a MongoDB backend on Scrapy queue management☆17Sep 2, 2017Updated 8 years ago
- Scrapy middleware which allows to crawl only new content☆79Feb 10, 2026Updated last week
- Scrapinghub Command Line Client☆131Nov 6, 2025Updated 3 months ago
- Run a Scrapy spider programmatically from a script or a Celery task - no project required.☆121Jun 4, 2024Updated last year
- Scrapy Extension for monitoring spiders execution.☆553Updated this week
- A pyVows extension for testing Django applications.☆32Jul 6, 2022Updated 3 years ago
- A simple FastAPI server to manage workspaces and multi-user sharing within MinIO.