πΆ Awesome list of Scrapy tools and libraries
β61Jul 6, 2020Updated 5 years ago
Alternatives and similar repositories for awesome-scrapy
Users that are interested in awesome-scrapy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library to make it easier to load input URLs to start scrapy processesβ14Feb 21, 2021Updated 5 years ago
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community.β561Dec 28, 2022Updated 3 years ago
- Library to populate items using XPath and CSS with a convenient APIβ48Jan 29, 2026Updated 4 months ago
- Python clients for Zyte AutoExtract APIβ41Jan 17, 2022Updated 4 years ago
- An MCP server that provides real-time data and insights from the Hyperliquid perp DEX for use in bots, dashboards, and analytics.β28May 31, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- GridSound wants to be a free browser-based HTML5 DAW (Digital Audio Workstation) following the new Web Audio API. You can test the applicβ¦β12Dec 9, 2018Updated 7 years ago
- Web scraping Page Objects core libraryβ107May 5, 2026Updated 3 weeks ago
- β14Apr 22, 2026Updated last month
- A browser extension to monitor your spiders deployed on Scrapy Cloud.β16Mar 8, 2025Updated last year
- Library for annotation-based dependency injectionβ24Mar 3, 2026Updated 2 months ago
- A Scrapy extension to log items coverage when the spider shuts downβ19Apr 11, 2020Updated 6 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schemaβ45Mar 26, 2021Updated 5 years ago
- Automatic unit test generation for Scrapy.β58Jul 12, 2021Updated 4 years ago
- MCP Hyperliquid (https://app.hyperliquid.xyz) serverβ44Mar 6, 2025Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python interface to Digital Oceanβ25Jun 11, 2015Updated 10 years ago
- Zyte Automatic Extraction integration for Scrapyβ58Apr 13, 2026Updated last month
- Page Object pattern for Scrapyβ127May 15, 2026Updated 2 weeks ago
- humanreadable is a Python library to convert human-readable values to other units.β21May 9, 2026Updated 3 weeks ago
- Creates a pipeline Airflow and Scrapy to output an average image composition of everyone's face in a given websiteβ43Oct 13, 2017Updated 8 years ago
- Web scraping with Selenium Webdriver and MongoDB, deployed to Herokuβ12Dec 8, 2022Updated 3 years ago
- Rotina em python para obtenΓ§Γ£o de dados dos hospitais brasileiros via Cadastro Nacional de Estabelecimentos de SaΓΊde (CNES)β18Jan 3, 2024Updated 2 years ago
- Convert Javascript code to an XML documentβ188Mar 14, 2022Updated 4 years ago
- Common interface for data container classesβ69May 6, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Detect and classify pagination linksβ107Apr 8, 2026Updated last month
- Extract price amount and currency symbol from a raw text stringβ344Mar 19, 2026Updated 2 months ago
- Scrapy entrypoint for Scrapinghub job runnerβ24Feb 26, 2026Updated 3 months ago
- More flexible and featured Frontera scheduler for Scrapyβ36Jun 6, 2025Updated 11 months ago
- Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.β¦β3,403Feb 19, 2025Updated last year
- A template for standard Maltego transformationβ14Dec 8, 2021Updated 4 years ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued keyβ21Feb 8, 2017Updated 9 years ago
- Scrapy project template. Use it to quickly spin up a new web scraping projectβ16Nov 18, 2024Updated last year
- Podclips is an iOS app that allows users to cut out and share clips from their favourite podcastsβ15Mar 25, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- OnionSprout is a tool to run publicaly-accessible web services, for example from Raspberry Pi in your home, without a public IP.β11Oct 3, 2020Updated 5 years ago
- Unofficial API to fetch bin details from bins.wsβ15Jul 9, 2023Updated 2 years ago
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.β109May 21, 2024Updated 2 years ago
- scrapydweb, dockerfileβ13Feb 1, 2021Updated 5 years ago
- A linter for Scrapy projects.β22Feb 25, 2026Updated 3 months ago
- Aplikasi transparansi penyaluran dan realisasi dana desaβ13Dec 9, 2015Updated 10 years ago
- A decorator to write coroutine-like spider callbacks.β109Dec 26, 2022Updated 3 years ago