Zyte Automatic Extraction integration for Scrapy
☆58Apr 13, 2026Updated last month
Alternatives and similar repositories for scrapy-autoextract
Users who are interested in scrapy-autoextract are comparing it to the libraries listed below.
- Python clients for Zyte AutoExtract API☆41Jan 17, 2022Updated 4 years ago
- A library to make it easier to load input URLs to start scrapy processes☆14Feb 21, 2021Updated 5 years ago
- Page Object pattern for Scrapy☆127May 15, 2026Updated 2 weeks ago
- Analyze scraped data☆47Dec 9, 2019Updated 6 years ago
- Scrapy Extension for monitoring spiders execution.☆558Updated this week
- Examples inspired by book Python For Finance☆12Jan 20, 2021Updated 5 years ago
- A scrapy extension to sync `.scrapy` folder to an S3 bucket☆18Mar 28, 2022Updated 4 years ago
- Web scraping Page Objects core library☆107May 5, 2026Updated 3 weeks ago
- A complementary proxy to help use SPM with headless browsers☆108May 20, 2026Updated last week
- A linter for Scrapy projects.☆22Feb 25, 2026Updated 3 months ago
- A Python implementation of DEPTA☆83Jan 14, 2017Updated 9 years ago
- HTTP API for Scrapy spiders☆880Mar 20, 2026Updated 2 months ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Jan 16, 2024Updated 2 years ago
- Generate OpenAPI 3.x.x using Pydantic☆11Feb 9, 2023Updated 3 years ago
- Golang Web Service Example using Databento and DuckDB☆20May 22, 2025Updated last year
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆18May 23, 2023Updated 3 years ago
- A decorator to write coroutine-like spider callbacks.☆109Dec 26, 2022Updated 3 years ago
- Extract text from HTML☆135Apr 8, 2026Updated last month
- An MCP server that provides real-time data and insights from the Hyperliquid perp DEX for use in bots, dashboards, and analytics.☆28May 31, 2025Updated 11 months ago
- Template for creating a Python Flask/Dash/Plotly single-page-application (SPA) for interactively charting IoT-type time series data store…☆23Nov 12, 2020Updated 5 years ago
- A scalable frontier for web crawlers☆1,328Jun 6, 2025Updated 11 months ago
- Machine Learning in Asset Management☆20Jul 18, 2019Updated 6 years ago
- Answers to the questions at the back of the chapters of Advances in Financial Machine Learning.☆22Apr 11, 2020Updated 6 years ago
- Python library of web-related functions☆419May 20, 2026Updated last week
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.☆109May 21, 2024Updated 2 years ago
- A simple CHIP8 interpreter made with Rust.☆11Apr 23, 2026Updated last month
- Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy☆365May 4, 2026Updated 3 weeks ago
- ☆16Apr 10, 2026Updated last month
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,329Jan 29, 2026Updated 4 months ago
- Personal Knowledge Management System. Capture your ideas using plain old text files. Make a journal that lasts 100 years.☆29Nov 8, 2023Updated 2 years ago
- Web application to help categorize and aggregate subscriptions of media channels for easy access. (working only with Youtube channels at …☆16Aug 27, 2020Updated 5 years ago
- Convert Javascript code to an XML document☆188Mar 14, 2022Updated 4 years ago
- Spider templates for automatic crawlers.☆34Mar 26, 2026Updated 2 months ago
- Provides transparent listbox controls for AHK GUIs.☆15Jan 17, 2015Updated 11 years ago
- Movie evaluation and recommendation system☆17Jul 31, 2016Updated 9 years ago
- A basic python based tool for domain ℹ️ information gathering. I am working 💻 on collecting information related to domain whois, history…☆13Jan 11, 2026Updated 4 months ago
- A modern TypeScript/JavaScript library for interacting with the Asterisk REST Interface (ARI), offering robust WebSocket support for real…☆16Mar 4, 2026Updated 2 months ago
- Scrapy extension to control spiders using JSON-RPC☆300Aug 26, 2019Updated 6 years ago
- A fast Scala generic library based on code generation.☆26Dec 3, 2022Updated 3 years ago