scrapinghub / scrapy-autoextract
Zyte Automatic Extraction integration for Scrapy
☆56Feb 4, 2022Updated 4 years ago
Alternatives and similar repositories for scrapy-autoextract
Users interested in scrapy-autoextract are comparing it to the libraries listed below.
- Python clients for Zyte AutoExtract API☆41Jan 17, 2022Updated 4 years ago
- Library to populate items using XPath and CSS with a convenient API☆47Jan 29, 2026Updated 2 weeks ago
- Page Object pattern for Scrapy☆126Jan 28, 2026Updated 2 weeks ago
- A library to make it easier to load input URLs to start scrapy processes☆14Feb 21, 2021Updated 4 years ago
- Analyze scraped data☆46Dec 9, 2019Updated 6 years ago
- Scrapy extension for monitoring spider execution.☆553Updated this week
- ☆15Jan 21, 2026Updated 3 weeks ago
- Boilerplate for any django projects with HTML, CSS, Bootstrap.☆13Updated this week
- A complementary proxy to help use SPM with headless browsers☆108May 29, 2023Updated 2 years ago
- Pluggable DSL that uses pipes to perform a series of linear transformations to extract data☆16Jul 9, 2024Updated last year
- Web scraping Page Objects core library☆104Jan 27, 2026Updated 2 weeks ago
- HTTP API for Scrapy spiders☆879Updated this week
- ☆23Jan 4, 2017Updated 9 years ago
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆18May 23, 2023Updated 2 years ago
- ☆146Nov 6, 2023Updated 2 years ago
- A linter for Scrapy projects.☆21Jan 27, 2026Updated 2 weeks ago
- A Python library to interact with ScrapingBee's API for headless browsers and proxy rotation☆29Feb 5, 2026Updated last week
- A client interface for Scrapinghub's API☆204Oct 3, 2025Updated 4 months ago
- FE-511 Bloomberg Terminal and Thomson Reuters☆11Feb 8, 2017Updated 9 years ago
- Code base for the practitioner's guide to the ONC algorithm paper published with the Journal of Financial Data Science☆20Jun 8, 2023Updated 2 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆80Sep 18, 2024Updated last year
- Spider templates for automatic crawlers.☆34Jan 8, 2026Updated last month
- Client SDK for Vision Azure☆10Nov 2, 2018Updated 7 years ago
- NSF EarthCube CyberConnector☆13Dec 16, 2022Updated 3 years ago
- Free https proxy list☆86Oct 19, 2025Updated 3 months ago
- ☆11Apr 24, 2023Updated 2 years ago
- VTEX API Wrapper for Python☆14May 21, 2024Updated last year
- ☆10Jun 10, 2022Updated 3 years ago
- Another reverse proxy that provides authentication with OpenID Connect☆10Jul 10, 2023Updated 2 years ago
- Simple tool for recursively scraping/validating emails and phone numbers from web pages.☆11Oct 1, 2021Updated 4 years ago
- Portfolio with data science and machine learning projects I developed during my training in data science.☆10Jan 4, 2021Updated 5 years ago
- This repo contains many Python scripts, games, and projects.☆10Feb 15, 2023Updated 2 years ago
- Django based microservice architecture with oauth2 🔋🌟☆11Sep 19, 2024Updated last year
- Kafka Manager Dockerfile☆11Nov 22, 2017Updated 8 years ago
- MIP21 example☆15Jun 20, 2022Updated 3 years ago
- ☆10Sep 30, 2022Updated 3 years ago
- E-commerce web application written in Django with payment integration and asynchronous task processing using Celery, Flower, etc.☆11Jul 23, 2019Updated 6 years ago
- A Kong plugin that allows access to an upstream url through a forward proxy (eg. squid).☆12Apr 30, 2018Updated 7 years ago
- This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.☆1,230Nov 7, 2023Updated 2 years ago