zytedata / python-zyte-api
Python client for Zyte API
☆24Updated last month
Alternatives and similar repositories for python-zyte-api:
Users that are interested in python-zyte-api are comparing it to the libraries listed below
- ☆19Updated last month
- Spider templates for automatic crawlers.☆28Updated last week
- Web scraping Page Objects core library☆97Updated last month
- Common interface for data container classes☆67Updated last month
- https://mimesniff.spec.whatwg.org/ implementation for Python☆13Updated last year
- ipython + REPL + coroutines - suffering☆18Updated 7 months ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographica…☆22Updated 6 months ago
- Python wrapper for the Lago Rest API☆23Updated 3 weeks ago
- A Python library for finding feed links on websites.☆52Updated 2 years ago
- pai: A Python REPL with a built in AI agent☆40Updated last year
- Pluggable DSL that uses pipes to perform a series of linear transformations to extract data☆16Updated 8 months ago
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- Library to populate items using XPath and CSS with a convenient API☆47Updated 2 weeks ago
- Python clients for Zyte AutoExtract API☆40Updated 3 years ago
- A Python client for the People Data Labs API☆31Updated 2 weeks ago
- Caching and distributed locks in your applications with just one or two lines. Easy to learn. Fast to code.☆33Updated 2 weeks ago
- A pure-Python robots.txt parser with support for modern conventions.☆61Updated 2 weeks ago
- Detect and classify pagination links☆15Updated 4 years ago
- Pytest plugin that runs PyStack on slow or hanging tests.☆16Updated 4 months ago
- A simple and streamlined Python script to extract and filter links from a remote HTML resource.☆24Updated 2 months ago
- Page Object pattern for Scrapy☆120Updated last month
- Detect and classify pagination links☆102Updated 4 years ago
- Extract text from HTML☆134Updated 4 years ago
- Scrapy downloader middleware that stores response HTMLs to disk.☆18Updated 10 months ago
- Versatile Metrics Collection for Python☆18Updated last year
- Bringing semantic search to Django. Integrates seemlessly with Django ORM.☆32Updated 5 months ago
- Use XML tags for long context prompting using Claude's multi-document structure.☆23Updated 5 months ago
- Apify API client for Python☆58Updated this week