spicyparrot / kafka_scrapy_connect
A custom library that integrates Scrapy with Kafka.
☆11Updated 7 months ago
Alternatives and similar repositories for kafka_scrapy_connect:
Users that are interested in kafka_scrapy_connect are comparing it to the libraries listed below
- Implement scrapy with asyncio☆61Updated 5 months ago
- This repo contains a full-fledged Python-based script that scrapes a JavaScript-rendered website, cleans the data, and pushes the results…☆13Updated 2 years ago
- Scrapy download handler that can impersonate browser' TLS signatures or JA3 fingerprints.☆141Updated 2 weeks ago
- Python port of Xetera/ghost-cursor, for use with Pyppeteer and Playwright.☆66Updated 2 years ago
- Nodriver integration for Scrapy☆15Updated 3 months ago
- Super Fast, Super Anti-Detect, and Super Intuitive Web Driver☆55Updated 3 weeks ago
- A blazing-fast Python HTTP Client with TLS fingerprint☆230Updated this week
- ☆65Updated last year
- Zyte API integration for Scrapy☆38Updated 2 weeks ago
- Page Object pattern for Scrapy☆121Updated last month
- awsome scrapy utils☆56Updated 11 months ago
- aiohttp-like interface to chromium. based on selenium_driverless to bypass cloudflare☆53Updated 4 months ago
- ☆48Updated 10 months ago
- 爬虫实战项目☆19Updated 2 years ago
- A drop-in replacement for playwright-python patched with rebrowser-patches. It allows to pass modern automation detection tests.☆60Updated 3 months ago
- playwright stealth☆72Updated 4 months ago
- Perfect breakthrough JA3 detection☆32Updated 5 months ago
- Easily implement distributed asynchronous tasks in one step.仅需一步,轻松实现分布式异步任务。☆22Updated last month
- A fork of https://github.com/AtuboDad/playwright_stealth☆77Updated last week
- Scrapy stats exporter for prometheus☆19Updated 5 months ago
- This repository serves as a comprehensive resource for my studies of akamai solutions.☆23Updated 4 months ago
- Browser fingerprint data generator☆43Updated last week
- The Web Scraping Club Free Repository☆137Updated 5 months ago
- Scrapy project template. Use it to quickly spin up a new web scraping project☆17Updated 4 months ago
- Undetected Python version of the Playwright testing and automation library.☆28Updated 5 months ago
- Solve amazon flex captcha with python / js☆6Updated last year
- ☆13Updated 3 years ago
- Python client and types generator for the Chrome DevTools Protocol (CDP)☆70Updated 3 weeks ago
- REST API provides programmatic access to GoLogin App. Create a new browser profile, get a list of all browser profiles, add a browser pr…☆73Updated 2 months ago
- ☆58Updated last year