wind2sing / aCrawlerLinks
π A powerful web-crawling framework, based on aiohttp.
β15Updated 6 years ago
Alternatives and similar repositories for aCrawler
Users that are interested in aCrawler are comparing it to the libraries listed below
Sorting:
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmiβ¦β24Updated 11 months ago
- Pyppeteer integration for Scrapyβ58Updated 4 years ago
- A Ruia plugin for loading javascript - pyppeteerβ18Updated 3 years ago
- A complimentary proxy to help to use SPM with headless browsersβ108Updated 2 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.β23Updated 8 months ago
- Scrapy + Puppeteerβ110Updated 4 years ago
- Use pyppeteer from a Scrapy spiderβ59Updated 6 years ago
- A RabbitMQ Scheduler for Scrapyβ87Updated 3 years ago
- Mobilenium allows you to use Selenium and have access to status codes and HTTP headers, without the need for manual labor.β20Updated 6 years ago
- tus.io protocol implementation for aiohttp.web applicationsβ17Updated 3 years ago
- A scrapy extension to sync `.scrapy` folder to an S3 bucketβ18Updated 3 years ago
- minimalist event system for Pythonβ85Updated 6 years ago
- Extract text from HTMLβ134Updated 2 weeks ago
- Django based application that allows creating, deploying and running Scrapy spiders in a distributed mannerβ112Updated 7 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.β193Updated 3 years ago
- Bounded Process&Thread Pool Executorβ63Updated last year
- Simple image classifier microservice using tensorflow and sanicβ25Updated 7 years ago
- Crochet-based blocking API for Scrapy.β46Updated 8 years ago
- Python library for modern thread / multiprocessing pooling and task processing via asyncioβ15Updated 5 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schemaβ45Updated 4 years ago
- Asyncio web crawling framework. Work in progress.β19Updated last year
- Simple Web UI for Scrapy spider management via Scrapydβ50Updated 7 years ago
- Easily document your Sanic API with Swagger UI, Plus param validation and model serialization.β47Updated 4 years ago
- Python package for HTTP/1.1 style headers. Parse headers to objects. Most advanced available structure for http headers.β122Updated last month
- A session-management extension for Scrapy.β10Updated 2 years ago
- simple motor wrapper for sanicβ57Updated 3 years ago
- Collection of persistent (disk-based) and non-persistent (memory-based) queues for Pythonβ291Updated last week
- python 3.7 asyncio tutorial.β14Updated 6 years ago
- A Requests-compatible interface for PycURL.β71Updated 4 months ago
- Combine XPath, CSS Selectors and JSONPath for Web data extracting.β29Updated last year