wind2sing / aCrawler
A powerful web-crawling framework, based on aiohttp.
★15 · Updated 5 years ago
Alternatives and similar repositories for aCrawler:
Users that are interested in aCrawler are comparing it to the libraries listed below
- Use pyppeteer from a Scrapy spider ★60 · Updated 5 years ago
- python 3.7 asyncio tutorial. ★14 · Updated 5 years ago
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmi… ★24 · Updated last week
- Scrapy + Puppeteer ★111 · Updated 3 years ago
- A session-management extension for Scrapy. ★11 · Updated last year
- all kinds of scrapy demo ★164 · Updated 2 years ago
- Mobilenium allows you to use Selenium and have access to status codes and HTTP headers, without the need for manual labor. ★20 · Updated 5 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON. ★21 · Updated 4 years ago
- Django based application that allows creating, deploying and running Scrapy spiders in a distributed manner ★113 · Updated 6 years ago
- Simple Web UI for Scrapy spider management via Scrapyd ★51 · Updated 6 years ago
- A simple python tool that generates a requests/bs4 based web scraper ★26 · Updated 2 years ago
- Asyncio web crawling framework. Work in progress. ★18 · Updated 6 months ago
- Scrapy integration with Tor for anonymous web scraping ★46 · Updated 9 years ago
- Scrapy downloader middleware that stores response HTMLs to disk. ★18 · Updated 9 months ago
- Analyze scraped data ★46 · Updated 5 years ago
- Library to populate items using XPath and CSS with a convenient API ★46 · Updated 2 weeks ago
- A Ruia plugin for loading javascript - pyppeteer ★18 · Updated 2 years ago
- Common interface for data container classes ★66 · Updated last week
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh. ★11 · Updated last year
- scrapy mysql pipeline ★49 · Updated 3 years ago
- MongoDB with μMongo support for Sanic framework ★27 · Updated last year
- Easily document your Sanic API with Swagger UI, Plus param validation and model serialization. ★47 · Updated 3 years ago
- Web scraping Page Objects core library ★96 · Updated last week
- Scrapy environment with Tor for anonymous ip routing and Privoxy for http proxy ★20 · Updated 8 years ago
- Library that helps use puppeteer in scrapy. ★52 · Updated 3 weeks ago
- Scrapy stats exporter for prometheus ★19 · Updated 3 months ago
- 🕶 Awesome list of Scrapy tools and libraries ★59 · Updated 4 years ago
- More flexible and featured Frontera scheduler for Scrapy ★36 · Updated 2 months ago
- Simple Flask scheduled tasks without extra daemons ★101 · Updated 3 years ago
- Simple, clear and fast Web Crawler framework build on python3.6+, powered by asyncio. ★95 · Updated 2 years ago