wind2sing / aCrawler
π A powerful web-crawling framework, based on aiohttp.
β15Updated 5 years ago
Alternatives and similar repositories for aCrawler:
Users that are interested in aCrawler are comparing it to the libraries listed below
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmiβ¦β24Updated 2 months ago
- python 3.7 asyncio tutorial.β14Updated 5 years ago
- tus.io protocol implementation for aiohttp.web applicationsβ17Updated 2 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.β21Updated 4 years ago
- Pyppeteer integration for Scrapyβ59Updated 4 years ago
- Analyze scraped dataβ46Updated 5 years ago
- Scrapy + Puppeteerβ111Updated 3 years ago
- Use pyppeteer from a Scrapy spiderβ60Updated 5 years ago
- Easily document your Sanic API with Swagger UI, Plus param validation and model serialization.β47Updated 3 years ago
- A Ruia plugin for loading javascript - pyppeteerβ18Updated 2 years ago
- Integrates terminado (a web based terminal) with flaskβ15Updated 7 years ago
- A cancelled Python web server project.β39Updated 5 years ago
- β76Updated 4 years ago
- MongoDB with ΞΌMongo support for Sanic frameworkβ27Updated last year
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.β56Updated 3 years ago
- πΆ Awesome list of Scrapy tools and librariesβ59Updated 4 years ago
- Combine XPath, CSS Selectors and JSONPath for Web data extracting.β28Updated 3 months ago
- Asyncio web crawling framework. Work in progress.β18Updated 8 months ago
- asyncio client for neo4jβ29Updated 3 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schemaβ45Updated 4 years ago
- Free & open source API service for obtaining information about +9600 universities worldwide.β67Updated 3 years ago
- Page Object pattern for Scrapyβ121Updated 2 months ago
- Mobilenium allows you to use Selenium and have access to status codes and HTTP headers, without the need for manual labor.β20Updated 5 years ago
- sonic search backend client in pythonβ60Updated 3 years ago
- An efficient and lightweight thread poolβ37Updated 4 years ago
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.β109Updated 10 months ago
- Scrapy spider middleware to clean up query parameters in request URLsβ25Updated 8 years ago
- Extract structured data from HTML and XML documents like a boss.β49Updated 4 months ago
- Library to populate items using XPath and CSS with a convenient APIβ48Updated 3 weeks ago
- Web scraping Page Objects core libraryβ99Updated 2 months ago