wind2sing / aCrawlerLinks
π A powerful web-crawling framework, based on aiohttp.
β15Updated 6 years ago
Alternatives and similar repositories for aCrawler
Users that are interested in aCrawler are comparing it to the libraries listed below
Sorting:
- Pyppeteer integration for Scrapyβ58Updated 4 years ago
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmiβ¦β24Updated 11 months ago
- Use pyppeteer from a Scrapy spiderβ59Updated 6 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.β23Updated 8 months ago
- Analyze scraped dataβ46Updated 6 years ago
- python 3.7 asyncio tutorial.β14Updated 6 years ago
- Scrapy + Puppeteerβ110Updated 4 years ago
- Asyncio web crawling framework. Work in progress.β19Updated last year
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.β57Updated 3 years ago
- Zyte Automatic Extraction integration for Scrapyβ56Updated 4 years ago
- Extract structured data from HTML and XML documents like a boss.β50Updated last year
- Scrapy schema validation pipeline and Item builder using JSON Schemaβ45Updated 4 years ago
- tus.io protocol implementation for aiohttp.web applicationsβ17Updated 3 years ago
- Simple Web UI for Scrapy spider management via Scrapydβ50Updated 7 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.β193Updated 3 years ago
- Library for annotation-based dependency injectionβ24Updated last month
- A complimentary proxy to help to use SPM with headless browsersβ108Updated 2 years ago
- A Ruia plugin for loading javascript - pyppeteerβ18Updated 3 years ago
- Web scraping Page Objects core libraryβ104Updated last week
- A simple, Qt-Webengine powered web browser with built in functionality for basic scrapy webscraping support.β110Updated last year
- Extract text from HTMLβ134Updated 2 weeks ago
- Page Object pattern for Scrapyβ126Updated last week
- Django based application that allows creating, deploying and running Scrapy spiders in a distributed mannerβ112Updated 7 years ago
- Combine XPath, CSS Selectors and JSONPath for Web data extracting.β29Updated last year
- A Flask-based CMS / web application for compatibility and simplicity.β14Updated 6 years ago
- Scrapy middleware for the autologinβ36Updated 2 weeks ago
- scrapy mysql pipelineβ49Updated 4 years ago
- A scrapy extension to sync `.scrapy` folder to an S3 bucketβ18Updated 3 years ago
- Python clients for Zyte AutoExtract APIβ41Updated 4 years ago
- A RabbitMQ Scheduler for Scrapyβ87Updated 3 years ago