zym1115718204 / xspider
A distributed web crawler system with support for templated spider development.
☆13Updated 7 years ago
Alternatives and similar repositories for xspider:
Users who are interested in xspider are comparing it to the libraries listed below:
- A project that attempts to automatically log in to a website given a single seed☆11Updated 8 months ago
- An awesome public proxy server crawler based on the Scrapy framework☆96Updated 7 years ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆162Updated 2 years ago
- Output scrapy statistics to graphite/carbon☆54Updated 11 years ago
- Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.☆189Updated last year
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls☆269Updated 3 years ago
- A distributed crawler template based on scrapy-redis☆42Updated 7 years ago
- antitools☆17Updated 4 years ago
- Brownant is a web data extracting framework.☆159Updated 7 years ago
- A Python function/method output cache system based on function decorators.☆58Updated 4 years ago
- A crawler framework based on asyncio, aiohttp, and uvloop☆14Updated 6 years ago
- Extends the official Elasticsearch Python API adding Tornado AsyncHTTPClient support☆38Updated 4 years ago
- A Python wrapper for working with Scrapyd's API.☆270Updated 6 months ago
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- Use pyppeteer from a Scrapy spider☆60Updated 5 years ago
- PhantomJS Downloader for Scrapy, Yeah!☆94Updated 10 years ago
- Some Scrapy and web.py examples☆79Updated 7 years ago
- A distributed crawler implemented with MongoDB for storage, Redis for caching, and Celery.☆13Updated 2 years ago
- PyQuery-based scraping micro-framework.☆116Updated 3 years ago
- Celery Redis scheduler; dynamically add, modify, or delete tasks in Celery.☆180Updated 4 months ago
- Python bloom filter using redis as a shared backend.☆19Updated 7 years ago
- Fast Redis Bloom Filters in Python☆289Updated 6 years ago
- ☆29Updated 3 years ago
- Useful test spiders for Scrapy☆185Updated 5 years ago
- Fish Fish Jump is a simple, basic search engine solution in Python.☆55Updated 6 years ago
- A RabbitMQ Scheduler for Scrapy☆86Updated 2 years ago
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmi…☆24Updated 2 weeks ago
- A Python Module for the "General SQL Parser" library (sqlparser.com)☆76Updated last year
- Python 3 AsyncIO powered scraping framework with batteries included☆20Updated 8 years ago
- Django based application that allows creating, deploying and running Scrapy spiders in a distributed manner☆113Updated 6 years ago