yall / scrapy-twitter
☆45Updated 8 years ago
Alternatives and similar repositories for scrapy-twitter:
Users that are interested in scrapy-twitter are comparing it to the libraries listed below
- An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site☆126Updated 6 years ago
- MongoDB extensions for Scrapy☆44Updated 10 years ago
- Automatic Item List Extraction☆87Updated 8 years ago
- A Twitter search client mining tweets using their advanced search implemtation.☆90Updated 6 years ago
- ☆59Updated 3 years ago
- A middleware to use random user agent in Scrapy crawler.☆33Updated 12 years ago
- Scrapes sites. Gets news. Eventually events.☆84Updated 8 years ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆162Updated 2 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 9 years ago
- Python Diffbot API Client☆124Updated last year
- Docker container running scrapyd with HTTP authentication☆41Updated 9 months ago
- Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.☆125Updated 11 years ago
- A Scrapy pipeline to categorize items using MonkeyLearn☆38Updated 7 years ago
- Scrapy middleware for the autologin☆37Updated 6 years ago
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls☆269Updated last week
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated 9 months ago
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Scrapy Middleware to set a random User-Agent for every Request.☆201Updated 5 years ago
- ☆49Updated 2 years ago
- A generic crawler☆78Updated 6 years ago
- Scrapy integration with Tor for anonymous web scraping☆46Updated 9 years ago
- A scrapy pipeline which send items to Elastic Search server☆98Updated 7 years ago
- extract difference between two html pages☆32Updated 6 years ago
- Some scrapy and web.py exmaples☆79Updated 7 years ago
- An efficient simhash implementation for python☆124Updated 5 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Updated 2 years ago
- Python scripts for scraping bus ticket data from the websites of BoltBus, Greyhound, Megabus, GoBus, Amtrak, Peterpan, and EasternTravel.☆39Updated 4 years ago
- A twitter crawler in Python☆304Updated 7 years ago
- A flask API for running your scrapy spiders☆128Updated 6 years ago