corywalker / selenium-crawler
Sometimes sites make crawling hard. Selenium-crawler uses Selenium automation to fix that.
☆125Updated 11 years ago
Alternatives and similar repositories for selenium-crawler:
Users who are interested in selenium-crawler are comparing it to the libraries listed below.
- Crawlera tools☆26Updated 9 years ago
- Some Scrapy and web.py examples☆79Updated 7 years ago
- An example using Selenium webdrivers for Python and the Scrapy framework to create a web scraper to crawl an ASP site☆127Updated 6 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- ☆143Updated 9 years ago
- A scalable and efficient crawler with a Docker cluster; crawls a million pages in 2 hours on a single machine☆97Updated 11 months ago
- Python implementation of the Parsley language for extracting structured data from web pages☆92Updated 7 years ago
- A Python module to fetch and parse results from different search engines.☆77Updated 6 years ago
- ☆167Updated 6 years ago
- Python distributed web scraper and dynamic crawler☆141Updated 7 years ago
- Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 6 years ago
- MongoDB extensions for Scrapy☆44Updated 10 years ago
- Blog crawler for the blogforever project.☆22Updated 11 years ago
- Using Scrapy to get company profiles from http://crunchbase.com☆31Updated 11 years ago
- Crochet-based blocking API for Scrapy.☆46Updated 8 years ago
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆32Updated 7 years ago
- An SDK for AlchemyAPI using Python - Please note that this legacy AlchemyAPI SDK is no longer supported by IBM. Please use the Watson SDKs…☆98Updated 8 years ago
- Command line tool for Sentiment analysis of tweets - done for data mining sem project☆11Updated 8 years ago
- A library to interface with the Linkscape API.☆40Updated 6 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 9 years ago
- Scrapy examples crawling Craigslist☆199Updated 8 years ago
- A Python library to detect and extract listing data from an HTML page.☆108Updated 7 years ago
- Scrapes public information off of LinkedIn☆110Updated 9 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago
- Running a Scrapy spider programmatically.☆47Updated 8 years ago
- Natural Language Generator for Python☆27Updated 7 years ago
- Scrapy Middleware to set a random User-Agent for every Request.☆202Updated 5 years ago
- ☆18Updated 8 years ago
- Dmoz RDF parser☆28Updated 8 years ago