al-serebrov / scrapinghub-elasticsearch-loader
Load items from Scrapinghub to ElasticSearch
☆11 Updated 2 years ago
Alternatives and similar repositories for scrapinghub-elasticsearch-loader:
Users who are interested in scrapinghub-elasticsearch-loader are comparing it to the libraries listed below.
- A browser extension to monitor your spiders deployed on Scrapy Cloud. ☆16 Updated 2 months ago
- A Scrapy extension to log items coverage when the spider shuts down ☆19 Updated 5 years ago
- Convert Javascript code to an XML document ☆186 Updated 3 years ago
- Parsing JavaScript objects into Python data structures ☆203 Updated this week
- A library to make it easier to load input URLs to start scrapy processes ☆14 Updated 4 years ago
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls ☆272 Updated 2 months ago
- Automatic unit test generation for Scrapy. ☆56 Updated 3 years ago
- Page Object pattern for Scrapy ☆121 Updated 2 months ago
- Extract price amount and currency symbol from a raw text string ☆327 Updated 2 months ago
- This repository is no longer maintained. ☆130 Updated 6 years ago
- A client interface for Scrapinghub's API ☆206 Updated 2 months ago
- Scrapinghub Command Line Client ☆132 Updated 2 weeks ago
- Scrapy Training companion code ☆174 Updated 6 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc. ☆56 Updated 3 years ago
- Scrapy Extension for monitoring spiders execution. ☆540 Updated 3 weeks ago
- Run a Scrapy spider programmatically from a script or a Celery task - no project required. ☆122 Updated 11 months ago
- Analyze scraped data ☆46 Updated 5 years ago
- Simple tool to convert curl requests to scrapy. ☆45 Updated 3 years ago
- Splash + HAProxy + Docker Compose ☆196 Updated 6 years ago
- Formasaurus tells you the type of an HTML form and its fields using machine learning ☆118 Updated 10 months ago
- Extract text from HTML ☆135 Updated 4 years ago
- Python library of web-related functions ☆406 Updated this week
- 🎭 Twisted Deferred Thread backend for Requests. ☆417 Updated 6 years ago
- Python DSL for Elasticsearch ☆25 Updated this week
- validate arbitrary data structures in python ☆141 Updated 6 years ago
- Celery Once allows you to prevent multiple execution and queuing of celery tasks. ☆21 Updated 9 years ago
- Useful test spiders for Scrapy ☆185 Updated 5 years ago
- ☆22 Updated 2 years ago
- Materials from europython2015 ☆23 Updated 9 years ago
- Generator of User-Agent header ☆338 Updated 10 months ago