scrapinghub/exporters

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scrapinghub/exporters)

scrapinghub / exporters

Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations

☆39

Alternatives and similar repositories for exporters

Users that are interested in exporters are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

raidikalu / raidikalu
View on GitHub
Listaa raideja ja silleen
☆16Nov 2, 2022Updated 3 years ago
scrapinghub / flatson
View on GitHub
Tool to flatten stream of JSON-like objects, configured via schema
☆33Oct 19, 2019Updated 6 years ago
scrapinghub / page_finder
View on GitHub
Find which links on a web page are pagination links
☆29Jan 12, 2017Updated 9 years ago
scrapinghub / skinfer
View on GitHub
Skinfer is a tool for inferring and merging JSON schemas
☆141Apr 24, 2024Updated 2 years ago
scrapinghub / python-hubstorage
View on GitHub
Deprecated HubStorage client library - please use python-scrapinghub>=1.9.0 instead
☆16Dec 5, 2016Updated 9 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
TeamHG-Memex / scrapy-kafka-export
View on GitHub
Scrapy extension which writes crawled items to Kafka
☆31Apr 8, 2026Updated 3 months ago
scrapinghub / mdr
View on GitHub
A python library detect and extract listing data from HTML page.
☆110May 5, 2017Updated 9 years ago
scrapinghub / extruct
View on GitHub
Extract embedded metadata from HTML markup
☆967Apr 1, 2026Updated 3 months ago
stummjr / scrapy-fieldstats
View on GitHub
A Scrapy extension to log items coverage when the spider shuts down
☆18Apr 11, 2020Updated 6 years ago
scrapinghub / aile
View on GitHub
Automatic Item List Extraction
☆85Jun 15, 2016Updated 10 years ago
scrapy-plugins / scrapy-magicfields
View on GitHub
Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.
☆56Mar 16, 2022Updated 4 years ago
chakki-works / entitypedia
View on GitHub
Entitypedia is an Extended Named Entity Dictionary from Wikipedia.
☆13Dec 7, 2022Updated 3 years ago
scrapinghub / scrapy-autounit
View on GitHub
Automatic unit test generation for Scrapy.
☆58Jul 12, 2021Updated 5 years ago
scrapedia / r18
View on GitHub
A scrapy spider for R18
☆16Updated this week
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
scrapinghub / scrapy-training
View on GitHub
Scrapy Training companion code
☆173Jan 30, 2019Updated 7 years ago
scrapinghub / aduana
View on GitHub
Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…
☆54May 21, 2024Updated 2 years ago
b-e-p / bep
View on GitHub
bleeding edge (python) packages...
☆15Nov 22, 2015Updated 10 years ago
scrapinghub / docker-images
View on GitHub
☆33Oct 20, 2025Updated 9 months ago
brianwarehime / mcrits
View on GitHub
Visualize your CRITs IOC's in Maltego
☆12Jan 13, 2015Updated 11 years ago
scrapinghub / shub
View on GitHub
Scrapinghub Command Line Client
☆129Jul 22, 2026Updated last week
rkhwaja / fs.googledrivefs
View on GitHub
Implementation of a pyfilesystem2 filesystem for Google Drive
☆27Jul 15, 2026Updated 2 weeks ago
agaoglu / pyjasperclient
View on GitHub
JasperServer SOAP client for Python
☆27May 5, 2017Updated 9 years ago
FavyTeam / Advanced_PHP_Scrapping
View on GitHub
Enhanment Scrapping API for six hotel booking website from Expedia.com, Booking.com, Bookhotelbeds.com. Hotels.com, Bestday.com, despegar…
☆11May 7, 2018Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
scrapinghub / arche
View on GitHub
Analyze scraped data
☆47Dec 9, 2019Updated 6 years ago
scrapinghub / scrapy-mosquitera
View on GitHub
Restrict crawl and scraping scope using matchers.
☆26Jun 8, 2016Updated 10 years ago
browniebroke / cookiecutter-lambda-function
View on GitHub
A cookiecutter template to create AWS Lambda function
☆23Apr 15, 2019Updated 7 years ago
CIRCL / bgpranking-redis-api
View on GitHub
API to access the Redis database of a BGP Ranking instance.
☆17Dec 11, 2017Updated 8 years ago
mfwarren / cpa-1464
View on GitHub
Implementation of the Canadian Payment Association Standard 005, 1464 byte file format, for transmitting payments
☆11Dec 7, 2018Updated 7 years ago
shish / devtools-py
View on GitHub
A Python client for Chrome's DevTools protocol / a headless chrome control library
☆15Aug 20, 2018Updated 7 years ago
scrapinghub / js2xml
View on GitHub
Convert Javascript code to an XML document
☆188Mar 14, 2022Updated 4 years ago
scrapy-plugins / scrapy-querycleaner
View on GitHub
Scrapy spider middleware to clean up query parameters in request URLs
☆24Jun 30, 2016Updated 10 years ago
deathbybandaid / pimotd
View on GitHub
This tweaks the motd do be much cooler
☆12May 15, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
scrapinghub / crawlera-tools
View on GitHub
Crawlera tools
☆26Feb 9, 2016Updated 10 years ago
scrapinghub / scrapyrt
View on GitHub
HTTP API for Scrapy spiders
☆882Jun 29, 2026Updated last month
aGHz / structominer
View on GitHub
Data scraping for a more civilized age
☆17Jun 12, 2014Updated 12 years ago
venmo / feature_ramp
View on GitHub
Toggling and ramping features via a lightweight Redis backend.
☆18Sep 26, 2019Updated 6 years ago
rmax / scrapy-inline-requests
View on GitHub
A decorator to write coroutine-like spider callbacks.
☆109Dec 26, 2022Updated 3 years ago
scrapinghub / python-simhash
View on GitHub
An efficient simhash implementation for python
☆127Oct 25, 2019Updated 6 years ago
venmo / swaggergenerator
View on GitHub
Create swagger / OpenAPI schemas from example interactions.
☆12May 23, 2023Updated 3 years ago