Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations
☆40May 21, 2024Updated last year
Alternatives and similar repositories for exporters
Users that are interested in exporters are comparing it to the libraries listed below
Sorting:
- High Level Kafka Scanner☆19Sep 29, 2017Updated 8 years ago
- Scrapy extension which writes crawled items to Kafka☆30Feb 10, 2026Updated 3 weeks ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Oct 19, 2019Updated 6 years ago
- Deprecated HubStorage client library - please use python-scrapinghub>=1.9.0 instead☆16Dec 5, 2016Updated 9 years ago
- Skinfer is a tool for inferring and merging JSON schemas☆141Apr 24, 2024Updated last year
- Deploy an Image Recognition API using Go, Terraform, Lambda, and API Gateway.☆15May 8, 2018Updated 7 years ago
- MongoDB extensions for Scrapy☆44Oct 2, 2014Updated 11 years ago
- Create swagger / OpenAPI schemas from example interactions.☆12May 23, 2023Updated 2 years ago
- Detect and classify pagination links☆15Sep 9, 2020Updated 5 years ago
- A Scrapy extension to log items coverage when the spider shuts down☆19Apr 11, 2020Updated 5 years ago
- 🔁 jsonapi protocol implementation for Django.☆20Mar 5, 2018Updated 8 years ago
- Crawlera tools☆26Feb 9, 2016Updated 10 years ago
- This is a reusable Django application with which you can log activities your users are making and displaying a stream similar to the acti…☆87Nov 8, 2015Updated 10 years ago
- ⛔️ DEPRECATED ⛔️ Select which functions are to be deployed based on region and stage.☆25Mar 5, 2024Updated 2 years ago
- Learn React.js by building a re-usable Survey application. We'll cover React v16.8 with a heavy focus on the use of React Hooks.☆20Mar 27, 2019Updated 6 years ago
- Deployment Automation Engine☆26Updated this week
- A decorator to write coroutine-like spider callbacks.☆109Dec 26, 2022Updated 3 years ago
- Automatic unit test generation for Scrapy.☆57Jul 12, 2021Updated 4 years ago
- A scrapy spider for R18☆16Feb 21, 2026Updated 2 weeks ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55May 21, 2024Updated last year
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆57Mar 16, 2022Updated 3 years ago
- Scrapy spider middleware to clean up query parameters in request URLs☆24Jun 30, 2016Updated 9 years ago
- A client interface for Scrapinghub's API☆206Oct 3, 2025Updated 5 months ago
- Follow Twitter users based on keywords☆29Nov 5, 2016Updated 9 years ago
- Convert Javascript code to an XML document☆187Mar 14, 2022Updated 3 years ago
- Read JSON lines (jl) files, including gzipped and broken☆36Feb 10, 2026Updated 3 weeks ago
- ☆35Oct 25, 2023Updated 2 years ago
- OSoMe API mashups☆11Jan 29, 2019Updated 7 years ago
- A simple algorithm for clustering web pages, suitable for crawlers☆35Mar 6, 2017Updated 9 years ago
- Python-based cross-platform tool for mining text data (html, transcript, problems) of edX MOOCs on a user's dashboard. It is an extension…☆10Feb 12, 2020Updated 6 years ago
- Apache Spark based framework for analysis A/B experiments☆15Nov 3, 2024Updated last year
- ☆12Sep 10, 2019Updated 6 years ago
- ⚡️ Actions and Reducer Utilities for NGRX☆10Oct 17, 2019Updated 6 years ago
- Basic example using Clarifai custom training.☆10Oct 17, 2015Updated 10 years ago
- Provides scheduled jobs management from the Django Admin using Django-RQ☆32Oct 14, 2019Updated 6 years ago
- Scrapy middleware for the autologin☆37Feb 10, 2026Updated 3 weeks ago
- A Beginner's Guide to Machine Learning with Scikit-Learn☆31Feb 15, 2014Updated 12 years ago
- Turkish Lemmatizer is used for finding root form of Turkish words.☆12Nov 30, 2013Updated 12 years ago
- ☆13Nov 25, 2024Updated last year