Yelp / s3mysqldump
Dump mysql tables to s3, and parse them
☆31Updated 10 years ago
Alternatives and similar repositories for s3mysqldump:
Users that are interested in s3mysqldump are comparing it to the libraries listed below
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago
- TAC is an airflow plugin which helps you to Extract transform and Load your data, bit more easily☆9Updated 7 years ago
- An offline task management framework, built on top of luigi.☆16Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- Pyramid tween to add Zipkin service spans☆28Updated 5 months ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Telecom scenarios implemented with streaming techniques☆11Updated last year
- A template-based cluster provisioning system☆61Updated 2 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year
- Python Client for WebHDFS REST API☆43Updated 9 years ago
- Zipkin API for python☆18Updated last year
- A Pythonic API for Amazon's States Language for defining AWS Step Functions☆8Updated 2 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- Example using Grafana with Druid☆11Updated 9 years ago
- Addon to the official elasticsearch python client for X-Pack (deprecated)☆10Updated 6 years ago
- ☆11Updated 9 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- Cubes OLAP Examples☆74Updated 6 years ago
- ☆26Updated 5 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).☆16Updated 10 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 5 months ago
- Large scale server deploys using BitTorrent and the BitTornado library by Murder (https://github.com/lg/murder)☆28Updated 11 years ago
- Application Driven Stats Monitoring☆229Updated 9 years ago
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 10 years ago
- Elasticsearch Watcher plugin for the elasticsearch.js client☆14Updated 6 years ago