Yelp / s3mysqldumpLinks
Dump mysql tables to s3, and parse them
☆31Updated 10 years ago
Alternatives and similar repositories for s3mysqldump
Users that are interested in s3mysqldump are comparing it to the libraries listed below
Sorting:
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 4 years ago
- A performance-focused tuned profile for MongoDB on CentOS/Redhat Linux☆37Updated 8 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year
- Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).☆16Updated 10 years ago
- Cubes OLAP Examples☆74Updated 7 years ago
- Provides a Pythonic interface for reading and writing Avro schemas☆27Updated 2 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- ☆21Updated last year
- ElasticSearch plugin to watch segment dynamics (additions, merges, deletes)☆136Updated 9 years ago
- Addon to the official elasticsearch python client for X-Pack (deprecated)☆10Updated 7 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- TAC is an airflow plugin which helps you to Extract transform and Load your data, bit more easily☆9Updated 8 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- A Redis Replication Cluster accessible through HAProxy running across a Docker Composed-Swarm with Supervisor and Sentinel☆51Updated 9 years ago
- Extensible language parser with Python-like syntax. Written in Java and antlr.☆18Updated 7 years ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- ☆11Updated 9 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆87Updated 4 years ago
- Extract data about items from JIRA, output raw data and interesting reports☆18Updated 3 years ago
- Telecom scenarios implemented with streaming techniques☆11Updated 2 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 2 years ago
- Python Implementation of Super and Hyper Log Log Sketches☆49Updated 13 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 5 months ago
- Tail a log file and send log lines automatically to a kafka topic☆57Updated 13 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- An offline task management framework, built on top of luigi.☆16Updated 9 years ago
- Change Data Capture (CDC) toolkit for keeping system layers in sync with the database☆23Updated 8 years ago