yashap / airflow
Workflow manager from Airbnb
☆9Updated 8 years ago
Alternatives and similar repositories for airflow:
Users that are interested in airflow are comparing it to the libraries listed below
- Open Source Cloud Formation☆59Updated 10 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆87Updated 4 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- Better, container friendly big-data images for Docker☆39Updated 8 years ago
- Extensions, custom & experimental panels☆52Updated 9 years ago
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- Cloudbreak Deployer Tool☆34Updated last year
- Ferry lets you define, run, and deploy big data applications on AWS, OpenStack, and your local machine using Docker☆253Updated 9 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 7 months ago
- [DEPRECATED] Documentation for Mesosphere supported open source projects.☆20Updated 5 years ago
- recordbus: mysql binlog to apache kafka☆80Updated 9 years ago
- Unix tee, but for Kinesis streams☆12Updated 3 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud.☆90Updated 6 years ago
- This project allows to run Samza jobs on Mesos cluster☆43Updated 4 years ago
- Metrics produced to Kafka and consumers for monitoring them☆100Updated 10 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆19Updated 7 years ago
- Storm Spout + Kafka State Inspector☆58Updated 5 years ago
- Python library to manage autoscaling logic and actions☆72Updated 6 years ago
- Storm Cassandra Bridge built on CQL☆42Updated last year
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated last year
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Apache Mesos Platform as a Service Deploy☆21Updated 8 years ago
- POC: Spark consumer for bottledwater-pg Kafka Avro topics☆16Updated 4 years ago
- ☆12Updated 4 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Updated 2 years ago
- Common utilities for Apache Kafka☆36Updated last year
- ☆17Updated 9 years ago