yashap / airflow
Workflow manager from Airbnb
☆9 · Updated 8 years ago
Alternatives and similar repositories for airflow:
Users who are interested in airflow are comparing it to the libraries listed below.
- Open Source Cloud Formation ☆59 · Updated 9 years ago
- [DEPRECATED] Documentation for Mesosphere supported open source projects. ☆20 · Updated 5 years ago
- Consumes Kafka topics specified in the config, and outputs them in chunks as desired in an S3 Bucket. Keeps track of offsets via S3. ☆15 · Updated 11 years ago
- Common utilities for Apache Kafka ☆36 · Updated last year
- Hive Storage Handler for Kinesis. ☆11 · Updated 9 years ago
- A rough prototype of a tool for discovering Apache Hive schemas from JSON documents. ☆42 · Updated last year
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications) ☆97 · Updated 2 years ago
- Storm Spout + Kafka State Inspector ☆58 · Updated 5 years ago
- Apache Spark AWS Lambda Executor (SAMBA) ☆44 · Updated 6 years ago
- Tools for working with parquet, impala, and hive ☆134 · Updated 4 years ago
- Automated deploy for Kafka on AWS ☆123 · Updated 13 years ago
- Unix tee, but for Kinesis streams ☆12 · Updated 3 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage ☆14 · Updated 9 years ago
- Amazon Kinesis Aggregators provides a simple way to create real time aggregations of data on Amazon Kinesis. ☆150 · Updated 3 years ago
- Fast JSON to Avro converter ☆61 · Updated 6 years ago
- Store batched Kafka messages in S3. ☆39 · Updated 2 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 · Updated 5 years ago
- ☆76 · Updated 8 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline ☆87 · Updated 3 years ago
- Deprecated - Check out MemSQL Pipelines instead! ☆8 · Updated 7 years ago
- Metrics produced to Kafka and consumers for monitoring them ☆100 · Updated 10 years ago
- kafka-connect-s3: Ingest data from Kafka to Object Stores (S3) ☆95 · Updated 5 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB ☆94 · Updated 4 years ago
- An AWS SDK-backed FileSystem driver for Hadoop ☆64 · Updated 4 years ago
- Cloudbreak Deployer Tool ☆34 · Updated last year
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained. ☆49 · Updated 5 years ago
- Scripts for running Apache Kafka on Mesosphere's Marathon ☆14 · Updated 9 years ago
- Kinesis spout for Storm ☆106 · Updated 6 years ago
- Scheduled task execution on top of AWS Data Pipeline ☆43 · Updated 9 years ago
- ☆33 · Updated 10 years ago