yashap / airflow
Workflow manager from Airbnb
☆9 Updated 9 years ago
Alternatives and similar repositories for airflow
Users who are interested in airflow are comparing it to the libraries listed below.
- Open Source Cloud Formation ☆59 Updated 10 years ago
- kafka-connect-s3: Ingest data from Kafka to object stores (S3) ☆94 Updated 6 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR ☆118 Updated 9 years ago
- Tools for working with Parquet, Impala, and Hive ☆134 Updated 4 years ago
- This project is no longer actively supported. It is made available as read-only. A highly available, horizontally scalable queuing and no… ☆276 Updated 6 years ago
- Examples of how to use the command line tools in Avro Tools to read and write Avro files ☆154 Updated last year
- Extensions, custom & experimental panels ☆53 Updated 9 years ago
- ☆76 Updated 8 years ago
- Metrics produced to Kafka and consumers for monitoring them ☆101 Updated 10 years ago
- Cubes over ElasticSearch. Aggregation library for Business Intelligence ☆20 Updated 10 years ago
- recordbus: MySQL binlog to Apache Kafka ☆80 Updated 9 years ago
- A Kafka-Connect Sink for S3 with no Hadoop dependencies. ☆57 Updated 2 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt… ☆136 Updated 2 years ago
- Interactive Audience Analytics with Spark and HyperLogLog ☆55 Updated 9 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a … ☆328 Updated 3 years ago
- A fork of the Apache Kafka "connect-file" Kafka Connect, to use as a starting point to write your own Kafka connectors. ☆37 Updated 7 years ago
- Apache Kafka HTTP Endpoint for producing and consuming messages from topics ☆153 Updated 10 years ago
- SQL for Kafka Connectors ☆98 Updated last year
- ☆26 Updated 5 years ago
- Hadoop MapReduce job to bulk load data into Cassandra ☆75 Updated 3 years ago
- Ferry lets you define, run, and deploy big data applications on AWS, OpenStack, and your local machine using Docker ☆253 Updated 10 years ago
- This project allows running Samza jobs on a Mesos cluster ☆43 Updated 4 years ago
- Annotation driven Java object writer for ORC with runtime code generation for speed. ☆20 Updated last year
- Jetstream is a streaming processing framework ☆113 Updated 9 years ago
- Common utilities for Apache Kafka ☆36 Updated last year
- Kinesis spout for Storm ☆106 Updated 7 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline ☆87 Updated 4 years ago
- Unix tee, but for Kinesis streams ☆12 Updated 3 years ago
- A utility for generating Oozie workflows from a YAML definition ☆48 Updated 6 years ago
- [DEPRECATED] Documentation for Mesosphere supported open source projects. ☆20 Updated 5 years ago