lowks / AirflowLinks
AirFlow is a system to programmaticaly author, schedule and monitor data pipelines.
☆13Updated 11 years ago
Alternatives and similar repositories for Airflow
Users that are interested in Airflow are comparing it to the libraries listed below
Sorting:
- Simple python logging handler for forwarding logs to a kafka server☆30Updated 6 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 3 years ago
- DEPRECATED - HBase Stargate (REST API) client wrapper for Python.☆54Updated 7 years ago
- ☆49Updated 8 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 6 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆234Updated 3 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆88Updated 4 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 6 years ago
- A third-party client for the Clickhouse DBMS server.☆263Updated 2 years ago
- Running Presto on k8s☆38Updated 6 years ago
- Dump mysql tables to s3, and parse them☆31Updated 11 years ago
- Hadoop Cluster Configurations☆32Updated 4 years ago
- Sysbench benchmark for MongoDB compatible databases☆102Updated 3 months ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 5 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated 2 years ago
- MySQL-like queries for Druid built on top of Plywood☆146Updated 6 years ago
- Airflow workflow management platform chef cookbook.☆70Updated 6 years ago
- my tools working with redis☆112Updated 7 years ago
- Clickhouse cluster on docker☆36Updated 5 years ago
- REST-like API exposing Airflow data and operations☆61Updated 7 years ago
- A performance-focused tuned profile for MongoDB on CentOS/Redhat Linux☆37Updated 9 years ago
- nginx kafka module, send post log data to kafka cluster☆176Updated 3 years ago
- Exports hadoop metrics via HTTP for Prometheus consumption☆19Updated 5 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆293Updated 2 years ago
- ☆94Updated 2 years ago
- MySQLStreamer is a database change data capture and publish system.☆411Updated 3 years ago
- iiBench benchmark for MongoDB and TokuMX☆31Updated 3 years ago
- Simple nginx logs parser & transporter to ClickHouse database.☆157Updated 2 months ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 5 years ago
- ClickHouse database driver for the Metabase business intelligence front-end☆511Updated 7 months ago