lowks / Airflow
Airflow is a system to programmatically author, schedule and monitor data pipelines.
☆13Updated 11 years ago
Alternatives and similar repositories for Airflow
Users that are interested in Airflow are comparing it to the libraries listed below
- Simple python logging handler for forwarding logs to a kafka server☆30Updated 6 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 3 years ago
- Running Presto on k8s☆38Updated 6 years ago
- ☆49Updated 8 years ago
- Exports hadoop metrics via HTTP for Prometheus consumption☆19Updated 5 years ago
- A third-party client for the Clickhouse DBMS server.☆263Updated 2 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆88Updated 4 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆234Updated 3 years ago
- Ansible roles to install a Spark Standalone cluster (HDFS/Spark/Jupyter Notebook) or Ambari based Spark cluster☆61Updated 2 years ago
- my tools working with redis☆112Updated 7 years ago
- DEPRECATED - HBase Stargate (REST API) client wrapper for Python.☆54Updated 7 years ago
- MySQL-like queries for Druid built on top of Plywood☆146Updated 6 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 6 years ago
- Altinity Dashboard helps you manage ClickHouse installations controlled by clickhouse-operator.☆68Updated last week
- ☆208Updated 9 years ago
- Clickhouse cluster on docker☆36Updated 5 years ago
- Airflow script for incremental data import from Mysql to Hive using Sqoop.☆18Updated 7 years ago
- ClickHouse Chinese documentation☆17Updated 7 years ago
- Hadoop exporter☆53Updated 6 years ago
- Tutorial for setting up a ClickHouse server.☆153Updated last year
- REST-like API exposing Airflow data and operations☆61Updated 7 years ago
- A plugin for Apache Airflow that allows you to manage the users that can log in☆14Updated 6 years ago
- ClickHouse stress tests suite☆21Updated 6 years ago
- A change data capture system for PostgreSQL☆11Updated 10 years ago
- Oplog-based data sync tool that synchronizes data from a replica set to another deployment, e.g.: standalone, replica set, and sharded cl…☆109Updated 2 years ago
- ☆34Updated 4 years ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 5 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆293Updated 2 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 5 years ago
- An extendable Docker image for Airbnb's Superset platform, previously known as Caravel.☆114Updated 3 years ago