binaryaffairs / a-la-mode
A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)
☆71Updated 4 years ago
Alternatives and similar repositories for a-la-mode:
Users that are interested in a-la-mode are comparing it to the libraries listed below
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆54Updated last year
- Airflow declarative DAGs via YAML☆132Updated last year
- Terraform provider for kafka☆32Updated 5 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated 11 months ago
- Highly configurable Helm Presto Chart☆24Updated 5 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Reporters and collectors for use in Google Cloud Platform☆91Updated last month
- Environment, operations and runtime-meta testing tool.☆90Updated 3 years ago
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow☆209Updated 2 weeks ago
- A microservice app that demonstrates the power of tilt☆50Updated 2 years ago
- Kafka sink connector for streaming messages to PostgreSQL☆90Updated 4 years ago
- Open Source Secret Provider plugin for the Kafka Connect framework☆46Updated 8 months ago
- Ephemeral Hadoop clusters using Google Compute Platform☆135Updated 3 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆66Updated last month
- Etcd cluster appliance for the STUPS (AWS) environment☆30Updated 11 months ago
- Streaming left joins in Kafka for change data capture☆52Updated 10 months ago
- Opinionated serverless event analytics pipeline☆43Updated last year
- Deploy Presto on the cloud easily, using Terraform and Packer☆44Updated 2 years ago
- A CLI and library to run Singer Taps and Targets☆34Updated 3 years ago
- pg2kinesis uses logical decoding in Postgres 9.4 or later to capture a consistent, continuous stream of events from the database and publ…☆59Updated 2 years ago
- A Apache Hive SerDe (short for serializer/deserializer) for the Ion file format.☆30Updated this week
- Knative event sources for AWS services☆60Updated 2 years ago
- Marquez Web UI☆22Updated 4 years ago
- Helm Chart for Fn☆56Updated 6 years ago
- Airflow on Kubernetes Operator☆89Updated 2 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Continuously synchronize directories from remote object store to local filesystem☆104Updated last month
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆261Updated last year
- Aiven's S3 Sink Connector for Apache Kafka®☆69Updated 6 months ago