binaryaffairs / a-la-mode
A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)
☆71Updated 4 years ago
Alternatives and similar repositories for a-la-mode:
Users that are interested in a-la-mode are comparing it to the libraries listed below
- Terraform provider for kafka☆32Updated 5 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆54Updated last year
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated 10 months ago
- Continuously synchronize directories from remote object store to local filesystem☆102Updated this week
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow☆209Updated last week
- Environment, operations and runtime-meta testing tool.☆90Updated 3 years ago
- Deploy Presto on the cloud easily, using Terraform and Packer☆44Updated last year
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆27Updated this week
- Opinionated serverless event analytics pipeline☆43Updated last year
- Pulsar weekly community update☆10Updated 2 years ago
- ☆33Updated last year
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆66Updated 11 months ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆74Updated 2 years ago
- Highly configurable Helm Presto Chart☆24Updated 5 years ago
- A protobuf schema registry on steroids. It will keep track of the contracts throughout your organization, making sure no contract is brok…☆43Updated 4 years ago
- ☆45Updated 7 years ago
- Terraform samples to sync and use IAM users ssh keys to connect to EC2 instances☆13Updated 5 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Kafka sink connector for streaming messages to PostgreSQL☆90Updated 4 years ago
- Simple Samza Job Using Confluent Platform☆14Updated 8 years ago
- Standalone alternatives to Kafka Connect Connectors☆42Updated this week
- tap-postgres☆68Updated 5 months ago
- Open Source Secret Provider plugin for the Kafka Connect framework☆46Updated 7 months ago
- Airflow on Kubernetes Operator☆89Updated 2 years ago
- pg2kinesis uses logical decoding in Postgres 9.4 or later to capture a consistent, continuous stream of events from the database and publ…☆59Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 2 weeks ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year