binaryaffairs / a-la-modeLinks
A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)
☆72Updated 5 years ago
Alternatives and similar repositories for a-la-mode
Users that are interested in a-la-mode are comparing it to the libraries listed below
Sorting:
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow☆214Updated last week
- Environment, operations and runtime-meta testing tool.☆90Updated 4 years ago
- tap-postgres☆68Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 3 years ago
- A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.☆185Updated 2 weeks ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆37Updated last year
- Deploy Presto on the cloud easily, using Terraform and Packer☆45Updated 2 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated 2 years ago
- Streaming left joins in Kafka for change data capture☆52Updated last month
- The language specification of PartiQL.☆149Updated 2 years ago
- ☆45Updated 8 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Updated 2 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆76Updated 2 years ago
- Terraform provider for kafka☆32Updated 6 years ago
- A distributed graph-based platform to automatically collect, discover, explore and relate multi-cluster Kubernetes resources and metadata…☆213Updated 2 years ago
- Airflow declarative DAGs via YAML☆133Updated 2 years ago
- Kafka sink connector for streaming messages to PostgreSQL☆93Updated 5 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆134Updated 3 years ago
- Avro to JSON Schema, and back☆136Updated last year
- Bender - Serverless ETL Framework☆188Updated 2 years ago
- Continuously synchronize directories from remote object store to local filesystem☆109Updated last week
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 9 months ago
- A CLI and library to run Singer Taps and Targets☆35Updated 3 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Updated last year
- Postgresql To Kinesis For Java☆80Updated 6 months ago
- pg2kinesis uses logical decoding in Postgres 9.4 or later to capture a consistent, continuous stream of events from the database and publ…☆59Updated 7 months ago
- ☆33Updated 2 years ago
- Graph Analytics with Apache Kafka☆107Updated 2 weeks ago
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆160Updated 2 weeks ago