binaryaffairs / a-la-modeLinks
A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)
☆72Updated 5 years ago
Alternatives and similar repositories for a-la-mode
Users that are interested in a-la-mode are comparing it to the libraries listed below
Sorting:
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆54Updated last year
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 9 months ago
- tap-postgres☆68Updated 9 months ago
- Highly configurable Helm Presto Chart☆24Updated 5 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Deploy Presto on the cloud easily, using Terraform and Packer☆45Updated 2 years ago
- Terraform provider for kafka☆32Updated 5 years ago
- Continuously synchronize directories from remote object store to local filesystem☆105Updated 3 months ago
- A CLI and library to run Singer Taps and Targets☆34Updated 3 years ago
- ☆45Updated 7 years ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon☆60Updated 4 years ago
- Environment, operations and runtime-meta testing tool.☆90Updated 4 years ago
- Reporters and collectors for use in Google Cloud Platform☆91Updated 3 months ago
- BigQuery Foreign Data Wrapper for PostgreSQL☆92Updated last year
- Ephemeral Hadoop clusters using Google Compute Platform☆135Updated 3 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- Terraform samples to sync and use IAM users ssh keys to connect to EC2 instances☆13Updated 6 years ago
- Terraform modules which create AWS resources for a Segment Data Lake.☆37Updated 5 months ago
- pg2kinesis uses logical decoding in Postgres 9.4 or later to capture a consistent, continuous stream of events from the database and publ…☆59Updated 2 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆68Updated 3 months ago
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow☆211Updated last month
- A microservice app that demonstrates the power of tilt☆50Updated 3 years ago
- Opinionated serverless event analytics pipeline☆43Updated 2 years ago
- Terraform provider for managing Kafka topics.☆28Updated 4 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated last month
- Streaming left joins in Kafka for change data capture☆52Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated last year
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆41Updated 4 months ago