binaryaffairs / a-la-modeLinks
A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)
☆72Updated 5 years ago
Alternatives and similar repositories for a-la-mode
Users that are interested in a-la-mode are comparing it to the libraries listed below
Sorting:
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow☆214Updated last month
- Environment, operations and runtime-meta testing tool.☆90Updated 4 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated 2 years ago
- A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.☆185Updated last week
- tap-postgres☆68Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆42Updated 2 years ago
- Terraform provider for kafka☆32Updated 5 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆68Updated 2 years ago
- Kafka sink connector for streaming messages to PostgreSQL☆93Updated 5 years ago
- States Language on Cadence☆63Updated 5 years ago
- The language specification of PartiQL.☆150Updated 2 years ago
- High Efficiency Reliable Access to data stores☆300Updated 4 months ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector☆152Updated last year
- Apiary provides modules which can be combined to create a federated cloud data lake☆37Updated last year
- Pulsar Beam is a streaming service via HTTP built on Apache Pulsar.☆60Updated 3 years ago
- Flyte binds together the tools you use into easily defined, automated workflows☆88Updated last year
- Streaming left joins in Kafka for change data capture☆52Updated last year
- afctl helps to manage and deploy Apache Airflow projects faster and smoother.☆130Updated 3 years ago
- A distributed graph-based platform to automatically collect, discover, explore and relate multi-cluster Kubernetes resources and metadata…☆213Updated 2 years ago
- Run templatable playbooks of SQL scripts in series and parallel on Redshift, PostgreSQL, BigQuery and Snowflake☆81Updated 6 months ago
- ☆45Updated 7 years ago
- A Postgres Proxy to Mask Data in Realtime☆195Updated last year
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆160Updated 2 weeks ago
- Avro to JSON Schema, and back☆135Updated last year
- Deploy Presto on the cloud easily, using Terraform and Packer☆45Updated 2 years ago
- A curated list of awesome things related to the Cadence and Temporal Workflow Engines☆83Updated 4 years ago
- Bender - Serverless ETL Framework☆188Updated last year
- Functions Repository for Kubeless☆70Updated 3 years ago
- Observability for your AWS load balancers, CloudFront, and more☆51Updated last year
- Terraform provider for managing Kafka topics.☆28Updated 4 years ago