estuary / flow
Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow.
☆638 · Updated this week
Related projects
Alternatives and complementary repositories for flow
- GlareDB: An analytics DBMS for distributed data · ☆706 · Updated this week
- Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake… · ☆601 · Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg · ☆307 · Updated last year
- Analytical database for data-driven Web applications 🪶 · ☆436 · Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics · ☆1,042 · Updated this week
- Open Control Plane for Tables in Data Lakehouse · ☆312 · Updated this week
- MetricFlow allows you to define, build, and maintain metrics in code. · ☆1,148 · Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake · ☆370 · Updated last week
- The Feldera Incremental Computation Engine · ☆796 · Updated this week
- The metrics layer for your data. Join us at https://metriql.com/slack · ☆298 · Updated last year
- Open source data observability platform · ☆320 · Updated 2 years ago
- Build platforms that flexibly mix SQL, batch, and stream processing paradigms · ☆719 · Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr… · ☆1,850 · Updated this week
- Iceberg/Delta Columnstore Table in Postgres · ☆224 · Updated this week
- Work with your web service, database, and streaming schemas in a single format. · ☆333 · Updated 7 months ago
- LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads. · ☆447 · Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. · ☆248 · Updated 11 months ago
- Efficient data transformation and modeling framework that is backwards compatible with dbt. · ☆1,827 · Updated this week
- Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database. · ☆443 · Updated this week
- Serverless multi-protocol + multi-destination event collection system. · ☆196 · Updated last month
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) · ☆924 · Updated this week
- DuckDB-powered analytics for Postgres · ☆382 · Updated this week
- Conduit streams data between data stores. Kafka Connect replacement. No JVM required. · ☆400 · Updated this week
- Apache PyIceberg · ☆476 · Updated this week
- ☆295 · Updated this week
- Apache DataFusion Ballista Distributed Query Engine · ☆1,549 · Updated this week
- High-performance diffing of large datasets across databases · ☆368 · Updated last month
- Apache DataFusion Comet Spark Accelerator · ☆823 · Updated this week
- Fast SQL formatter/linter · ☆393 · Updated this week
- Dagster Labs' open-source data platform, built with Dagster. · ☆286 · Updated this week