realtimedatalake / rtdl
rtdl makes it easy to build and maintain a real-time data lake
☆44 Updated last year
Related projects:
- Multi-hop declarative data pipelines ☆86 Updated last month
- Query Snowflake tables locally with DuckDB, without any need for a running warehouse ☆62 Updated 3 weeks ago
- In-Memory Analytics for Kafka using DuckDB ☆63 Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated 2 weeks ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆141 Updated 2 weeks ago
- dbt adapter for Rockset ☆15 Updated 3 months ago
- An open-source, community-driven REST catalog for Apache Iceberg! ☆24 Updated 2 months ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆51 Updated this week
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆61 Updated last year
- DuckDB for streaming data ☆62 Updated 5 months ago
- Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more. ☆90 Updated this week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. ☆55 Updated 11 months ago
- A curated list of awesome lakehouse frameworks, applications, etc. ☆16 Updated last month
- Demos of Materialize, the operational data warehouse. ☆50 Updated 2 weeks ago
- Dashboard for operating Flink jobs and deployments. ☆25 Updated 5 months ago
- dbt ksqlDB adapter ☆27 Updated 2 years ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆297 Updated last year
- A Table format agnostic data sharing framework ☆36 Updated 7 months ago
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆79 Updated last month
- Simple project to expose a catalog over REST using a Java catalog backend ☆103 Updated this week
- Mock streaming data generator ☆14 Updated 3 months ago
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type… ☆27 Updated last week
- A write-audit-publish implementation on a data lake without the JVM ☆39 Updated last month
- Cloud Storage Connector integrates Apache Pulsar with cloud storage. ☆27 Updated this week
- Inspect Your Servers with DuckDB ☆28 Updated last year
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization. ☆35 Updated last month
- Dione - a Spark and HDFS indexing library ☆49 Updated 6 months ago