realtimedatalake / rtdl
rtdl makes it easy to build and maintain a real-time data lake
☆45Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for rtdl
- Multi-hop declarative data pipelines☆91Updated 2 weeks ago
- In-Memory Analytics for Kafka using DuckDB☆79Updated this week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆148Updated 2 weeks ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆61Updated this week
- A Minimalistic Rust Implementation of Delta Sharing Server.☆81Updated 3 months ago
- An open-source, community-driven REST catalog for Apache Iceberg!☆25Updated 4 months ago
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆57Updated this week
- a curated list of awesome lakehouse frameworks, applications, etc☆17Updated 3 months ago
- Dione - a Spark and HDFS indexing library☆50Updated 8 months ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 6 months ago
- Query Snowflake tables locally with DuckDB, without any need for a running warehouse☆101Updated this week
- Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more.☆97Updated this week
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆62Updated 2 years ago
- dbt ksqlDB adapter☆27Updated 2 years ago
- sql-logic-test☆60Updated last year
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆307Updated last year
- ☆49Updated 8 months ago
- ☆18Updated last year
- Tektite DB☆180Updated this week
- Weekly Data Engineering Newsletter☆93Updated 4 months ago
- Work with your web service, database, and streaming schemas in a single format.☆332Updated 7 months ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆37Updated 3 months ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆58Updated last year
- ☆22Updated 5 years ago
- Demos of Materialize, the operational data warehouse.☆50Updated 2 months ago
- Generated Kafka protocol implementations☆28Updated last week
- Serverless multi-protocol + multi-destination event collection system.☆195Updated last month
- ☆33Updated last year