realtimedatalake / rtdl
rtdl makes it easy to build and maintain a real-time data lake.
☆45 · Updated 2 years ago
Alternatives and similar repositories for rtdl
Users who are interested in rtdl are comparing it to the libraries listed below.
- Multi-hop declarative data pipelines ☆118 · Updated this week
- In-Memory Analytics for Kafka using DuckDB ☆137 · Updated this week
- Data Pipeline Automation Framework to build MCP servers, data APIs, and data lakes with SQL. ☆130 · Updated this week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆165 · Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 · Updated last week
- Tektite DB ☆184 · Updated 6 months ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆76 · Updated last week
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. ☆64 · Updated last year
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆60 · Updated 2 years ago
- An open-source, community-driven REST catalog for Apache Iceberg! ☆29 · Updated last year
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type… ☆59 · Updated this week
- Demos of Materialize, the operational data warehouse. ☆51 · Updated 6 months ago
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa… ☆175 · Updated 2 weeks ago
- A BYOC option for Snowflake workloads ☆96 · Updated this week
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs … ☆159 · Updated 2 years ago
- Idempotent query executor ☆53 · Updated 4 months ago
- Apache iceberg Spark s3 examples ☆20 · Updated last year
- Generated Kafka protocol implementations ☆33 · Updated 2 weeks ago
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆35 · Updated 2 years ago
- Work with your web service, database, and streaming schemas in a single format. ☆343 · Updated last week
- Open Control Plane for Tables in Data Lakehouse ☆370 · Updated this week
- MemQ is an efficient, scalable cloud native PubSub system ☆138 · Updated 2 weeks ago
- A dbt adapter for Decodable ☆12 · Updated 2 weeks ago
- Use SQL to build ELT pipelines on a data lakehouse. ☆288 · Updated 3 years ago
- Python package for querying iceberg data through duckdb. ☆70 · Updated last year
- A Minimalistic Rust Implementation of Delta Sharing Server. ☆92 · Updated 6 months ago
- sql-logic-test ☆64 · Updated 2 years ago
- A lightweight UI for Lakekeeper ☆15 · Updated this week