realtimedatalake / rtdlLinks
rtdl makes it easy to build and maintain a real-time data lake
☆45Updated 2 years ago
Alternatives and similar repositories for rtdl
Users that are interested in rtdl are comparing it to the libraries listed below
Sorting:
- Multi-hop declarative data pipelines☆120Updated 2 weeks ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆165Updated 3 weeks ago
- In-Memory Analytics for Kafka using DuckDB☆138Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated 2 weeks ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- Data Pipeline Automation Framework to build MCP servers, data APIs, and data lakes with SQL.☆133Updated last week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆76Updated this week
- A Table format agnostic data sharing framework☆39Updated last year
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆159Updated 2 years ago
- Work with your web service, database, and streaming schemas in a single format.☆343Updated last month
- Demos of Materialize, the operational data warehouse.☆51Updated 7 months ago
- ☆19Updated last year
- Tektite DB☆184Updated 7 months ago
- Serverless multi-protocol + multi-destination event collection system.☆209Updated 10 months ago
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆176Updated this week
- A BYOC option for Snowflake workloads☆101Updated this week
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆35Updated 2 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Open Control Plane for Tables in Data Lakehouse☆370Updated last week
- Query Plan Markup Language☆45Updated last year
- ☆107Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated 2 years ago
- Iceberg Playground in a Box☆67Updated 3 months ago
- Apache iceberg Spark s3 examples☆20Updated last year
- A leightweight UI for Lakekeeper☆15Updated this week
- Analytics Accelerator Library for Amazon S3 is an open source library that accelerates data access from client applications to Amazon S3.☆55Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆63Updated last week
- Generated Kafka protocol implementations☆33Updated last week
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆95Updated 2 years ago