realtimedatalake / rtdl
rtdl makes it easy to build and maintain a real-time data lake
☆45, updated 2 years ago
Alternatives and similar repositories for rtdl:
Users who are interested in rtdl are comparing it to the repositories listed below:
- Multi-hop declarative data pipelines (☆110, updated this week)
- In-Memory Analytics for Kafka using DuckDB (☆89, updated this week)
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. (☆28, updated last week)
- Demos of Materialize, the operational data warehouse. (☆51, updated 5 months ago)
- An open-source, community-driven REST catalog for Apache Iceberg! (☆26, updated 7 months ago)
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. (☆154, updated 2 months ago)
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. (☆79, updated this week)
- A curated list of awesome lakehouse frameworks, applications, etc. (☆21, updated 2 months ago)
- dbt adapter for Rockset (☆15, updated 8 months ago)
- Sample configuration to deploy a modern data platform. (☆88, updated 3 years ago)
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. (☆62, updated last year)
- Java implementation for performing operations on Apache Iceberg and Hive tables (☆20, updated 4 months ago)
- Generated Kafka protocol implementations (☆31, updated 2 weeks ago)
- Snowflake connector repository for the Apache Flink project (☆36, updated 2 months ago)
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! (☆62, updated 2 years ago)
- Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more. (☆102, updated this week)
- This repository contains the source code for samples featured in eventdrivenutopia.com (☆46, updated 2 years ago)
- Inspect Your Servers with DuckDB (☆30, updated last year)
- A Minimalistic Rust Implementation of Delta Sharing Server. (☆84, updated this week)
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization s… (☆54, updated last year)
- A dbt adapter for Decodable (☆12, updated this week)
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™. (☆41, updated 4 months ago)
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type… (☆42, updated this week)
- Java/Scala library for easily authoring Flyte tasks and workflows (☆43, updated last week)
- A Table format agnostic data sharing framework (☆38, updated last year)
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po… (☆65, updated 2 weeks ago)
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... (☆74, updated this week)
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB (☆31, updated 2 years ago)
- Use dbt to manage real-time data transformations in RisingWave. (☆22, updated 2 months ago)
- CLI tool to bulk migrate the tables from one catalog to another without a data copy (☆75, updated this week)