realtimedatalake / rtdl
rtdl makes it easy to build and maintain a real-time data lake
☆45Updated 2 years ago
Alternatives and similar repositories for rtdl
Users who are interested in rtdl are comparing it to the libraries listed below
- Multi-hop declarative data pipelines☆115Updated this week
- Demos of Materialize, the operational data warehouse.☆51Updated 2 months ago
- A dbt adapter for Decodable☆12Updated 2 months ago
- Apache Hive Metastore in Standalone Mode With Docker☆13Updated 9 months ago
- An open-source, community-driven REST catalog for Apache Iceberg!☆27Updated 10 months ago
- ☆19Updated 10 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆61Updated 2 years ago
- A curated list of awesome lakehouse frameworks, applications, etc.☆28Updated 2 months ago
- In-Memory Analytics for Kafka using DuckDB☆122Updated this week
- Generated Kafka protocol implementations☆32Updated 3 weeks ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated 2 weeks ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆161Updated 5 months ago
- A BYOC option for Snowflake workloads☆63Updated this week
- This repository contains a recipe for bootstrapping a climate analysis application using Apache Pinot and Superset☆20Updated 4 years ago
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆52Updated last month
- Data Streaming Framework to build data APIs, data lakes, and LLM tooling with SQL.☆105Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 5 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- dbt adapter for Rockset☆16Updated 11 months ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- dbt's adapter for Dremio☆48Updated 2 years ago
- Use dbt to manage real-time data transformations in RisingWave.☆25Updated last week
- A Minimalistic Rust Implementation of Delta Sharing Server.☆90Updated 2 months ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆44Updated 2 months ago
- Ecosystem website for Apache Flink☆12Updated last year
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization s…☆54Updated last year
- Serverless multi-protocol + multi-destination event collection system.☆204Updated 5 months ago
- Self-contained demo using Kafka, Materialize and Metabase to check what's streaming on Twitch. All you need is Docker and Twitch access t…☆24Updated 3 years ago
- Apache Iceberg Spark S3 examples☆20Updated last year