implydata / learn-druidLinks
Learn the basics of Apache Druid® from leaders in the community with these notebooks and useful tools.
☆59Updated 2 weeks ago
Alternatives and similar repositories for learn-druid
Users that are interested in learn-druid are comparing it to the libraries listed below
Sorting:
- ☆269Updated last year
- ☆105Updated 10 months ago
- A curated list of Apache Flink learning resources☆109Updated 10 months ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆184Updated 3 weeks ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆291Updated last week
- Delta Lake examples☆233Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆253Updated 2 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- Delta Lake Documentation☆51Updated last year
- Drop-in replacement for Apache Spark UI☆361Updated this week
- Open Control Plane for Tables in Data Lakehouse☆372Updated last week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,366Updated last week
- The Data Contract Specification Repository☆391Updated 2 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆193Updated last week
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆168Updated 2 months ago
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.☆27Updated 11 months ago
- Home of the Open Data Contract Standard (ODCS).☆592Updated last week
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆124Updated 3 weeks ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆420Updated 6 months ago
- ☆14Updated 2 years ago
- A Python package to submit and manage Apache Spark applications on Kubernetes.☆44Updated 3 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆222Updated last week
- ☆81Updated 7 months ago
- ☆237Updated this week
- Enforce Data Contracts☆741Updated last week
- Quick Guides from Dremio on Several topics☆79Updated 2 weeks ago
- Helm charts for Trino and Trino Gateway☆187Updated last week
- Template for a data contract used in a data mesh.☆484Updated last year
- A Table format agnostic data sharing framework☆42Updated last year
- Apache Hive Metastore as a Standalone server in Docker☆80Updated last year