implydata / learn-druidLinks
Learn the basics of Apache Druid® from leaders in the community with these notebooks and useful tools.
☆59Updated 2 months ago
Alternatives and similar repositories for learn-druid
Users that are interested in learn-druid are comparing it to the libraries listed below
Sorting:
- ☆110Updated last year
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆188Updated 2 months ago
- ☆270Updated last year
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆169Updated 4 months ago
- Delta Lake examples☆238Updated last year
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Updated 9 months ago
- A curated list of Apache Flink learning resources☆125Updated last year
- One bite-sized tip or trick for Apache Flink practitioners every day leading up to Christmas Eve 2024.☆27Updated last year
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆298Updated last week
- Open Control Plane for Tables in Data Lakehouse☆380Updated this week
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 3 years ago
- Drop-in replacement for Apache Spark UI☆401Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆253Updated 3 weeks ago
- Apache Hive Metastore as a Standalone server in Docker☆80Updated last year
- 🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka☆345Updated last month
- ☆60Updated last year
- ☆65Updated last year
- Spark runtime on AWS Lambda☆113Updated 5 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆197Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆147Updated 2 weeks ago
- Turn XML into Avro and vice versa.☆20Updated last week
- Multi-hop declarative data pipelines☆124Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data.☆261Updated 2 years ago
- Java SDK for the Snowflake Ingest Service -☆79Updated 3 weeks ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆226Updated last week
- ☆243Updated this week
- Snowflake Kafka Connector (Sink Connector)☆161Updated last week
- Data Product Portal created by Dataminded☆198Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,413Updated this week
- A Python package to submit and manage Apache Spark applications on Kubernetes.☆46Updated 6 months ago