implydata / druid-query-toolkitLinks
A collection of utilities for working with Druid queries
☆23Updated last month
Alternatives and similar repositories for druid-query-toolkit
Users that are interested in druid-query-toolkit are comparing it to the libraries listed below
Sorting:
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆69Updated 5 months ago
- Quix Notebook Manager☆275Updated 5 months ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Graph Analytics with Apache Kafka☆106Updated this week
- Explore Apache Kafka data pipelines in Kubernetes.☆46Updated last month
- Connects Grafana to Druid☆69Updated last week
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆158Updated 2 years ago
- Use SQL to build ELT pipelines on a data lakehouse.☆288Updated 3 years ago
- Management and automation platform for Stateful Distributed Systems☆109Updated last week
- ThirdEye is an integrated tool for realtime monitoring of time series and interactive root-cause analysis. It enables anyone inside an or…☆94Updated 2 years ago
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Updated last year
- BigQuery Google Storage Based Data Loader☆57Updated 3 months ago
- Paper: A Zero-rename committer for object stores☆20Updated 4 years ago
- Spark Connector to read and write with Pulsar☆115Updated last month
- Aiven's S3 Sink Connector for Apache Kafka®☆70Updated 11 months ago
- Docker images for Trino integration testing☆53Updated last month
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Apache datasketches☆97Updated 2 years ago
- Kubernetes Operator for the Ververica Platform☆35Updated 2 years ago
- A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.☆182Updated this week
- Kubernetes (K8s) Operator for PrestoDB☆46Updated 3 years ago
- Amundsen Gremlin☆21Updated 2 years ago
- ☆34Updated 4 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Updated 4 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- calcite-arrow-sample(WIP)☆13Updated 7 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆45Updated 3 weeks ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 8 months ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Updated 5 years ago