Quark is a data virtualization engine over analytic databases.
☆100Jul 13, 2017Updated 8 years ago
Alternatives and similar repositories for quark
Users that are interested in quark are comparing it to the libraries listed below
Sorting:
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- Plugin for Presto to allow addition of user functions easily☆118Mar 31, 2021Updated 4 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆293May 19, 2023Updated 2 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Apr 4, 2019Updated 6 years ago
- Enabling Spark Optimization through Cross-stack Monitoring and Visualization☆47Aug 23, 2017Updated 8 years ago
- ☆33Jan 13, 2019Updated 7 years ago
- Giraffa FileSystem (Slack: giraffa-fs.slack.com)☆18Mar 8, 2017Updated 9 years ago
- Example of running MDX on Druid via Mondrian and Calcite☆26Aug 3, 2016Updated 9 years ago
- Presto connector to Amazon Kinesis service.☆14Jun 28, 2019Updated 6 years ago
- Tools for AWS☆14Sep 23, 2022Updated 3 years ago
- Examples of user defined functions for Apache Drill☆18May 24, 2017Updated 8 years ago
- Sql interface to druid.☆78Dec 14, 2015Updated 10 years ago
- DataFibers Data Service☆31Feb 11, 2022Updated 4 years ago
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark☆12Mar 14, 2016Updated 10 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Jun 8, 2016Updated 9 years ago
- Fast I/O plugins for Spark☆41Dec 14, 2020Updated 5 years ago
- Email Analysis Tool based on Hadoop☆20Apr 26, 2021Updated 4 years ago
- transformpy is a Python 2/3 module for doing transforms on "streams" of data☆28Jun 20, 2017Updated 8 years ago
- Mirror of Apache MetaModel Membrane☆16Jun 4, 2019Updated 6 years ago
- RESTful Complex Event Processor powered by Kafka & Siddhi☆49Apr 9, 2025Updated 11 months ago
- A library of machine learning algorithms implemented using principles of functional programming.☆23Jan 7, 2017Updated 9 years ago
- Teiid is a data virtualization system that allows applications to use data from multiple, heterogenous data stores.☆317Jan 4, 2023Updated 3 years ago
- DEPRECATED - Moved to github.com/apache/calcite-avatica-go☆42Mar 15, 2021Updated 5 years ago
- Netezza Connector for Apache Spark☆13Sep 10, 2018Updated 7 years ago
- Apache Spark OpenCPU Executor (ROSE)☆26Jun 16, 2018Updated 7 years ago
- Flow Arch(流式架构)/Reactive Programming(RP/反应式编程) 实践☆12Dec 18, 2018Updated 7 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Nov 29, 2016Updated 9 years ago
- Random implementation notes☆34Apr 23, 2013Updated 12 years ago
- Recipes and examples for Apache Spark☆13Jan 21, 2015Updated 11 years ago
- ☆13Mar 2, 2018Updated 8 years ago
- ☆57Mar 27, 2019Updated 6 years ago
- Dremio - the missing link in modern data☆1,475Sep 26, 2025Updated 5 months ago
- JVM integration for Weld☆16Sep 24, 2018Updated 7 years ago
- Mirror of Apache Calcite☆13Mar 6, 2026Updated 2 weeks ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store☆17Oct 20, 2022Updated 3 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- An app built on Cloudera Enterprise for tracking metrics of jobs that run in YARN framework☆13Feb 5, 2016Updated 10 years ago
- A demo repository for "streaming etl" with Apache Flink☆44Jun 8, 2016Updated 9 years ago