qubole/quark

Quark is a data virtualization engine over analytic databases.

☆101

Alternatives and similar repositories for quark

Users who are interested in quark are comparing it to the repositories listed below.

qubole / rubix
Cache File System optimized for columnar formats and object stores
☆188 Aug 11, 2022 Updated 3 years ago
qubole / presto-udfs
Plugin for Presto to allow addition of user functions easily
☆119 Mar 31, 2021 Updated 5 years ago
randerzander / docker-ambari
Dockerfile and artifacts for running a self-contained HDP 2.3 "cluster" in a docker container
☆10 Aug 30, 2016 Updated 9 years ago
treasure-data / prestogres
PostgreSQL protocol gateway for Presto distributed SQL query engine
☆293 May 19, 2023 Updated 3 years ago
qubole / streamx
kafka-connect-s3: Ingest data from Kafka to object stores (S3)
☆96 Apr 4, 2019 Updated 7 years ago
datafibers-community / df_data_service
DataFibers Data Service
☆31 Feb 11, 2022 Updated 4 years ago
GiraffaFS / giraffa
Giraffa FileSystem (Slack: giraffa-fs.slack.com)
☆18 Mar 8, 2017 Updated 9 years ago
qubole / presto-kinesis
Presto connector to Amazon Kinesis service.
☆14 Jun 28, 2019 Updated 7 years ago
julianhyde / druid-mdx
Example of running MDX on Druid via Mondrian and Calcite
☆26 Aug 3, 2016 Updated 9 years ago
mapr-demos / simple-drill-functions
Examples of user defined functions for Apache Drill
☆18 May 24, 2017 Updated 9 years ago
srikalyc / Sql4D
SQL interface to Druid.
☆78 Dec 14, 2015 Updated 10 years ago
sequenceiq / yarn-monitoring
Hadoop YARN monitoring with R
☆19 Sep 16, 2014 Updated 11 years ago
RunningJon / outsourcer
☆24 Feb 4, 2021 Updated 5 years ago
dk-stationery / stationery-ink
Distributed SQL-based real-time streaming computation framework on Apache Storm and Spark
☆12 Mar 14, 2016 Updated 10 years ago
dryoni / aws-tools
Tools for AWS
☆14 Sep 23, 2022 Updated 3 years ago
prestodb / benchto
Framework for running macro benchmarks in a clustered environment
☆25 Aug 29, 2022 Updated 3 years ago
LinkedInAttic / kamikaze
DocId set compression and set operation library
☆22 Mar 7, 2014 Updated 12 years ago
Impetus / jumbune
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:…
☆73 Jan 1, 2023 Updated 3 years ago
airbnb / transformpy
transformpy is a Python 2/3 module for doing transforms on "streams" of data
☆28 Jun 20, 2017 Updated 9 years ago
julianhyde / foodmart-data-hsqldb
Foodmart data set in hsqldb format
☆26 Oct 19, 2025 Updated 9 months ago
zrlio / crail-spark-io
Fast I/O plugins for Spark
☆41 Dec 14, 2020 Updated 5 years ago
milinda / samza-sql
SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka
☆30 Jun 8, 2016 Updated 10 years ago
teiid / teiid
Teiid is a data virtualization system that allows applications to use data from multiple, heterogeneous data stores.
☆319 Jan 4, 2023 Updated 3 years ago
malcolmgreaves / fp4ml
A library of machine learning algorithms implemented using principles of functional programming.
☆23 Jan 7, 2017 Updated 9 years ago
CODAIT / spark-netezza
Netezza Connector for Apache Spark
☆13 Sep 10, 2018 Updated 7 years ago
apache / metamodel-membrane
Mirror of Apache MetaModel Membrane
☆16 Jun 4, 2019 Updated 7 years ago
zhihuili / rp-pratice
Flow Arch (streaming architecture) / Reactive Programming (RP) in practice
☆12 Dec 18, 2018 Updated 7 years ago
onetapbeyond / opencpu-spark-executor
Apache Spark OpenCPU Executor (ROSE)
☆25 Jun 16, 2018 Updated 8 years ago
redBorder / cep
RESTful Complex Event Processor powered by Kafka & Siddhi
☆50 Apr 9, 2025 Updated last year
bernhard-42 / Spark-ETL-Atlas
A small project to show how to add lineage to Atlas when using Spark as ETL tool
☆12 Nov 29, 2016 Updated 9 years ago
tzolov / calcite-sql-rewriter
JDBC driver that converts any INSERT, UPDATE and DELETE statements into append-only INSERTs. Instead of updating rows in-place it inserts…
☆84 Mar 27, 2017 Updated 9 years ago
calrissian / spark-jetty-server
Recipes and examples for Apache Spark
☆13 Jan 21, 2015 Updated 11 years ago
Impetus / ankush
A big data cluster management tool that creates and manages clusters of different technologies.
☆21 Apr 20, 2015 Updated 11 years ago
t3rmin4t0r / notes
Random implementation notes
☆34 Apr 23, 2013 Updated 13 years ago
cloudera / kitten
The fast and fun way to write YARN applications.
☆136 Nov 14, 2018 Updated 7 years ago
prestodb / presto-yarn
☆58 Mar 27, 2019 Updated 7 years ago
xavient / Data-Ingestion-Platform
☆51 Jun 30, 2026 Updated 2 weeks ago
yeleid / eagleeye
An app built on Cloudera Enterprise for tracking metrics of jobs that run in YARN framework
☆13 Feb 5, 2016 Updated 10 years ago
rmetzger / flink-streaming-etl
A demo repository for "streaming etl" with Apache Flink
☆44 Jun 8, 2016 Updated 10 years ago