ClickHouse/spark-clickhouse-connector

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ClickHouse/spark-clickhouse-connector)

ClickHouse / spark-clickhouse-connector

Spark ClickHouse Connector build on DataSourceV2 API

☆217

Alternatives and similar repositories for spark-clickhouse-connector

Users that are interested in spark-clickhouse-connector are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

housepower / ClickHouse-Native-JDBC
View on GitHub
ClickHouse Native Protocol JDBC implementation
☆542Jun 22, 2025Updated last year
yaooqinn / itachi
View on GitHub
A library that brings useful functions from various modern database management systems to Apache Spark
☆63Sep 4, 2023Updated 2 years ago
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
wankunde / sql-runner
View on GitHub
☆17Mar 19, 2024Updated 2 years ago
jdbcx / jdbcx
View on GitHub
JDBCX: Extended JDBC driver for dynamic multi-language queries with optional bridge server for federated datasource connectivity.
☆32Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
oracle / spark-oracle
View on GitHub
On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.
☆36Apr 15, 2025Updated last year
ClickHouse / dbt-clickhouse
View on GitHub
The Clickhouse plugin for dbt (data build tool)
☆355Jul 22, 2026Updated last week
CoxAutomotiveDataSolutions / spark-distcp
View on GitHub
A re-implementation of Hadoop DistCP in Apache Spark
☆47Dec 20, 2023Updated 2 years ago
ClickHouse / clickhouse-java
View on GitHub
ClickHouse Java Clients & JDBC Driver
☆1,610Updated this week
apache / gluten
View on GitHub
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,578Updated this week
apache / kyuubi-client
View on GitHub
Client libraries of end users of Apache Kyuubi
☆11May 15, 2026Updated 2 months ago
apache / kyuubi-docker
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆16May 22, 2026Updated 2 months ago
fansy1990 / javaweb_spark_standalone_monitor
View on GitHub
☆14Apr 12, 2022Updated 4 years ago
alibaba / SparkCube
View on GitHub
SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.
☆136Mar 6, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Tencent / Firestorm
View on GitHub
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…
☆256Apr 7, 2023Updated 3 years ago
apache / amoro
View on GitHub
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
☆1,153Updated this week
apache / celeborn
View on GitHub
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆1,059Updated this week
yaooqinn / spark-ranger
View on GitHub
已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.
☆59Nov 11, 2021Updated 4 years ago
maropu / spark-sql-flow-plugin
View on GitHub
Visualize column-level data lineage in Spark SQL
☆92May 13, 2022Updated 4 years ago
yaooqinn / spark-history-cli
View on GitHub
CLI tool for querying Apache Spark History Server REST API
☆28Mar 22, 2026Updated 4 months ago
permanentstar / spark-sql-dsv2-extension
View on GitHub
A sql extension build on spark3 datasource v2 api, ex: hive v2 catalog support amoung multi clusters
☆11May 7, 2022Updated 4 years ago
apache / orc-format
View on GitHub
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
☆16May 15, 2026Updated 2 months ago
ClickHouse / clickhouse-jdbc-bridge
View on GitHub
A JDBC proxy from ClickHouse to external databases
☆176Oct 9, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
boostscale / velox4j
View on GitHub
Community Java bindings for https://github.com/facebookincubator/velox
☆43Updated this week
apache / iceberg
View on GitHub
Apache Iceberg
☆9,085Updated this week
qwshen / spark-flight-connector
View on GitHub
A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL
☆49Jun 7, 2026Updated last month
bytedance / CloudShuffleService
View on GitHub
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
☆261May 12, 2024Updated 2 years ago
yaooqinn / spark-authorizer
View on GitHub
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…
☆183Apr 6, 2022Updated 4 years ago
NetEase / spark-alarm
View on GitHub
Alerting and monitoring tool for Apache Spark
☆23May 20, 2022Updated 4 years ago
linkedin / coral
View on GitHub
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆906Jul 20, 2026Updated last week
datablade-io / daisy
View on GitHub
The most valuable time series database in the universe
☆33Feb 22, 2022Updated 4 years ago
oap-project / sql-ds-cache
View on GitHub
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
apache / hbase-connectors
View on GitHub
Apache HBase Connectors
☆246Jul 13, 2026Updated 2 weeks ago
awesome-kyuubi / hadoop-testing
View on GitHub
Testing Sandbox for Hadoop Ecosystem Components
☆45Jun 16, 2026Updated last month
housepower / ckman
View on GitHub
This is a tool which used to manage and monitor ClickHouse database
☆485Jun 18, 2026Updated last month
AbsaOSS / spline-spark-agent
View on GitHub
Spline agent for Apache Spark
☆207Updated this week
MemVerge / splash
View on GitHub
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
☆131Dec 19, 2024Updated last year
apache / hudi
View on GitHub
Upserts, Deletes And Incremental Processing on Big Data.
☆6,197Updated this week
housepower / clickhouse_sinker
View on GitHub
Easily load data from kafka to ClickHouse
☆534Mar 20, 2026Updated 4 months ago