Spark ClickHouse Connector build on DataSourceV2 API
☆218May 20, 2026Updated last week
Alternatives and similar repositories for spark-clickhouse-connector
Users that are interested in spark-clickhouse-connector are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ClickHouse Native Protocol JDBC implementation☆543Jun 22, 2025Updated 11 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆63Sep 4, 2023Updated 2 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,335May 22, 2026Updated last week
- ☆17Mar 19, 2024Updated 2 years ago
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The Clickhouse plugin for dbt (data build tool)☆345May 15, 2026Updated 2 weeks ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- ClickHouse Java Clients & JDBC Driver☆1,600May 21, 2026Updated last week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,563May 22, 2026Updated last week
- Client libraries of end users of Apache Kyuubi☆11May 15, 2026Updated 2 weeks ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16May 15, 2026Updated 2 weeks ago
- ☆14Apr 12, 2022Updated 4 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆136Mar 6, 2023Updated 3 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,047May 21, 2026Updated last week
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,137May 22, 2026Updated last week
- The most valuable time series database in the universe☆33Feb 22, 2022Updated 4 years ago
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 4 years ago
- A JDBC proxy from ClickHouse to external databases☆175Oct 9, 2025Updated 7 months ago
- A sql extension build on spark3 datasource v2 api, ex: hive v2 catalog support amoung multi clusters☆12May 7, 2022Updated 4 years ago
- Apache ORC - the smallest, fastest columnar storage for Hadoop workloads☆16May 15, 2026Updated 2 weeks ago
- Apache Iceberg☆8,898Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆262May 12, 2024Updated 2 years ago
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆48Apr 16, 2026Updated last month
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆184Apr 6, 2022Updated 4 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 4 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆903May 22, 2026Updated last week
- This is a tool which used to manage and monitor ClickHouse database☆481May 22, 2026Updated last week
- Easily load data from kafka to ClickHouse☆534Mar 20, 2026Updated 2 months ago
- Community Java bindings for https://github.com/facebookincubator/velox☆41May 21, 2026Updated last week
- Apache HBase Connectors☆245May 15, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Spline agent for Apache Spark☆202May 21, 2026Updated last week
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- Upserts, Deletes And Incremental Processing on Big Data.☆6,164May 22, 2026Updated last week
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆256Feb 21, 2023Updated 3 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆131Dec 19, 2024Updated last year
- Testing Sandbox for Hadoop Ecosystem Components☆45Apr 29, 2026Updated last month
- SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.☆9,343Updated this week