cloudera-labs/SparkOnHBase

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cloudera-labs/SparkOnHBase)

cloudera-labs / SparkOnHBase

SparkOnHBase

☆278

Alternatives and similar repositories for SparkOnHBase

Users that are interested in SparkOnHBase are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nerdammer / spark-hbase-connector
View on GitHub
Connect Spark to HBase for reading and writing data with ease
☆296Dec 19, 2017Updated 8 years ago
hbase-rdd / hbase-rdd
View on GitHub
Spark RDD to read, write and delete from HBase
☆275Jan 22, 2021Updated 5 years ago
Huawei-Spark / Spark-SQL-on-HBase
View on GitHub
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
☆316Apr 12, 2022Updated 4 years ago
hortonworks-spark / shc
View on GitHub
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
☆546May 10, 2021Updated 5 years ago
caroljmcdonald / SparkStreamingHBaseExample
View on GitHub
Spark Streaming HBase Example
☆94Apr 4, 2016Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OopsOutOfMemory / spark-sql-hbase
View on GitHub
A Spark SQL HBase connector
☆29May 4, 2015Updated 11 years ago
IBM / sparksql-for-hbase
View on GitHub
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers
☆69Sep 17, 2025Updated 10 months ago
hbase-rdd / hbase-rdd-examples
View on GitHub
HBase RDD example project
☆19Jan 22, 2021Updated 5 years ago
dibbhatt / kafka-spark-consumer
View on GitHub
High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…
☆632Apr 24, 2026Updated 3 months ago
zaratsian / SparkHBaseExample
View on GitHub
Spark code to analyze HBase Snapshots
☆36Feb 19, 2018Updated 8 years ago
apache / hbase-connectors
View on GitHub
Apache HBase Connectors
☆246Jul 13, 2026Updated last week
databricks / spark-avro
View on GitHub
Avro Data Source for Apache Spark
☆537Dec 19, 2018Updated 7 years ago
tresata / spark-kafka
View on GitHub
Low level integration of Spark and Kafka
☆129Mar 15, 2018Updated 8 years ago
LinMingQiang / sparkstreaming
View on GitHub
封装sparkstreaming动态调节batch time(有数据就执行计算)；支持运行过程中增删topic；封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
☆181Apr 15, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xlturing / spark-journey
View on GitHub
spark实例代码
☆78Nov 11, 2017Updated 8 years ago
cpbaranwal / Avro-SparkStreaming-Kafka
View on GitHub
Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Sep 9, 2016Updated 9 years ago
spark-jobserver / spark-jobserver
View on GitHub
REST job server for Apache Spark
☆2,837Mar 3, 2026Updated 4 months ago
sryza / spark-timeseries
View on GitHub
A library for time series analysis on Apache Spark
☆1,197Oct 13, 2020Updated 5 years ago
tresata / spark-scalding
View on GitHub
Use Cascading Taps and Scalding DSL with Spark
☆49Dec 28, 2016Updated 9 years ago
RedisLabs / spark-redis
View on GitHub
A connector for Spark that allows reading and writing to/from Redis cluster
☆947Oct 22, 2024Updated last year
lw-lin / CoolplaySpark
View on GitHub
酷玩 Spark: Spark 源代码解析、Spark 类库等
☆3,475May 18, 2022Updated 4 years ago
cloudera / livy
View on GitHub
Livy is an open source REST interface for interacting with Apache Spark from anywhere
☆1,007Oct 5, 2022Updated 3 years ago
apache / hbase
View on GitHub
Apache HBase
☆5,551Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
h2oai / sparkling-water
View on GitHub
Sparkling Water provides H2O functionality inside Spark cluster
☆979Nov 5, 2025Updated 8 months ago
elastic / elasticsearch-hadoop
View on GitHub
Elasticsearch real-time search and analytics natively integrated with Hadoop
☆1,975Updated this week
lw309637554 / alicloud-hbase-spark-examples
View on GitHub
☆29Aug 2, 2018Updated 7 years ago
Gschiavon / Kafka-SparkStreaming-HDFS
View on GitHub
☆14Nov 3, 2016Updated 9 years ago
zaratsian / SparkPhoenix
View on GitHub
Spark Example using Phoenix to interact with HBase
☆16Nov 2, 2016Updated 9 years ago
Stratio / sparta
View on GitHub
Real Time Analytics and Data Pipelines based on Spark Streaming
☆530Oct 24, 2019Updated 6 years ago
collectivemedia / spark-ext
View on GitHub
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
☆145Jan 26, 2016Updated 10 years ago
apache / phoenix
View on GitHub
Apache Phoenix
☆1,060Updated this week
mvalleavila / Kafka-Spark-Hbase-Example
View on GitHub
☆40Aug 19, 2015Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
amplab / spark-indexedrdd
View on GitHub
An efficient updatable key-value store for Apache Spark
☆255Mar 11, 2017Updated 9 years ago
databricks / spark-csv
View on GitHub
CSV Data Source for Apache Spark 1.x
☆1,057Dec 13, 2018Updated 7 years ago
duhanmin / structured-streaming-Kafka2HBase
View on GitHub
Spark structured-streaming 消费kafka数据写入hbase
☆33Jan 22, 2019Updated 7 years ago
JerryLead / SparkInternals
View on GitHub
Notes talking about the design and implementation of Apache Spark
☆5,361Apr 2, 2024Updated 2 years ago
lucidworks / spark-solr
View on GitHub
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
☆445Sep 4, 2025Updated 10 months ago
2shou / HBaseObserver
View on GitHub
通过HBase Observer同步数据到ElasticSearch
☆55May 8, 2015Updated 11 years ago
fayson / cdhproject
View on GitHub
hadoop各组件使用，持续更新
☆897Jan 4, 2023Updated 3 years ago