cloudera-labs / SparkOnHBase
SparkOnHBase
☆279Updated 4 years ago
Alternatives and similar repositories for SparkOnHBase
Users who are interested in SparkOnHBase are comparing it to the libraries listed below.
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆318Updated 3 years ago
- Connect Spark to HBase for reading and writing data with ease☆297Updated 7 years ago
- Spark RDD to read, write and delete from HBase☆276Updated 4 years ago
- High Performance Kafka Connector for Spark Streaming. Supports Multi Topic Fetch, Kafka Security. Reliable offset management in ZooKeeper.…☆634Updated 3 years ago
- ☆243Updated 7 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆551Updated 4 years ago
- Spark Streaming HBase Example☆96Updated 9 years ago
- Write your Spark data to Kafka seamlessly☆174Updated last year
- Mirror of Apache Bahir☆335Updated 2 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Updated 8 years ago
- DirectKafka examples for Spark Streaming: 1. with checkpointing 2. custom offset management☆60Updated 9 years ago
- Learning to write Spark examples☆160Updated 11 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit…☆282Updated 7 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆216Updated 9 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- Cloudera Manager Extensibility Tools and Documentation.☆190Updated last year
- Apache HBase Connectors☆243Updated last month
- Scala examples for learning to use Spark☆445Updated 5 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa…☆178Updated 3 years ago
- The Internals of Spark Structured Streaming☆420Updated 2 years ago
- Facebook's Hive UDFs☆276Updated 3 weeks ago
- Mirror of Apache Atlas (Incubating)☆95Updated 2 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Updated 2 years ago
- ☆240Updated 4 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech).☆152Updated 2 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Updated 8 years ago
- spark + drools☆103Updated 3 years ago
- Spark code to analyze HBase Snapshots☆35Updated 7 years ago
- A Maven-based example of using Cloudera Impala's JDBC driver☆118Updated 9 years ago
- Hadoop Job for schemaless incremental loading of messages from Kafka topics onto hdfs with configurable output partitioning.☆90Updated 9 years ago