cloudera-labs / SparkOnHBaseLinks
SparkOnHBase
☆279Updated 4 years ago
Alternatives and similar repositories for SparkOnHBase
Users that are interested in SparkOnHBase are comparing it to the libraries listed below
Sorting:
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆316Updated 3 years ago
- Connect Spark to HBase for reading and writing data with ease☆295Updated 8 years ago
- Spark RDD to read, write and delete from HBase☆274Updated 4 years ago
- High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…☆634Updated 3 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆551Updated 4 years ago
- ☆243Updated 7 years ago
- Spark Streaming HBase Example☆95Updated 9 years ago
- Write your Spark data to Kafka seamlessly☆174Updated last year
- DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management☆60Updated 9 years ago
- Learning to write Spark examples☆160Updated 11 years ago
- Mirror of Apache Bahir☆335Updated 2 years ago
- Fluent client for interacting with Spark Standalone Mode's Rest API for submitting, killing and monitoring the state of jobs.☆111Updated 7 years ago
- Facebook's Hive UDFs☆276Updated last week
- A Maven-based example of using Cloudera Impala's JDBC driver☆118Updated 9 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Updated 8 years ago
- Cloudera Manager Extensibility Tools and Documentation.☆192Updated 2 years ago
- Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.☆446Updated 3 months ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆216Updated 9 years ago
- Scala examples for learning to use Spark☆445Updated 5 years ago
- Plugins for Azkaban.☆130Updated 7 years ago
- spark + drools☆103Updated 3 years ago
- Apache HBase Connectors☆245Updated 3 months ago
- ☆236Updated 3 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Updated 8 years ago
- Remedy small files by combining them into larger ones.☆194Updated 3 years ago
- ☆240Updated 4 years ago
- Hadoop Job for schemaless incremental loading of messages from Kafka topics onto hdfs with configurable output partitioning.☆90Updated 9 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆281Updated 7 years ago
- Mirror of Apache Atlas (Incubating)☆95Updated 2 years ago
- REST job server for Spark. Note that this is *not* the mainline open source version. For that, go to https://github.com/spark-jobserver…☆345Updated 8 years ago