curtishoward / spark-stream-kudu
Kafka, Spark Streaming, Kudu integration examples
☆17 Updated 7 years ago
Alternatives and similar repositories for spark-stream-kudu:
Users who are interested in spark-stream-kudu are comparing it to the libraries listed below.
- ☆8 Updated 6 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆41 Updated 7 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper) ☆29 Updated 8 years ago
- Configuration options and instructions on how to add JanusGraph to ambari as a service ☆9 Updated 7 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo ☆15 Updated last year
- Real-time analytics in Apache Flume ☆52 Updated 9 years ago
- Spark Streaming HBase Example ☆22 Updated 9 years ago
- Flink image for Kubernetes that fixes the JobManager connection issue ☆26 Updated 6 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table ☆23 Updated 6 years ago
- A Spark SQL HBase connector ☆29 Updated 9 years ago
- A small project to show how to add lineage to Atlas when using Spark as an ETL tool ☆12 Updated 8 years ago
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s… ☆13 Updated 9 years ago
- Ambari stack for easily installing and managing Redis on an HDP cluster ☆15 Updated 9 years ago
- Flink Examples ☆39 Updated 8 years ago
- Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers ☆69 Updated 2 years ago
- ElasticSearch integration for Apache Spark ☆47 Updated 8 years ago
- Spark Example using Phoenix to interact with HBase ☆16 Updated 8 years ago
- A demo repository for "streaming etl" with Apache Flink ☆44 Updated 8 years ago
- Will come later... ☆20 Updated 2 years ago
- Ambari service for Apache Drill ☆17 Updated 8 years ago
- POC for all the stack of big data (kafka, spark, cassandra, hdfs, docker, springboot) ☆12 Updated 2 years ago
- High performance HBase / Spark SQL engine ☆28 Updated 2 years ago
- This is a datasource implementation for quick query in Kafka with Spark ☆9 Updated last year
- This is a simple CEP Engine leveraging the Kafka Streams platform ☆16 Updated 7 years ago
- ☆11 Updated 9 years ago
- Custom Service for deploying Apache Alluxio on a running HDP 2.3 / IOP 4.1 Ambari Managed Cluster ☆13 Updated 8 years ago
- ☆39 Updated 6 years ago
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech). ☆13 Updated last year
- Reads a HBase table and writes it out as Text, Seq, Avro, or Parquet ☆28 Updated 10 years ago
- Ambari stack service for easily installing and managing Solr on an HDP cluster ☆19 Updated 6 years ago