stefcorda / sparkStreamingETLLinks

Project to create configurable ETL via lightbend configuration using Spark Structured Streaming

☆8

Alternatives and similar repositories for sparkStreamingETL

Users that are interested in sparkStreamingETL are comparing it to the libraries listed below

Sorting:

jerryshao / spark-hive-streaming-sink
A sink to save Spark Structured Streaming DataFrame into Hive table
☆31Updated 7 years ago
polomarcus / Spark-Structured-Streaming-Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
☆184Updated 2 years ago
ansrivas / spark-structured-streaming
Spark structured streaming with Kafka data source and writing to Cassandra
☆62Updated 5 years ago
agile-lab-dev / DataQuality
DataQuality for BigData
☆144Updated last year
hortonworks-spark / spark-schema-registry
Schema Registry integration for Apache Spark
☆40Updated 2 years ago
bartosz25 / spark-scala-playground
Sample processing code using Spark 2.1+ and Scala
☆51Updated 5 years ago
mayur2810 / sope
Apache Spark ETL Utilities
☆40Updated 9 months ago
AndrewKuzmin / spark-structured-streaming-examples
Spark structured streaming examples with using of version 3.5.1
☆26Updated last year
mshtelma / spark-structured-streaming-jdbc-sink
Spark Structured Streaming JDBC Sink
☆16Updated 4 years ago
chermenin / spark-states
Custom state store providers for Apache Spark
☆92Updated 5 months ago
bomeng / Heracles
High performance HBase / Spark SQL engine
☆28Updated 3 years ago
cloudera-labs / envelope
Build configuration-driven ETL pipelines on Apache Spark
☆160Updated 2 years ago
cpbaranwal / Avro-SparkStreaming-Kafka
Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)
☆29Updated 8 years ago
NashTech-Labs / real-time-stream-processing-engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
☆41Updated 8 years ago
bluejoe2008 / spark-http-stream
spark structured streaming via HTTP communication
☆18Updated 3 years ago
SharpRay / spark-druid-connector
A library for querying Druid data sources with Apache Spark
☆23Updated 4 years ago
hortonworks-spark / spark-hive-streaming-sink
A sink to save Spark Structured Streaming DataFrame into Hive table
☆23Updated 7 years ago
phatak-dev / spark2.0-examples
Examples of Spark 2.0
☆212Updated 4 years ago
ippontech / spark-kafka-source
Kafka stream for Spark with storage of the offsets in ZooKeeper
☆60Updated 8 years ago
yaooqinn / spark-postgres
PostgreSQL and GreenPlum Data Source for Apache Spark
☆35Updated last month
qubole / spark-acid
ACID Data Source for Apache Spark based on Hive ACID
☆97Updated 4 years ago
spirom / spark-streaming-with-kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
☆199Updated 7 years ago
spirom / spark-data-sources
Developing Spark External Data Sources using the V2 API
☆48Updated 7 years ago
NetEase / spark-alarm
Alerting and monitoring tool for Apache Spark
☆23Updated 3 years ago
yaooqinn / spark-authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…
☆178Updated 3 years ago
rubencasado / Flink-Kudu
Java library to integrate Flink and Kudu
☆54Updated 8 years ago
hortonworks-spark / spark-llap
☆103Updated 5 years ago
ExpediaGroup / circus-train
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
☆89Updated last year
zaratsian / SparkHBaseExample
Spark code to analyze HBase Snapshots
☆35Updated 7 years ago
BenFradet / spark-kafka-writer
Write your Spark data to Kafka seamlessly
☆174Updated last year