stefcorda / sparkStreamingETLLinks
Project to create configurable ETL via lightbend configuration using Spark Structured Streaming
☆8Updated 7 years ago
Alternatives and similar repositories for sparkStreamingETL
Users that are interested in sparkStreamingETL are comparing it to the libraries listed below
Sorting:
- A sink to save Spark Structured Streaming DataFrame into Hive table☆31Updated 7 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆184Updated 2 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆62Updated 5 years ago
- DataQuality for BigData☆144Updated last year
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 5 years ago
- Apache Spark ETL Utilities☆40Updated 9 months ago
- Spark structured streaming examples with using of version 3.5.1☆26Updated last year
- Spark Structured Streaming JDBC Sink☆16Updated 4 years ago
- Custom state store providers for Apache Spark☆92Updated 5 months ago
- High performance HBase / Spark SQL engine☆28Updated 3 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆160Updated 2 years ago
- Code for processing AVRO data in Spark Streaming + Kafka (DirectKafka approach with custom offset management in zookeeper)☆29Updated 8 years ago
- This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.☆41Updated 8 years ago
- spark structured streaming via HTTP communication☆18Updated 3 years ago
- A library for querying Druid data sources with Apache Spark☆23Updated 4 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 7 years ago
- Examples of Spark 2.0☆212Updated 4 years ago
- Kafka stream for Spark with storage of the offsets in ZooKeeper☆60Updated 8 years ago
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35Updated last month
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 4 years ago
- Self-contained examples of Apache Spark streaming integrated with Apache Kafka.☆199Updated 7 years ago
- Developing Spark External Data Sources using the V2 API☆48Updated 7 years ago
- Alerting and monitoring tool for Apache Spark☆23Updated 3 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆178Updated 3 years ago
- Java library to integrate Flink and Kudu☆54Updated 8 years ago
- ☆103Updated 5 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆89Updated last year
- Spark code to analyze HBase Snapshots☆35Updated 7 years ago
- Write your Spark data to Kafka seamlessly☆174Updated last year