saikrishnapujari / Spark-Drools-Integration
☆23Updated 5 years ago
Alternatives and similar repositories for Spark-Drools-Integration:
Users that are interested in Spark-Drools-Integration are comparing it to the libraries listed below
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- spark-drools tutorials☆16Updated 9 months ago
- ☆39Updated 5 years ago
- Apache Spark ETL Utilities☆40Updated 2 months ago
- A modern real-time streaming application serving as a reference framework for developing a big data pipeline, complete with a broad range…☆41Updated 4 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆49Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆25Updated 5 years ago
- Spark cloud integration: tests, cloud committers and more☆19Updated 10 months ago
- This project describes how to write full ETL data pipeline using spark.☆15Updated 2 years ago
- Library for generating avro schema files (.avsc) based on DB tables structure☆50Updated last month
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Updated 5 years ago
- Kafka-Connect SMT (Single Message Transformations) with SQL syntax (Using Apache Calcite for the SQL parsing)☆32Updated 4 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆44Updated 9 months ago
- This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.☆41Updated 8 years ago
- Kafka Examples repository.☆43Updated 5 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Updated 5 years ago
- ☆13Updated 2 months ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆86Updated 10 months ago
- Flink Examples☆39Updated 8 years ago
- Code snippets used in demos recorded for the blog.☆29Updated this week
- Custom state store providers for Apache Spark☆92Updated 2 years ago
- Cloud-based SQL engine using SPARK where data is accessible as JDBC/ODBC data source via Spark ThriftServer.☆31Updated 7 years ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Updated 4 years ago
- ☆48Updated 4 years ago
- These are some code examples☆55Updated 5 years ago