gschmutz / various-demos
Various Demos mostly based on docker environments
☆34Updated 2 years ago
Alternatives and similar repositories for various-demos:
Users that are interested in various-demos are comparing it to the libraries listed below
- ☆47Updated 6 months ago
- Real-world Spark pipelines examples☆83Updated 6 years ago
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- Code snippets used in demos recorded for the blog.☆29Updated last week
- This project describes how to write full ETL data pipeline using spark.☆15Updated 2 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Kafka Examples repository.☆44Updated 6 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- ☆63Updated 5 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆118Updated last week
- A repository containing materials for Stateful Functions workshop☆44Updated last year
- ☆81Updated last year
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Code and presentation for Strata Model Serving tutorial☆68Updated 5 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago
- These are some code examples☆55Updated 5 years ago
- spark on kubernetes☆105Updated 2 years ago
- Apache Flink (Pyflink) and Related Projects☆30Updated 8 months ago
- A proof of concept using Divolte, Kafka, Druid and Superset☆62Updated 4 years ago
- Data validation library for PySpark 3.0.0☆34Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated last week
- Flowchart for debugging Spark applications☆104Updated 4 months ago
- Spark with Scala example projects☆34Updated 5 years ago
- Examples for High Performance Spark☆15Updated 3 months ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆66Updated 11 months ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆163Updated 3 weeks ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆75Updated 9 months ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated last month