gschmutz / various-demos
Various Demos mostly based on docker environments
☆33Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for various-demos
- These are some code examples☆55Updated 4 years ago
- Presentations and other resources.☆31Updated 4 years ago
- A proof of concept using Divolte, Kafka, Druid and Superset☆61Updated 4 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆111Updated this week
- Real-time anomaly detection using Kafka, KSQL User Defined Function and a pre-trained model☆30Updated 11 months ago
- Kafka Examples repository.☆43Updated 5 years ago
- Use Kafka and Apache Spark streaming to perform click stream analytics☆76Updated 4 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- ☆81Updated last year
- ❤for real-time DataOps - where the application and data fabric blends - Lenses☆154Updated 2 weeks ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆75Updated 6 months ago
- Code snippets used in demos recorded for the blog.☆29Updated last month
- Code and presentation for Strata Model Serving tutorial☆69Updated 5 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Updated 2 months ago
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Graph Analytics with Apache Kafka☆101Updated last week
- Schema Registry integration for Apache Spark☆39Updated 2 years ago
- Data validation library for PySpark 3.0.0☆34Updated 2 years ago
- ☆63Updated 5 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆48Updated 10 months ago
- Kubeflow example of machine learning/model serving☆35Updated 4 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆86Updated 8 months ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆60Updated 2 months ago
- JSON schema parser for Apache Spark☆81Updated 2 years ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆44Updated 8 months ago
- This repository contains a recipe for bootstrapping a climate analysis application using Apache Pinot and Superset☆20Updated 4 years ago
- plumber helps you tame NiFi flow☆45Updated last year
- Spark to Tableau Extractor library☆18Updated 7 years ago