gschmutz / various-demos
Various demos, mostly based on Docker environments
☆33Updated 2 years ago
Alternatives and similar repositories for various-demos
Users who are interested in various-demos are comparing it to the repositories listed below
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated 2 weeks ago
- ☆61Updated last year
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated last month
- Materials (slides and code) for Kafka and Kafka Streams Workshops☆62Updated last year
- Presentations and other resources.☆36Updated 5 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆122Updated this week
- Takes a Kafka stream into Spark, applies transformations, and sinks into Druid. Everything Dockerised.☆30Updated 2 years ago
- Code and presentation for Strata Model Serving tutorial☆68Updated 6 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆59Updated 9 years ago
- Spark with Scala example projects☆34Updated 6 years ago
- ☆81Updated last year
- ❤ for real-time DataOps - where the application and data fabric blend - Lenses☆159Updated 2 weeks ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 2 months ago
- Deep Learning UDF for KSQL, the Streaming SQL Engine for Apache Kafka with Elasticsearch Sink Example☆79Updated 7 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated last week
- Real-world Spark pipelines examples☆84Updated 7 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- Egeria's Guidance on Governance as well as large media files such as presentations and movies☆106Updated 3 years ago
- Code Samples for my Ververica Webinar "99 Ways to Enrich Streaming Data with Apache Flink"☆41Updated 3 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆66Updated 2 years ago
- Big Data ETL and Utilities for Hadoop MapReduce, Spark and Storm☆103Updated last year
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 4 months ago
- A Table format agnostic data sharing framework☆41Updated last year
- ☆63Updated 5 years ago
- A hybrid Big Data pipeline architecture that combines a real-time streaming layer with a batch layer to process large datasets (Lambda Arc…☆184Updated last month
- Data quality control tool built on spark and deequ☆25Updated 7 months ago
- A K8s-based infrastructure for analytics☆24Updated 5 years ago
- Spark on Kubernetes☆104Updated 2 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago