charlesb / CDF-workshop
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration and Superset. This workshop will also cover steps to remotely manage MiNiFi to send data to NiFi using Edge Flow Manager (EFM).
☆20 · Updated 5 years ago
Alternatives and similar repositories for CDF-workshop:
Users who are interested in CDF-workshop are comparing it to the repositories listed below:
- ☆27 · Updated last year
- HDF masterclass materials · ☆28 · Updated 9 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark · ☆16 · Updated last year
- ☆10 · Updated 2 years ago
- Code snippets used in demos recorded for the blog. · ☆34 · Updated last week
- Sample processing code using Spark 2.1+ and Scala · ☆51 · Updated 4 years ago
- Edge2AI Workshop · ☆69 · Updated 3 months ago
- Apache Spark-based Data Flow (ETL) Framework which supports multiple read, write destinations of different types and also supports multiple… · ☆26 · Updated 3 years ago
- Examples for High Performance Spark · ☆15 · Updated 5 months ago
- ☆32 · Updated 6 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi · ☆62 · Updated last year
- How to manage Slowly Changing Dimensions with Apache Hive · ☆55 · Updated 5 years ago
- Materials for various Hadoop & Nifi related workshops · ☆52 · Updated 6 years ago
- Spark and Delta Lake Workshop · ☆22 · Updated 2 years ago
- Hadoop Data Pipeline using Falcon · ☆15 · Updated 8 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... · ☆75 · Updated this week
- A Spark datasource for the HadoopOffice library · ☆38 · Updated 2 years ago
- CDF Tech Bootcamp · ☆9 · Updated 5 years ago
- Terraform / NiFi on the Google Cloud Platform · ☆28 · Updated 5 months ago
- Collection of examples integrating NiFi with stream process frameworks. · ☆59 · Updated 8 years ago
- ☆16 · Updated 4 years ago
- A bridge to Apache Atlas for provenance metadata created in the course of using Apache NiFi · ☆15 · Updated 2 years ago
- ☆26 · Updated 4 years ago
- TPCDS benchmark for various engines · ☆18 · Updated 3 years ago
- Yet Another (Spark) ETL Framework · ☆20 · Updated last year
- This repository is to help with the Partner Demonstration of the Apache Atlas project. · ☆30 · Updated 9 years ago
- Rocksdb state storage implementation for Structured Streaming. · ☆17 · Updated 4 years ago
- The Internals of Spark on Kubernetes · ☆71 · Updated 2 years ago
- ☆63 · Updated 5 years ago
- A small project to show how to add lineage to Atlas when using Spark as an ETL tool · ☆12 · Updated 8 years ago