charlesb / CDF-workshopLinks
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration and Superset. This workshop will also cover steps to remotely manage MiNiFi to send data to NiFi using Edge Flow Manager (EFM).
☆19Updated 5 years ago
Alternatives and similar repositories for CDF-workshop
Users that are interested in CDF-workshop are comparing it to the libraries listed below
Sorting:
- ☆27Updated last year
- Edge2AI Workshop☆70Updated 2 weeks ago
- HDF masterclass materials☆28Updated 9 years ago
- ☆32Updated 6 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- CDF Tech Bootcamp☆9Updated 5 years ago
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 4 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 2 years ago
- Examples for High Performance Spark☆16Updated 7 months ago
- Memory / Configuration Calculator for Hive LLAP☆14Updated 4 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆64Updated last year
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- ☆16Updated 4 years ago
- MapReduce performance testing using teragen and terasort☆18Updated 3 years ago
- Code snippets used in demos recorded for the blog.☆37Updated last week
- Single view demo☆14Updated 9 years ago
- A Spark datasource for the HadoopOffice library☆38Updated 2 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Updated 5 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 4 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 7 years ago
- This project describes how to write full ETL data pipeline using spark.☆15Updated 2 years ago
- ☆10Updated 3 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- Terraform / NiFi on the Google Cloud Platform☆28Updated 7 months ago
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆25Updated 5 years ago
- Materials for various Hadoop & Nifi related workshops☆19Updated 3 years ago