charlesb / CDF-workshop
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration and Superset. This workshop will also cover steps to remotely manage MiNiFi to send data to NiFi using Edge Flow Manager (EFM).
☆20Updated 5 years ago
Alternatives and similar repositories for CDF-workshop:
Users that are interested in CDF-workshop are comparing it to the libraries listed below
- ☆28Updated last year
- HDF masterclass materials☆28Updated 9 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆62Updated last year
- ☆27Updated 2 months ago
- Edge2AI Workshop☆69Updated 2 months ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated this week
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- ☆32Updated 6 years ago
- ☆10Updated 2 years ago
- A general purpose framework for automating Cloudera Products☆66Updated 3 weeks ago
- CDF Tech Bootcamp☆9Updated 5 years ago
- A Spark datasource for the HadoopOffice library☆38Updated 2 years ago
- Terraform / NiFi on the Google Cloud Platform☆28Updated 4 months ago
- Hadoop Data Pipeline using Falcon☆15Updated 8 years ago
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 4 years ago
- Single view demo☆14Updated 9 years ago
- Collection of examples integrating NiFi with stream process frameworks.☆58Updated 8 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- An Ansible collection for lifecycle and management of Cloudera CDP Private Cloud resources on bare metal, IaaS, and PaaS.☆33Updated last week
- Multi-stage, config driven, SQL based ETL framework using PySpark☆25Updated 5 years ago
- Examples for High Performance Spark☆15Updated 5 months ago
- An Azure Function which allows Azure Data Factory (ADF) to connect to Snowflake in a flexible way.☆26Updated last year
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Apache Spark ETL Utilities☆40Updated 5 months ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆12Updated 3 months ago
- Code snippets used in demos recorded for the blog.☆30Updated this week
- Profiles the data, validates the schema and runs data quality checks and produces a report☆20Updated 5 years ago