charlesb / CDF-workshopLinks
Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API into a live dashboard using NiFi, Kafka, Hive LLAP with Druid integration and Superset. This workshop will also cover steps to remotely manage MiNiFi to send data to NiFi using Edge Flow Manager (EFM).
☆19Updated 5 years ago
Alternatives and similar repositories for CDF-workshop
Users that are interested in CDF-workshop are comparing it to the libraries listed below
Sorting:
- Edge2AI Workshop☆70Updated last month
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆27Updated this week
- ☆27Updated last year
- Databricks Platform - Architecture, Security, Automation and much more!!☆51Updated last month
- An Azure Function which allows Azure Data Factory (ADF) to connect to Snowflake in a flexible way.☆26Updated 2 years ago
- Spark and Delta Lake Workshop☆22Updated 3 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated last week
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is …☆36Updated 6 months ago
- Kafka sink for Kusto☆50Updated last week
- ☆22Updated 2 years ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi☆66Updated 2 years ago
- A simple Spark-powered ETL framework that just works 🍺☆182Updated last week
- Delta Lake Documentation☆49Updated last year
- EverythingApacheNiFi☆113Updated last year
- HDF masterclass materials☆28Updated 9 years ago
- Testing framework for Databricks notebooks☆306Updated last year
- SQL Queries & Alerts for Databricks System Tables access.audit Logs☆33Updated 2 weeks ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- ☆8Updated 6 years ago
- How do to CI/CD with Azure Data Factory☆41Updated 4 years ago
- How DevOps principles can be applied to Data Pipeline Solution built with Azure Databricks, Data Factory and ADL Gen2. Moved to: https://…☆61Updated 9 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated last week
- Examples of metadata driven SQL processes implemented in Databricks☆16Updated 4 years ago
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset☆109Updated 2 years ago
- Collection of Databricks and Jupyter Notebooks☆22Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Updated 5 years ago
- Delta Lake examples☆227Updated 10 months ago
- dbt adapter for Azure Synapse Dedicated SQL Pools☆73Updated 3 weeks ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the …☆45Updated 3 weeks ago