charlesb / CDF-workshop
Using Hortonworks HDP 3.1.0 and HDF 3.4.0 components, this tutorial walks through streaming data from a REST API into a live dashboard with NiFi, Kafka, Hive LLAP with Druid integration, and Superset. The workshop also covers remotely managing MiNiFi with Edge Flow Manager (EFM) so that it sends data to NiFi.
☆19 · Updated 6 years ago
Alternatives and similar repositories for CDF-workshop
Users who are interested in CDF-workshop are comparing it to the repositories listed below.
- ☆27 · Updated 2 years ago
- Edge2AI Workshop ☆70 · Updated 7 months ago
- Apache Spark Connector for SQL Server and Azure SQL ☆287 · Updated 11 months ago
- An Azure Function which allows Azure Data Factory (ADF) to connect to Snowflake in a flexible way. ☆26 · Updated 2 years ago
- ☆32 · Updated 6 years ago
- TPCDS benchmark for various engines ☆18 · Updated 3 years ago
- A collection of templates for use with Apache NiFi. ☆278 · Updated 9 years ago
- Delta Lake examples ☆237 · Updated last year
- dbt adapter for Azure Synapse Dedicated SQL Pools ☆76 · Updated 5 months ago
- The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the … ☆46 · Updated 6 months ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆80 · Updated last week
- DataQuality for BigData ☆147 · Updated 2 years ago
- EverythingApacheNiFi ☆116 · Updated 2 years ago
- Testing framework for Databricks notebooks ☆314 · Updated last year
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs ☆237 · Updated 11 months ago
- MonitoFi: Health & Performance Monitor for your Apache NiFi ☆68 · Updated 2 years ago
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud… ☆29 · Updated 2 weeks ago
- A general purpose framework for automating Cloudera Products ☆69 · Updated 10 months ago
- Multiple node presto cluster on docker container ☆126 · Updated 3 years ago
- Example code for doing DataOps ☆49 · Updated 5 years ago
- The Internals of Spark on Kubernetes ☆72 · Updated 3 years ago
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is … ☆37 · Updated 11 months ago
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset ☆111 · Updated 2 years ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆279 · Updated 3 months ago
- This Apache Atlas is built from the latest release source tarball and patched to be run in a Docker container. ☆143 · Updated 2 years ago
- A proof of concept of how to integrate Spark Lineage in Azure Purview ☆21 · Updated 4 years ago
- Spark and Delta Lake Workshop ☆22 · Updated 3 years ago
- ☆24 · Updated 2 years ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows ☆45 · Updated last week
- Delta Lake Documentation ☆53 · Updated last year